会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 5. 发明申请
    • Collocation translation from monolingual and available bilingual corpora
    • 单语和双语语料库的翻译
    • US20060282255A1
    • 2006-12-14
    • US11152540
    • 2005-06-14
    • Yajuan LuJianfeng GaoMing ZhouJohn ChenMu Li
    • Yajuan LuJianfeng GaoMing ZhouJohn ChenMu Li
    • G06F17/28
    • G06F17/2827
    • A system and method of extracting collocation translations is presented. The methods include constructing a collocation translation model using monolingual source and target language corpora as well as bilingual corpus, if available. The collocation translation model employs an expectation maximization algorithm with respect to contextual words surrounding collocations. The collocation translation model can be used later to extract a collocation translation dictionary. Optional filters based on context redundancy and/or bi-directional translation constrain can be used to ensure that only highly reliable collocation translations are included in the dictionary. The constructed collocation translation model and the extracted collocation translation dictionary can be used later for further natural language processing, such as sentence translation.
    • 提出了一种提取搭配翻译的系统和方法。 这些方法包括使用单语源语言和目标语言语料库以及双语语料库(如果可用)来构建搭配翻译模型。 搭配翻译模型采用围绕搭配的上下文单词的期望最大化算法。 搭配翻译模型可以随后用于提取搭配翻译字典。 可以使用基于上下文冗余和/或双向转换约束的可选过滤器来确保字典中仅包含高度可靠的并置转换。 构建的搭配翻译模型和提取的搭配翻译词典可以稍后用于进一步的自然语言处理,如句子翻译。
    • 6. 发明申请
    • Statistical machine translation processing
    • 统计机器翻译处理
    • US20120022850A1
    • 2012-01-26
    • US13250417
    • 2011-09-30
    • Chi-Ho LiMu LiDongdong ZhangMing Zhou
    • Chi-Ho LiMu LiDongdong ZhangMing Zhou
    • G06F17/28
    • G06F17/2818
    • A method of statistical machine translation (SMT) is provided. The method comprises generating reordering knowledge based on the syntax of a source language (SL) and a number of alignment matrices that map sample SL sentences with sample target language (TL) sentences. The method further comprises receiving a SL word string and parsing the SL word string into a parse tree that represents the syntactic properties of the SL word string. The nodes on the parse tree are reordered based on the generated reordering knowledge in order to provide reordered word strings. The method further comprises translating a number of reordered word strings to create a number of TL word strings, and identifying a statistically preferred TL word string as a preferred translation of the SL word string.
    • 提供了统计机器翻译(SMT)的方法。 该方法包括基于源语言(SL)的语法和将样本SL语句与样本目标语言(TL)语句对齐的多个对齐矩阵来生成重排序知识。 该方法还包括接收SL字串并将SL字串解析成表示SL字串的句法属性的解析树。 基于所生成的重新排序知识来重新排序解析树上的节点,以提供重新排序的字串。 该方法还包括翻译多个重新排序的字串以创建多个TL字串,并且将统计上优选的TL字串识别为SL字串的优选翻译。
    • 9. 发明申请
    • Statistical machine translation processing
    • 统计机器翻译处理
    • US20090106015A1
    • 2009-04-23
    • US11977133
    • 2007-10-23
    • Chi-Ho LiMu LiDongdong ZhangMing Zhou
    • Chi-Ho LiMu LiDongdong ZhangMing Zhou
    • G06F17/28
    • G06F17/2818
    • A method of statistical machine translation (SMT) is provided. The method comprises generating reordering knowledge based on the syntax of a source language (SL) and a number of alignment matrices that map sample SL sentences with sample target language (TL) sentences. The method further comprises receiving a SL word string and parsing the SL word string into a parse tree that represents the syntactic properties of the SL word string. The nodes on the parse tree are reordered based on the generated reordering knowledge in order to provide reordered word strings. The method further comprises translating a number of reordered word strings to create a number of TL word strings, and identifying a statistically preferred TL word string as a preferred translation of the SL word string.
    • 提供了统计机器翻译(SMT)的方法。 该方法包括基于源语言(SL)的语法和将样本SL语句与样本目标语言(TL)语句对齐的多个对齐矩阵来生成重新排序知识。 该方法还包括接收SL字串并将SL字串解析成表示SL字串的句法属性的解析树。 基于所生成的重新排序知识来重新排序解析树上的节点,以提供重新排序的字串。 该方法还包括翻译多个重新排序的字串以创建多个TL字串,并且将统计上优选的TL字串识别为SL字串的优选翻译。
    • 10. 发明授权
    • Post-processing system and method for correcting machine recognized text
    • 用于校正机器识别文本的后处理系统和方法
    • US07092567B2
    • 2006-08-15
    • US10288645
    • 2002-11-04
    • Yue MaJinhong Katherine GuoMu LiYu-kun TongTian-shun YaoJing-bo Zhu
    • Yue MaJinhong Katherine GuoMu LiYu-kun TongTian-shun YaoJing-bo Zhu
    • G06K9/34G06K9/72G06K9/03G06F17/27
    • G06K9/723G06K2209/01
    • A method of post-processing character data from an optical character recognition (OCR) engine and apparatus to perform the method. This exemplary method includes segmenting the character data into a set of initial words. The set of initial words is word level processed to determine at least one candidate word corresponding to each initial word. The set of initial words is segmented into a set of sentences. Each sentence in the set of sentences includes a plurality of initial words and candidate words corresponding to the initial words. A sentence is selected from the set of sentences. The selected sentence is word disambiguity processed to determine a plurality of final words. A final word is selected from the at least one candidate word corresponding to a matching initial word. The plurality of final words is then assembled as post-processed OCR data.
    • 一种后处理来自光学字符识别(OCR)引擎和装置的字符数据的方法。 该示例性方法包括将字符数据分割成一组初始字。 初始字的集合被处理为字处理以确定与每个初始字对应的至少一个候选字。 该组初始单词被分割成一组句子。 该组句子中的每个句子包括与初始词对应的多个初始词和候选词。 从一组句子中选出一个句子。 所选择的句子是处理的词消除歧义以确定多个最终词。 从对应于匹配的初始字的至少一个候选字中选择最终字。 然后将多个最终单词组装为后处理OCR数据。