会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明申请
    • Apparatus and methods for aligning words in bilingual sentences
    • 双语句子对齐词的装置和方法
    • US20060190241A1
    • 2006-08-24
    • US11137590
    • 2005-05-26
    • Cyril GoutteMichel SimardKenji YamadaEric GaussierArne Mauser
    • Cyril GoutteMichel SimardKenji YamadaEric GaussierArne Mauser
    • G06F17/28
    • G06F17/2827
    • Methods are disclosed for performing proper word alignment that satisfy constraints of coverage and transitive closure. Initially, a translation matrix which defines word association measures between source and target words of a corpus of bilingual translations of source and target sentences is computed. Subsequently, in a first method, the association measures in the translation matrix are factorized and orthogonalized to produce cepts for the source and target words, which resulting matrix factors may then be, optionally, multiplied to produce an alignment matrix. In a second method, the association measures in the translation matrix are thresholded, and then closed by transitivity, to produce an alignment matrix, which may then be, optionally, factorized to produce cepts. The resulting cepts or alignment matrices may then be used by any number of natural language applications for identifying words that are properly aligned.
    • 公开了用于执行满足覆盖和传递闭包约束的适当字对齐的方法。 最初,计算了定义源语句和目标语句双语翻译语料库的源词和目标词之间的词关联度量的翻译矩阵。 随后,在第一种方法中,翻译矩阵中的关联度量被分解和正交化以产生源词和目标词的尖叫,所得到的矩阵因子然后可以被乘以以产生对齐矩阵。 在第二种方法中,翻译矩阵中的关联度量被阈值化,然后由传递性闭合,以产生对准矩阵,其可以随后被分解以产生尖叫。 所得到的尖叫或对齐矩阵然后可以被任何数量的自然语言应用程序用于识别正确对准的单词。
    • 6. 发明授权
    • Method for multi-class, multi-label categorization using probabilistic hierarchical modeling
    • 使用概率分层建模的多类,多标签分类方法
    • US07139754B2
    • 2006-11-21
    • US10774966
    • 2004-02-09
    • Cyril GoutteEric Gaussier
    • Cyril GoutteEric Gaussier
    • G06F17/30
    • G06F17/30707Y10S707/99933Y10S707/99934Y10S707/99935Y10S707/99936
    • A method of categorizing objects in which there can be multiple categories of objects and each object can belong to more than one category is described. The method defines a set of categories in which at least one category is dependent on another category and then organizes the categories in a hierarchy that embodies any dependencies among them. Each object is assigned to one or more categories in the set. A set of labels corresponding to all combinations of any number of the categories is defined, wherein if an object is relevant to several categories, the object must be assigned the label corresponding to the subset of all relevant categories. Once the new labels are defined, the multi-category, multi-label problem has been reduced to a multi-category, single-label problem, and the categorization task is reduced down to choosing the single best label set for an object.
    • 描述了可以存在多个类别的对象和每个对象可以属于多于一个类别的对象的分类方法。 该方法定义了一组类别,其中至少一个类别依赖于另一个类别,然后组织在体现其中的任何依赖关系的层次结构中的类别。 每个对象被分配到集合中的一个或多个类别。 定义对应于任何数量的类别的所有组合的一组标签,其中如果对象与若干类别相关,则该对象必须被分配与所有相关类别的子集相对应的标签。 一旦定义了新标签,多类别,多标签问题已经被减少到多类别的单标签问题,并且分类任务减少到为对象选择单个最佳标签集。
    • 7. 发明授权
    • Apparatus and methods for aligning words in bilingual sentences
    • 双语句子对齐词的装置和方法
    • US07672830B2
    • 2010-03-02
    • US11137590
    • 2005-05-26
    • Cyril GoutteMichel SimardKenji YamadaEric GaussierArne Mauser
    • Cyril GoutteMichel SimardKenji YamadaEric GaussierArne Mauser
    • G06F17/28G06F17/27
    • G06F17/2827
    • Methods are disclosed for performing proper word alignment that satisfy constraints of coverage and transitive closure. Initially, a translation matrix which defines word association measures between source and target words of a corpus of bilingual translations of source and target sentences is computed. Subsequently, in a first method, the association measures in the translation matrix are factorized and orthogonalized to produce cepts for the source and target words, which resulting matrix factors may then be, optionally, multiplied to produce an alignment matrix. In a second method, the association measures in the translation matrix are thresholded, and then closed by transitivity, to produce an alignment matrix, which may then be, optionally, factorized to produce cepts. The resulting cepts or alignment matrices may then be used by any number of natural language applications for identifying words that are properly aligned.
    • 公开了用于执行满足覆盖和传递闭包约束的适当字对齐的方法。 最初,计算了定义源语句和目标语句双语翻译语料库的源词和目标词之间的词关联度量的翻译矩阵。 随后,在第一种方法中,翻译矩阵中的关联度量被分解和正交化以产生源词和目标词的尖叫,所得到的矩阵因子然后可以被乘以以产生对齐矩阵。 在第二种方法中,翻译矩阵中的关联度量被阈值化,然后由传递性闭合,以产生对准矩阵,其可以随后被分解以产生尖叫。 所得到的尖叫或对齐矩阵然后可以被任何数量的自然语言应用程序用于识别正确对准的单词。
    • 8. 发明授权
    • Machine translation using elastic chunks
    • 机械翻译使用弹性块
    • US07542893B2
    • 2009-06-02
    • US11431393
    • 2006-05-10
    • Nicola CanceddaMarc DymetmanEric GaussierCyril Goutte
    • Nicola CanceddaMarc DymetmanEric GaussierCyril Goutte
    • G06F17/28
    • G06F17/2818
    • A machine translation method includes receiving source text in a first language and retrieving text fragments in a target language from a library of bi-fragments to generate a target hypothesis. Each bi-fragment includes a text fragment from the first language and a corresponding text fragment from the second language. Some of the bi-fragments are modeled as elastic bi-fragments where a gap between words is able to assume a variable size corresponding to a number of other words to occupy the gap. The target hypothesis is evaluated with a translation scoring function which scores the target hypothesis according to a plurality of feature functions, at least one of the feature functions comprising a gap size scoring feature which favors hypotheses with statistically more probable gap sizes over hypotheses with statically less probable gap sizes.
    • 机器翻译方法包括以第一语言接收源文本并且从双片段的库中检索目标语言中的文本片段以生成目标假设。 每个双片段包括来自第一语言的文本片段和来自第二语言的相应文本片段。 一些双片段被建模为弹性双片段,其中词之间的间隙能够采用与多个其他单词相对应的可变大小来占据间隙。 目标假设用翻译评分函数评估,其根据多个特征函数对目标假设进行评分,特征函数中的至少一个包括间隙大小评分特征,其有利于具有统计学上更可能的间隔大小超过假设的假设,具有静态较小 可能的间隙大小。
    • 10. 发明申请
    • Method for multi-class, multi-label categorization using probabilistic hierarchical modeling
    • 使用概率分层建模的多类,多标签分类方法
    • US20050187892A1
    • 2005-08-25
    • US10774966
    • 2004-02-09
    • Cyril GoutteEric Gaussier
    • Cyril GoutteEric Gaussier
    • G06F7/00G06F17/30
    • G06F17/30707Y10S707/99933Y10S707/99934Y10S707/99935Y10S707/99936
    • A method for categorizing a set of objects includes defining a set of categories in which at least one category in the set is dependent on another category in the set; organizing the set of categories in a hierarchy that embodies any dependencies among the categories in the set; for each object, assigning to the object one or more categories l1 . . . lP where liε{1 . . . L} from a set {1 . . . L} of possible categories, wherein the assigned categories represent a subset of categories for which the object is relevant; defining a new set of labels z comprising all possible combinations of any number of the categories, zε{{1},{2}, . . . {L},{1,2}, . . . {1,L},{2,3}, . . . {1,2,3}, . . . {1,2, . . . L}}, such that if an object is relevant to several categories, the object must be assigned the label z corresponding to the subset of all relevant categories; and assigning to the object the several categories and the subcategories of the several categories.
    • 用于对一组对象进行分类的方法包括定义一组类别,其中集合中的至少一个类别依赖于集合中的另一类别; 在层次结构中组织一组体现集合中类别之间依赖关系的类别; 对于每个对象,向对象分配一个或多个类别l 1 。 。 。 其中,1≤ε≤1。 。 。 L}从集合{1。 。 。 L}的可能类别,其中所分配的类别表示对象对应的类别的子集; 定义一组新的标签z,其包括任何数量的类别的所有可能组合,zepsilon {{1},{2},...。 。 。 {L},{1,2},。 。 。 {1,L},{2,3},。 。 。 {1,2,3},。 。 。 {1,2,。 。 。 L}},使得如果对象与几个类别相关,则必须向对象分配与所有相关类别的子集相对应的标签z; 并向对象分配几个类别和几个类别的子类别。