Quick Patent Search - Fast search of global patents, free patent database for commercial use - IPRDB

1. Patent application

US20080300857A1 METHOD FOR ALIGNING SENTENCES AT THE WORD LEVEL ENFORCING SELECTIVE CONTIGUITY CONSTRAINTS (in force)
Publication number: US20080300857A1
Publication date: 2008-12-04
Application number: US11756684
Filing date: 2007-06-01
Applicants: Madalina Barbaiani, Nicola Cancedda, Christopher R. Dance, Szilard Zsolt Fazekas, Tamas Gaal, Eric Gaussier
Inventors: Madalina Barbaiani, Nicola Cancedda, Christopher R. Dance, Szilard Zsolt Fazekas, Tamas Gaal, Eric Gaussier
IPC classification: G06F17/28
CPC classification: G06F17/2827
Abstract: An alignment method includes, for a source sentence in a source language, identifying whether the sentence includes at least one candidate term comprising a contiguous subsequence of words of the source sentence. A target sentence in a target language is aligned with the source sentence. This includes developing a probabilistic model which models conditional probability distributions for alignments between words of the source sentence and words of the target sentence and generating an optimal alignment based on the probabilistic model, including, where the source sentence includes the at least one candidate term, enforcing a contiguity constraint which requires that all the words of the target sentence which are aligned with an identified candidate term form a contiguous subsequence of the target sentence.
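
The contiguity constraint described in this abstract can be illustrated with a small check. This is only a sketch under stated assumptions: the alignment is represented as a set of (source index, target index) pairs, the candidate term as a half-open range of source positions, and the function name `satisfies_contiguity` is hypothetical. It tests a given alignment; it does not reproduce the probabilistic model or the search for an optimal alignment that the patent describes.

```python
def satisfies_contiguity(alignment, term_span):
    """Return True if the target words aligned to the source term are contiguous.

    alignment: iterable of (source_index, target_index) pairs (assumed format).
    term_span: (start, end) source positions of the candidate term, end exclusive.
    """
    start, end = term_span
    # Collect the distinct target positions linked to any word of the term.
    targets = sorted({t for s, t in alignment if start <= s < end})
    if not targets:
        return True  # nothing is aligned to the term, so nothing is violated
    # Contiguous means the aligned target positions form one unbroken run.
    return targets[-1] - targets[0] + 1 == len(targets)


# The term covers source positions 1 and 2 and aligns to target positions 2 and 3.
ok = satisfies_contiguity({(0, 0), (1, 2), (2, 3), (3, 1)}, (1, 3))
# Here the term aligns to target positions 0 and 2, leaving position 1 in between.
bad = satisfies_contiguity({(0, 1), (1, 0), (2, 2)}, (1, 3))
```

An aligner could use a check of this kind to reject or penalize candidate alignments that split a term's translation across separate parts of the target sentence.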

2. Granted patent

US09020804B2 Method for aligning sentences at the word level enforcing selective contiguity constraints (in force)
Publication number: US09020804B2
Publication date: 2015-04-28
Application number: US11756684
Filing date: 2007-06-01
Applicants: Madalina Barbaiani, Nicola Cancedda, Christopher R. Dance, Szilárd Zsolt Fazekas, Tamás Gaál, Eric Gaussier
Inventors: Madalina Barbaiani, Nicola Cancedda, Christopher R. Dance, Szilárd Zsolt Fazekas, Tamás Gaál, Eric Gaussier
IPC classification: G06F17/28
CPC classification: G06F17/2827
Abstract: An alignment method includes, for a source sentence in a source language, identifying whether the sentence includes at least one candidate term comprising a contiguous subsequence of words of the source sentence. A target sentence in a target language is aligned with the source sentence. This includes developing a probabilistic model which models conditional probability distributions for alignments between words of the source sentence and words of the target sentence and generating an optimal alignment based on the probabilistic model, including, where the source sentence includes the at least one candidate term, enforcing a contiguity constraint which requires that all the words of the target sentence which are aligned with an identified candidate term form a contiguous subsequence of the target sentence.

3. Granted patent

US07536295B2 Machine translation using non-contiguous fragments of text (lapsed)
Publication number: US07536295B2
Publication date: 2009-05-19
Application number: US11315043
Filing date: 2005-12-22
Applicants: Nicola Cancedda, Bruno Cavestro, Marc Dymetman, Eric Gaussier, Cyril Goutte, Michel Simard, Kenji Yamada
Inventors: Nicola Cancedda, Bruno Cavestro, Marc Dymetman, Eric Gaussier, Cyril Goutte, Michel Simard, Kenji Yamada
IPC classification: G06F17/28
CPC classification: G06F17/2827
Abstract: A machine translation method for translating source text from a first language to target text in a second language includes receiving the source text in the first language and accessing a library of bi-fragments, each of the bi-fragments including a text fragment from the first language and a text fragment from the second language, at least some of the bi-fragments comprising non-contiguous bi-fragments in which at least one of the text fragment from the first language and the text fragment from the second language comprises a non-contiguous fragment.
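
To make the idea of a non-contiguous fragment concrete, the sketch below matches a fragment with a gap against a tokenized sentence. Everything here is an illustrative assumption rather than the patented method: the `GAP` marker, the tuple representation, the function name `find_matches`, and the choice that a gap spans one or more words. The fragment is also assumed to start with a word, not a gap.

```python
GAP = None  # hypothetical marker: a gap of one or more words


def find_matches(fragment, sentence):
    """Yield tuples of sentence positions covered by the fragment's words."""

    def extend(f_idx, s_idx, adjacent, covered):
        if f_idx == len(fragment):
            yield tuple(covered)
            return
        if fragment[f_idx] is GAP:
            # Skip at least one word, then let the next word float freely.
            yield from extend(f_idx + 1, s_idx + 1, False, covered)
            return
        # Without a preceding gap, the next word must sit right at s_idx.
        candidates = [s_idx] if adjacent else range(s_idx, len(sentence))
        for pos in candidates:
            if pos < len(sentence) and sentence[pos] == fragment[f_idx]:
                yield from extend(f_idx + 1, pos + 1, True, covered + [pos])

    for start in range(len(sentence)):
        if sentence[start] == fragment[0]:
            yield from extend(1, start + 1, True, [start])


sentence = "je ne veux plus danser".split()
gapped = list(find_matches(("ne", GAP, "plus"), sentence))
contiguous = list(find_matches(("veux", "plus"), sentence))
```

In a bi-fragment library, a source-side fragment such as `("ne", GAP, "plus")` would be paired with a target-side fragment that may itself contain gaps; the pairing and the decoding step are not shown here.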

4. Granted patent

US07542893B2 Machine translation using elastic chunks (lapsed)
Publication number: US07542893B2
Publication date: 2009-06-02
Application number: US11431393
Filing date: 2006-05-10
Applicants: Nicola Cancedda, Marc Dymetman, Eric Gaussier, Cyril Goutte
Inventors: Nicola Cancedda, Marc Dymetman, Eric Gaussier, Cyril Goutte
IPC classification: G06F17/28
CPC classification: G06F17/2818
Abstract: A machine translation method includes receiving source text in a first language and retrieving text fragments in a target language from a library of bi-fragments to generate a target hypothesis. Each bi-fragment includes a text fragment from the first language and a corresponding text fragment from the second language. Some of the bi-fragments are modeled as elastic bi-fragments where a gap between words is able to assume a variable size corresponding to a number of other words to occupy the gap. The target hypothesis is evaluated with a translation scoring function which scores the target hypothesis according to a plurality of feature functions, at least one of the feature functions comprising a gap size scoring feature which favors hypotheses with statistically more probable gap sizes over hypotheses with statistically less probable gap sizes.
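
One simple way to picture a gap size scoring feature is as a log-probability under a distribution estimated from observed gap sizes. The sketch below assumes exactly that; the add-one smoothing, the `max_gap` cutoff, and the function names are illustrative choices and are not taken from the patent.

```python
import math
from collections import Counter


def gap_size_log_prob(observed_gaps, max_gap):
    """Estimate log P(gap size) for sizes 0..max_gap with add-one smoothing."""
    counts = Counter(observed_gaps)
    total = len(observed_gaps) + (max_gap + 1)
    return {g: math.log((counts[g] + 1) / total) for g in range(max_gap + 1)}


def gap_feature(hypothesis_gaps, log_prob):
    """Sum the log-probabilities of the gap sizes used in one hypothesis."""
    return sum(log_prob[g] for g in hypothesis_gaps)


# Gap sizes seen for one elastic bi-fragment in hypothetical training data.
log_prob = gap_size_log_prob([1, 1, 1, 2, 1, 3], max_gap=4)
likely = gap_feature([1], log_prob)    # a frequently observed gap size
unlikely = gap_feature([4], log_prob)  # a gap size never observed
```

Combined with other feature functions in a weighted sum, a feature like this makes a hypothesis that uses the common gap size score higher than one that stretches the gap to a rarely seen size.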

5. Patent application

US20070265825A1 Machine translation using elastic chunks (lapsed)
Publication number: US20070265825A1
Publication date: 2007-11-15
Application number: US11431393
Filing date: 2006-05-10
Applicants: Nicola Cancedda, Marc Dymetman, Eric Gaussier, Cyril Goutte
Inventors: Nicola Cancedda, Marc Dymetman, Eric Gaussier, Cyril Goutte
IPC classification: G06F17/28
CPC classification: G06F17/2818
Abstract: A machine translation method includes receiving source text in a first language and retrieving text fragments in a target language from a library of bi-fragments to generate a target hypothesis. Each bi-fragment includes a text fragment from the first language and a corresponding text fragment from the second language. Some of the bi-fragments are modeled as elastic bi-fragments where a gap between words is able to assume a variable size corresponding to a number of other words to occupy the gap. The target hypothesis is evaluated with a translation scoring function which scores the target hypothesis according to a plurality of feature functions, at least one of the feature functions comprising a gap size scoring feature which favors hypotheses with statistically more probable gap sizes over hypotheses with statistically less probable gap sizes.

6. Patent application

US20070150257A1 Machine translation using non-contiguous fragments of text (lapsed)
Publication number: US20070150257A1
Publication date: 2007-06-28
Application number: US11315043
Filing date: 2005-12-22
Applicants: Nicola Cancedda, Bruno Cavestro, Marc Dymetman, Eric Gaussier, Cyril Goutte, Michel Simard, Kenji Yamada
Inventors: Nicola Cancedda, Bruno Cavestro, Marc Dymetman, Eric Gaussier, Cyril Goutte, Michel Simard, Kenji Yamada
IPC classification: G06F17/28
CPC classification: G06F17/2827
Abstract: A machine translation method for translating source text from a first language to target text in a second language includes receiving the source text in the first language and accessing a library of bi-fragments, each of the bi-fragments including a text fragment from the first language and a text fragment from the second language, at least some of the bi-fragments comprising non-contiguous bi-fragments in which at least one of the text fragment from the first language and the text fragment from the second language comprises a non-contiguous fragment.

7. Granted patent

US09552355B2 Dynamic bi-phrases for statistical machine translation (in force)
Publication number: US09552355B2
Publication date: 2017-01-24
Application number: US12784040
Filing date: 2010-05-20
Applicants: Marc Dymetman, Wilker Ferreira Aziz, Nicola Cancedda, Jean-Marc Coursimault, Vassilina Nikoulina, Lucia Specia
Inventors: Marc Dymetman, Wilker Ferreira Aziz, Nicola Cancedda, Jean-Marc Coursimault, Vassilina Nikoulina, Lucia Specia
IPC classification: G06F17/28
CPC classification: G06F17/2827, G06F17/2818
Abstract: A system and a method for phrase-based translation are disclosed. The method includes receiving source language text to be translated into target language text. One or more dynamic bi-phrases are generated, based on the source text and the application of one or more rules, which may be based on user descriptions. A dynamic feature value is associated with each of the dynamic bi-phrases. For a sentence of the source text, static bi-phrases are retrieved from a bi-phrase table, each of the static bi-phrases being associated with one or more values of static features. Any of the dynamic bi-phrases which each cover at least one word of the source text are also retrieved, which together form a set of active bi-phrases. Translation hypotheses are generated using active bi-phrases from the set and scored with a translation scoring model which takes into account the static and dynamic feature values of the bi-phrases used in the respective hypothesis. A translation, based on the hypothesis scores, is then output.

8. Granted patent

US08612205B2 Word alignment method and system for improved vocabulary coverage in statistical machine translation (in force)
Publication number: US08612205B2
Publication date: 2013-12-17
Application number: US12814657
Filing date: 2010-06-14
Applicants: Gregory Alan Hanneman, Nicola Cancedda, Marc Dymetman
Inventors: Gregory Alan Hanneman, Nicola Cancedda, Marc Dymetman
IPC classification: G06F17/28, G06F17/20, G06F17/27, G10L21/00, G06F17/30
CPC classification: G06F17/2827
Abstract: A system and method for generating word alignments from pairs of aligned text strings are provided. A corpus of text strings provides pairs of text strings, primarily sentences, in source and target languages. A first alignment between a text string pair creates links therebetween. Each link links a single token of the first text string to a single token of the second text string. A second alignment also creates links between the text string pair. In some cases, these links may correspond to bi-phrases. A modified first alignment is generated by selectively modifying links in the first alignment which include a word which is infrequent in the corpus, based on links generated in the second alignment. This results in removing at least some of the links for the infrequent words, allowing more compact and better quality bi-phrases, with higher vocabulary coverage, to be extracted for use in a machine translation system.

9. Patent application

US20130042108A1 PRIVATE ACCESS TO HASH TABLES (in force)
Publication number: US20130042108A1
Publication date: 2013-02-14
Application number: US13204894
Filing date: 2011-08-08
Applicant: Nicola Cancedda
Inventor: Nicola Cancedda
IPC classification: H04L9/32
CPC classification: G06F21/6227, G06F21/602, G06F21/606, G06F2221/2115, H04L9/0894, H04L9/321
Abstract: A server and a client mutually exclusively execute server-side and client-side commutative cryptographic processes and server-side and client-side commutative permutation processes. The server has access to a hash table, while the client does not. The server and client perform a method including: encrypting and reordering the hash table using the server; communicating the encrypted and reordered hash table to the client; further encrypting and further reordering the hash table using the client; communicating the further encrypted and further reordered hash table back to the server; and partially decrypting and partially undoing the reordering using the server to generate a double-blind hash table. To read an entry, the client hashes and permutes an index key and communicates it to the server, which retrieves an item from the double-blind hash table using the hashed and permuted index key and sends it back to the client, which decrypts the retrieved item.
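
The abstract relies on encryption steps that commute, so that the server can remove its own layer after the client has added a second one. The abstract does not name a cipher; the sketch below assumes modular exponentiation over a prime field, a classic commutative construction, purely to show the property. The prime, the key generation, and the function names are illustrative assumptions, and the code is a toy, not a secure implementation. The permutation steps are not shown.

```python
import math
import secrets

P = 2**127 - 1  # a Mersenne prime, chosen only for illustration


def keygen():
    """Pick an exponent that is invertible modulo P - 1."""
    while True:
        k = secrets.randbelow(P - 3) + 2
        if math.gcd(k, P - 1) == 1:
            return k


def encrypt(x, k):
    """Encrypt x (1 <= x < P) by raising it to the secret exponent k."""
    return pow(x, k, P)


def decrypt(y, k):
    """Undo encrypt() using the inverse exponent (needs Python 3.8+)."""
    return pow(y, pow(k, -1, P - 1), P)


server_key, client_key = keygen(), keygen()
item = 123456789

# Server encrypts first, then the client adds its own layer ...
double = encrypt(encrypt(item, server_key), client_key)
# ... and the result is the same as applying the layers in the other order.
swapped = encrypt(encrypt(item, client_key), server_key)

# The server strips its layer; only the client's layer remains.
client_only = decrypt(double, server_key)
recovered = decrypt(client_only, client_key)
```

Because the layers commute, the server can partially decrypt a doubly encrypted table without ever seeing the client's key, which is the property the double-blind hash table depends on.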

10. Granted patent

US07587307B2 Method and apparatus for evaluating machine translation quality (in force)
Publication number: US07587307B2
Publication date: 2009-09-08
Application number: US10737972
Filing date: 2003-12-18
Applicants: Nicola Cancedda, Kenji Yamada
Inventors: Nicola Cancedda, Kenji Yamada
IPC classification: G06F17/27, G06F17/28
CPC classification: G06F17/2854, G06F17/28, G06F17/3069
Abstract: Quality of machine translation of natural language is determined by computing a sequence kernel that provides a measure of similarity between a first sequence of symbols representing a machine translation in a target natural language and a second sequence of symbols representing a reference translation in the target natural language. The measure of similarity takes into account the existence of non-contiguous subsequences shared by the first sequence of symbols and the second sequence of symbols. When the similarity measure does not meet an acceptable threshold level, the translation model of the machine translator may be adjusted to improve subsequent translations performed by the machine translator.
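
A sequence kernel that credits shared non-contiguous subsequences can be sketched as follows. This assumes a gap-weighted subsequence kernel in which each occurrence of a length-n subsequence is weighted by a decay factor raised to the span it covers; the abstract does not fix these details, so the weighting, the parameter names `n` and `decay`, and the normalization are assumptions. The brute-force enumeration is meant for short sequences only.

```python
import math
from collections import defaultdict
from itertools import combinations


def subsequence_features(seq, n, decay):
    """Map each length-n subsequence of seq to its gap-penalized weight."""
    features = defaultdict(float)
    for idx in combinations(range(len(seq)), n):
        span = idx[-1] - idx[0] + 1  # wider spans (more gaps) weigh less
        features[tuple(seq[i] for i in idx)] += decay ** span
    return features


def sequence_kernel(a, b, n=2, decay=0.5):
    """Inner product of the two subsequence feature vectors."""
    fa = subsequence_features(a, n, decay)
    fb = subsequence_features(b, n, decay)
    return sum(w * fb[u] for u, w in fa.items() if u in fb)


def similarity(a, b, n=2, decay=0.5):
    """Kernel normalized so that a sequence compared with itself scores 1."""
    denom = math.sqrt(sequence_kernel(a, a, n, decay) * sequence_kernel(b, b, n, decay))
    return sequence_kernel(a, b, n, decay) / denom if denom else 0.0


machine = "the cat sat on the mat".split()
reference = "the cat is on the mat".split()
score = similarity(machine, reference)
toy = sequence_kernel(list("cat"), list("car"), n=2, decay=0.5)
```

For the toy pair "cat" and "car", the only shared length-2 subsequence is "ca", which spans two symbols in each string, so the kernel value is decay to the fourth power. A score below some chosen threshold would be the signal, in the abstract's terms, that the translation model may need adjusting.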
