会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明授权
    • Dynamic bi-phrases for statistical machine translation
    • 用于统计机器翻译的动态双语短语
    • US09552355B2
    • 2017-01-24
    • US12784040
    • 2010-05-20
    • Marc DymetmanWilker Ferreira AzizNicola CanceddaJean-Marc CoursimaultVassilina NikoulinaLucia Specia
    • Marc DymetmanWilker Ferreira AzizNicola CanceddaJean-Marc CoursimaultVassilina NikoulinaLucia Specia
    • G06F17/28
    • G06F17/2827G06F17/2818
    • A system and a method for phrase-based translation are disclosed. The method includes receiving source language text to be translated into target language text. One or more dynamic bi-phrases are generated, based on the source text and the application of one or more rules, which may be based on user descriptions. A dynamic feature value is associated with each of the dynamic bi-phrases. For a sentence of the source text, static bi-phrases are retrieved from a bi-phrase table, each of the static bi-phrases being associated with one or more values of static features. Any of the dynamic bi-phrases which each cover at least one word of the source text are also retrieved, which together form a set of active bi-phrases. Translation hypotheses are generated using active bi-phrases from the set and scored with a translation scoring model which takes into account the static and dynamic feature values of the bi-phrases used in the respective hypothesis. A translation, based on the hypothesis scores, is then output.
    • 公开了一种用于基于短语的翻译的系统和方法。 该方法包括接收要翻译成目标语言文本的源语言文本。 基于可以基于用户描述的一个或多个规则的源文本和应用来生成一个或多个动态双词组。 动态特征值与每个动态双词组相关联。 对于源文本的句子,从双词表检索静态双词组,每个静态双词组与一个或多个静态特征值相关联。 还检索了每个覆盖源文本的至少一个单词的动态双词组,其一起形成一组活动双词组。 使用来自集合的活动双词组产生翻译假设,并用考虑到各自假设中使用的双词组的静态和动态特征值的翻译评分模型进行评分。 然后输出基于假设分数的翻译。
    • 4. 发明授权
    • Word alignment method and system for improved vocabulary coverage in statistical machine translation
    • 字对齐方法和系统,用于改进统计机器翻译中的词汇覆盖
    • US08612205B2
    • 2013-12-17
    • US12814657
    • 2010-06-14
    • Gregory Alan HannemanNicola CanceddaMarc Dymetman
    • Gregory Alan HannemanNicola CanceddaMarc Dymetman
    • G06F17/28G06F17/20G06F17/27G10L21/00G06F17/30
    • G06F17/2827
    • A system and method for generating word alignments from pairs of aligned text strings are provided. A corpus of text strings provides pairs of text strings, primarily sentences, in source and target languages. A first alignment between a text string pair creates links therebetween. Each link links a single token of the first text string to a single token of the second text string. A second alignment also creates links between the text string pair. In some cases, these links may correspond to bi-phrases. A modified first alignment is generated by selectively modifying links in the first alignment which include a word which is infrequent in the corpus, based on links generated in the second alignment. This results in removing at least some of the links for the infrequent words, allowing more compact and better quality bi-phrases, with higher vocabulary coverage, to be extracted for use in a machine translation system.
    • 提供了用于从对齐的文本串对中生成字对齐的系统和方法。 文本字符串的语料库以源语言和目标语言提供了一对文本字符串,主要是句子。 文本串对之间的第一对齐在其间创建链接。 每个链接将第一个文本字符串的单个标记链接到第二个文本字符串的单个标记。 第二个对齐也创建文本串对之间的链接。 在某些情况下,这些链接可能对应于双语短语。 通过基于在第二对准中产生的链接,通过选择性地修改第一对齐中的链接来生成修改的第一对准,该链接包括语料库中不频繁的单词。 这导致删除不频繁的单词的至少一些链接,允许提取具有较高词汇覆盖率的更紧凑和更好质量的双语短语以用于机器翻译系统。
    • 5. 发明申请
    • PRIVATE ACCESS TO HASH TABLES
    • 私人访问HASH表
    • US20130042108A1
    • 2013-02-14
    • US13204894
    • 2011-08-08
    • Nicola Cancedda
    • Nicola Cancedda
    • H04L9/32
    • G06F21/6227G06F21/602G06F21/606G06F2221/2115H04L9/0894H04L9/321
    • A server and a client mutually exclusively execute server-side and client-side commutative cryptographic processes and server-side and client-side commutative permutation processes. The server has access to a hash table, while the client does not. The server and client perform a method including: encrypting and reordering the hash table using the server; communicating the encrypted and reordered hash table to the client; further encrypting and further reordering the hash table using the client; communicating the further encrypted and further reordered hash table back to the server; and partially decrypting and partially undoing the reordering using the server to generate a double-blind hash table. To read an entry, the client hashes and permute an index key and communicates same to the server which retrieves an item from the double-blind hash table using the hashed and permuted index key and sends it back to the client which decrypts the retrieved item.
    • 服务器和客户端互斥地执行服务器端和客户端交换密码过程以及服务器端和客户端交换排列过程。 服务器可以访问哈希表,而客户端则不能访问。 服务器和客户机执行一种方法,包括:使用服务器对散列表进行加密和重新排序; 将加密和重新排序的哈希表传送给客户端; 使用客户端进一步加密和进一步重新排序哈希表; 将进一步加密和进一步重新排序的哈希表传送回服务器; 并使用服务器部分地解密并部分地解除重排序以生成双盲哈希表。 为了读取条目,客户端将使用散列和置换的索引关键字将索引关键字进行散列并置换索引关键字并将其从使用双盲散列表检索项目的服务器进行通信,并将其发送回解密检索项目的客户端。
    • 6. 发明授权
    • Method and apparatus for evaluating machine translation quality
    • 评估机器翻译质量的方法和装置
    • US07587307B2
    • 2009-09-08
    • US10737972
    • 2003-12-18
    • Nicola CanceddaKenji Yamada
    • Nicola CanceddaKenji Yamada
    • G06F17/27G06F17/28
    • G06F17/2854G06F17/28G06F17/3069
    • Quality of machine translation of natural language is determined by computing a sequence kernel that provides a measure of similarity between a first sequence of symbols representing a machine translation in a target natural language and a second sequence of symbols representing a reference translation in the target natural language. The measure of similarity takes into account the existence of non-contiguous subsequences shared by the first sequence of symbols and the second sequence of symbols. When the similarity measure does not meet an acceptable threshold level, the translation model of the machine translator may be adjusted to improve subsequent translations performed by the machine translator.
    • 自然语言的机器翻译的质量通过计算提供表示目标自然语言中的机器翻译的第一符号序列与表示目标自然语言中的参考翻译的第二符号序列之间的相似度的度量来确定 。 相似性的度量考虑了由第一符号序列和第二符号序列共享的非连续子序列的存在。 当相似性度量不满足可接受的阈值水平时,可以调整机器翻译器的翻译模型,以改善由机器翻译器执行的后续翻译。
    • 8. 发明授权
    • Methods and systems for securely accessing translation resource manager
    • 安全访问翻译资源管理器的方法和系统
    • US09213696B2
    • 2015-12-15
    • US13590372
    • 2012-08-21
    • Nicola Cancedda
    • Nicola Cancedda
    • G06F17/28H04L9/32H04L9/06H04L9/08
    • G06F17/289H04L9/0643H04L9/0822H04L9/3236
    • The disclosed embodiments relate to systems and methods for securely accessing a phrase table. One or more records in the phrase table are encrypted using a first set of keys. The first set of keys is encrypted using a second key. A decoder module is compiled based on the second key. Thereafter, the one or more encrypted records and/or the decoder module are transmitted to the first computing device at the client side. The first set of encrypted keys is transmitted to a second computing device. The first computing device transmits a request to the second computing device to send an encrypted key. The decoder module decrypts the encrypted key to generate a key. The first computing device uses the key to decrypt one or more encrypted records.
    • 所公开的实施例涉及用于安全地访问短语表的系统和方法。 短语表中的一个或多个记录使用第一组密钥进行加密。 第一组密钥使用第二个密钥进行加密。 基于第二密钥编译解码器模块。 此后,一个或多个加密记录和/或解码器模块在客户端被发送到第一计算设备。 第一组加密密钥被发送到第二计算设备。 第一计算设备向第二计算设备发送请求以发送加密的密钥。 解码器模块解密加密的密钥以产生密钥。 第一计算设备使用密钥来解密一个或多个加密记录。
    • 9. 发明授权
    • Phrase-based statistical machine translation as a generalized traveling salesman problem
    • 基于短语的统计机器翻译作为广义旅行推销员的问题
    • US08504353B2
    • 2013-08-06
    • US12509633
    • 2009-07-27
    • Mikhail ZaslavskiyMarc DymetmanNicola Cancedda
    • Mikhail ZaslavskiyMarc DymetmanNicola Cancedda
    • G06F17/28
    • G06F17/2818
    • Systems and methods are described that facilitate phrase-based statistical machine translation (SMT) incorporating bigram (or higher n-gram) language models by modeling bi-phrases as nodes in a graph. Additionally, construction of a translation is modeled as a “tour” amongst the nodes of the graph, such that a translation solution is generated by treating the graph as a generalized traveling salesman problem (GTSP) and solving for an optimal tour. The overall cost of a tour is computed by adding the costs associated with the edges traversed during the tour. Thus, the described systems and methods map the SMT problem directly into a GTSP problem, which itself can be directly converted into a TSP problem.
    • 描述了系统和方法,其通过将双语作为图形中的节点建模来促进通过混合二进制(或更高n-gram)语言模型的基于短语的统计机器翻译(SMT)。 另外,翻译的构建被建模为图形的节点之间的“旅游”,使得通过将图形视为广义旅行推销员问题(GTSP)并解决最佳旅游来生成翻译解决方案。 旅游的总体成本是通过添加与旅行中遍历的边缘相关联的成本来计算的。 因此,所描述的系统和方法将SMT问题直接映射到GTSP问题中,其本身可以直接转换成TSP问题。
    • 10. 发明申请
    • SYSTEM AND METHOD FOR PRODUCTIVE GENERATION OF COMPOUND WORDS IN STATISTICAL MACHINE TRANSLATION
    • 统计机器翻译中复合词的生成系统与方法
    • US20130030787A1
    • 2013-01-31
    • US13189738
    • 2011-07-25
    • Nicola CanceddaSara Stymne
    • Nicola CanceddaSara Stymne
    • G06F17/28
    • G06F17/2881G06F17/2755G06F17/2818
    • A method and a system for making merging decisions for a translation are disclosed which are suited to use where the target language is a productive compounding one. The method includes outputting decisions on merging of pairs of words in a translated text string with a merging system. The merging system can include a set of stored heuristics and/or a merging model. In the case of heuristics, these can include a heuristic by which two consecutive words in the string are considered for merging if the first word of the two consecutive words is recognized as a compound modifier and their observed frequency f1 as a closed compound word is larger than an observed frequency f2 of the two consecutive words as a bigram. In the case of a merging model, it can be one that is trained on features associated with pairs of consecutive tokens of text strings in a training set and predetermined merging decisions for the pairs. A translation in the target language is output, based on the merging decisions for the translated text string.
    • 公开了一种用于进行翻译的合并决定的方法和系统,其适用于目标语言是生产性复合的目的语言。 该方法包括输出关于将翻译的文本串中的单词对合并到合并系统的决定。 合并系统可以包括一组存储的启发式和/或合并模型。 在启发式的情况下,这些可以包括启发式,如果两个连续字中的第一个单词被识别为复合修饰符,则将字符串中的两个连续字考虑用于合并,并且其作为闭合复合词的观察频率f1较大 比观察到的两个连续词的频率f2为一个二进制数。 在合并模型的情况下,它可以是针对与训练集中的文本串的连续令牌的对相关联的特征以及针对对的预定的合并决策而训练的特征。 基于翻译文本字符串的合并决定,输出目标语言的翻译。