专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明申请

US20180089178A1 MINING MULTI-LINGUAL DATA 审中-公开
公开(公告)号：US20180089178A1
公开(公告)日：2018-03-29
申请号：US15823492
申请日：2017-11-27
申请人： Facebook, Inc.
发明人： Matthias Gerhard Eck , Ying Zhang , Yury Andreyevich Zemlyanskiy , Alexander Waibel
IPC分类号： G06F17/28 , G06F17/30
CPC分类号： G06F17/289 , G06F16/951 , G06F17/2818 , G06F17/2827
摘要： Technology is disclosed for mining training data to create machine translation engines. Training data can be mined as translation pairs from single content items that contain multiple languages; multiple content items in different languages that are related to the same or similar target; or multiple content items that are generated by the same author in different languages. Locating content items can include identifying potential sources of translation pairs that fall into these categories and applying filtering techniques to quickly gather those that are good candidates for being actual translation pairs. When actual translation pairs are located, they can be used to retrain a machine translation engine as in-domain for social media content items.

2. 发明授权

US09864744B2 Mining multi-lingual data 有权
公开(公告)号：US09864744B2
公开(公告)日：2018-01-09
申请号：US14559540
申请日：2014-12-03
申请人： Facebook, Inc.
发明人： Matthias Gerhard Eck , Ying Zhang , Yury Andreyevich Zemlyanskiy , Alexander Waibel
IPC分类号： G06F17/30 , G06F17/28
CPC分类号： G06F17/289 , G06F17/2818 , G06F17/2827 , G06F17/30864
摘要： Technology is disclosed for mining training data to create machine translation engines. Training data can be mined as translation pairs from single content items that contain multiple languages; multiple content items in different languages that are related to the same or similar target; or multiple content items that are generated by the same author in different languages. Locating content items can include identifying potential sources of translation pairs that fall into these categories and applying filtering techniques to quickly gather those that are good candidates for being actual translation pairs. When actual translation pairs are located, they can be used to retrain a machine translation engine as in-domain for social media content items.

3. 发明申请

US20160162575A1 MINING MULTI-LINGUAL DATA 有权
标题翻译：采矿多元数据
公开(公告)号：US20160162575A1
公开(公告)日：2016-06-09
申请号：US14559540
申请日：2014-12-03
申请人： Facebook, Inc.
发明人： Matthias Gerhard Eck , Ying Zhang , Yury Andreyevich Zemlyanskiy , Alexander Waibel
IPC分类号： G06F17/30 , G06F17/28
CPC分类号： G06F17/289 , G06F17/2818 , G06F17/2827 , G06F17/30864
摘要： Technology is disclosed for mining training data to create machine translation engines. Training data can be mined as translation pairs from single content items that contain multiple languages; multiple content items in different languages that are related to the same or similar target; or multiple content items that are generated by the same author in different languages. Locating content items can include identifying potential sources of translation pairs that fall into these categories and applying filtering techniques to quickly gather those that are good candidates for being actual translation pairs. When actual translation pairs are located, they can be used to retrain a machine translation engine as in-domain for social media content items.
摘要翻译：技术被披露用于挖掘培训数据以创建机器翻译引擎。训练数据可以作为包含多种语言的单个内容项目的翻译对进行挖掘; 与相同或相似目标相关的不同语言的多个内容项目; 或由不同语言的同一作者生成的多个内容项。查找内容项可以包括识别属于这些类别的翻译对的潜在来源，并应用过滤技术来快速收集那些作为实际翻译对的良好候选者。当实际的翻译对被定位时，它们可以用于重新训练机器翻译引擎作为社交媒体内容项目的域内。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式