会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明申请
    • MINING MULTI-LINGUAL DATA
    • 采矿多元数据
    • US20160162575A1
    • 2016-06-09
    • US14559540
    • 2014-12-03
    • Facebook, Inc.
    • Matthias Gerhard EckYing ZhangYury Andreyevich ZemlyanskiyAlexander Waibel
    • G06F17/30G06F17/28
    • G06F17/289G06F17/2818G06F17/2827G06F17/30864
    • Technology is disclosed for mining training data to create machine translation engines. Training data can be mined as translation pairs from single content items that contain multiple languages; multiple content items in different languages that are related to the same or similar target; or multiple content items that are generated by the same author in different languages. Locating content items can include identifying potential sources of translation pairs that fall into these categories and applying filtering techniques to quickly gather those that are good candidates for being actual translation pairs. When actual translation pairs are located, they can be used to retrain a machine translation engine as in-domain for social media content items.
    • 技术被披露用于挖掘培训数据以创建机器翻译引擎。 训练数据可以作为包含多种语言的单个内容项目的翻译对进行挖掘; 与相同或相似目标相关的不同语言的多个内容项目; 或由不同语言的同一作者生成的多个内容项。 查找内容项可以包括识别属于这些类别的翻译对的潜在来源,并应用过滤技术来快速收集那些作为实际翻译对的良好候选者。 当实际的翻译对被定位时,它们可以用于重新训练机器翻译引擎作为社交媒体内容项目的域内。