    • 8. Invention application
    • METHOD AND APPARATUS FOR PARAPHRASE ACQUISITION
    • US20130103390A1
    • 2013-04-25
    • US13655852
    • 2012-10-19
    • Atsushi FUJITA, Pierre ISABELLE
    • Atsushi FUJITA, Pierre ISABELLE
    • G06F17/27
    • G06F17/2765
    • A computer-based natural language processing method for identifying paraphrases in corpora using statistical analysis comprises: deriving a set of starting paraphrases (SPs) from a parallel corpus, each SP having at least two phrases that are phrase aligned; generating a set of paraphrase patterns (PPs) by identifying shared terms within two aligned phrases of an SP, and defining a PP having slots in place of the shared terms, in right hand side (RHS) and left hand side (LHS) expressions; and collecting output paraphrases (OPs) by identifying instances of the PPs in a non-parallel corpus. Because the PPs are generated from paraphrase information reliably derived from a small parallel corpus, and their instances are then collected across the large non-parallel corpus, the method achieves better coverage of the paraphrases in the language with fewer errors.
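
The three steps named in the abstract (starting paraphrases, slotted paraphrase patterns, output paraphrases) can be illustrated with a small Python sketch. This is a toy reading of the abstract, not the claimed implementation: the function names, the single-word slots, the regex matching, and the rule that an output paraphrase is kept only when both sides of a pattern occur in the corpus with the same slot filler are all assumptions made for illustration. The statistical scoring mentioned in the abstract is left out, and the starting paraphrase is supplied by hand rather than derived from a parallel corpus.

```python
import re
from typing import Dict, Iterable, List, Optional, Set, Tuple, Union

Token = Union[str, int]                 # int = slot index, str = literal word
Side = Tuple[Token, ...]
PP = Tuple[Side, Side]                  # (LHS, RHS) paraphrase pattern


def make_pattern(lhs: str, rhs: str) -> Optional[PP]:
    """Turn one starting paraphrase into a pattern by replacing the words
    shared by both phrases with numbered slots (hypothetical helper)."""
    lhs_toks, rhs_toks = lhs.lower().split(), rhs.lower().split()
    shared = [w for w in dict.fromkeys(lhs_toks) if w in rhs_toks]
    if not shared:
        return None
    slot = {w: i for i, w in enumerate(shared)}
    lhs_pat = tuple(slot.get(w, w) for w in lhs_toks)
    rhs_pat = tuple(slot.get(w, w) for w in rhs_toks)
    # Assumption: each side must keep at least one literal word as an anchor.
    for side in (lhs_pat, rhs_pat):
        if all(isinstance(t, int) for t in side):
            return None
    return lhs_pat, rhs_pat


def side_regex(side: Side) -> "re.Pattern[str]":
    """Compile one side of a pattern; each slot matches a single word."""
    parts: List[str] = []
    seen: Set[int] = set()
    for tok in side:
        if isinstance(tok, int):
            if tok in seen:
                parts.append(f"(?P=s{tok})")        # same slot, same filler
            else:
                parts.append(f"(?P<s{tok}>\\w+)")
                seen.add(tok)
        else:
            parts.append(re.escape(tok))
    return re.compile(r"\b" + r"\s+".join(parts) + r"\b")


def fillers(side: Side, corpus: Iterable[str]) -> Set[Tuple[Tuple[str, str], ...]]:
    """Collect every slot binding with which this side occurs in the corpus."""
    rx = side_regex(side)
    found = set()
    for sentence in corpus:
        for m in rx.finditer(sentence.lower()):
            found.add(tuple(sorted(m.groupdict().items())))
    return found


def fill(side: Side, binding: Dict[int, str]) -> str:
    return " ".join(binding[t] if isinstance(t, int) else t for t in side)


def collect_output_paraphrases(patterns: Iterable[PP],
                               corpus: List[str]) -> Set[Tuple[str, str]]:
    """Keep a phrase pair only if both sides are attested with the same fillers."""
    out = set()
    for lhs, rhs in patterns:
        for binding in fillers(lhs, corpus) & fillers(rhs, corpus):
            b = {int(name[1:]): word for name, word in binding}
            out.add((fill(lhs, b), fill(rhs, b)))
    return out


# Hand-written stand-ins for the parallel-corpus output and the non-parallel corpus.
starting_paraphrases = [("amendment of regulation", "amending regulation")]
corpus = [
    "The amendment of statutes takes time.",
    "Amending statutes is slow.",
    "The amendment of treaties is rare.",
]
patterns = [p for p in (make_pattern(*sp) for sp in starting_paraphrases) if p]
print(collect_output_paraphrases(patterns, corpus))
```

On this toy input the shared word "regulation" becomes slot 0, giving the pattern "amendment of X" / "amending X". Only "statutes" fills the slot on both sides in the corpus, so the single collected pair is ("amendment of statutes", "amending statutes"); "treaties" is dropped because its right-hand side never occurs.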