会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 9. 发明申请
    • SYSTEMS AND METHODS FOR GENERATING CONCEPTS FROM A DOCUMENT CORPUS
    • 用于从文献公司产生概念的系统和方法
    • US20170060991A1
    • 2017-03-02
    • US15348333
    • 2016-11-10
    • LexisNexis, a division of Reed Elsevier Inc.
    • Paul ZhangSanjay SharmaDavid SteinerMark David WassonHarry R. SilverRobin Warling
    • G06F17/30G06Q50/18G06F17/22
    • G06F16/313G06F16/36G06F16/93G06F17/2211G06Q50/18
    • Systems and method for generating concepts from a document corpus are disclosed. In one embodiment, a method for generating concepts from a document includes retrieving, a plurality of terms stored within a first lexicon. The method further includes, for individual terms stored within the first lexicon: determining a first frequency of the term within the document corpus, and determining a second frequency of the term within a comparison document corpus including a plurality of comparison documents, wherein the comparison document corpus is different from the document corpus. The method further includes, for individual terms within the first lexicon: determining a difference between the first frequency and the second frequency, comparing the difference between the first frequency and the second frequency to a comparison metric, and, when the difference between the first frequency and the second frequency satisfies the comparison metric, storing the term as a concept within a second lexicon.
    • 公开了从文档语料库生成概念的系统和方法。 在一个实施例中,用于从文档生成概念的方法包括检索存储在第一词典中的多个项。 该方法还包括:对于存储在第一词典中的各个术语:确定该文档语料库内的术语的第一频率,以及确定包括多个比较文档的比较文档语料库内的术语的第二频率,其中比较文档 语料库与文档语料库不同。 该方法还包括:对于第一词典内的各个术语:确定第一频率和第二频率之间的差异,将第一频率和第二频率之间的差值比较为比较度量,以及当第一频率 并且第二频率满足比较度量,将术语作为概念存储在第二词汇中。