会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 11. 发明授权
    • System and method for text cleaning by classifying sentences using numerically represented features
    • 通过使用数字表示的特征对句子进行分类来进行文本清理的系统和方法
    • US08380492B2
    • 2013-02-19
    • US12775580
    • 2010-05-07
    • Liqin XuHyun Chul Lee
    • Liqin XuHyun Chul Lee
    • G06F17/27G06F17/20G06F17/21
    • G06F17/27
    • A method and system for cleaning an electronic document are provided. The method comprises: identifying at least one sentence in the electronic document; numerically representing features of the sentence to obtain a numeric feature representation associated with the sentence; inputting the numeric feature representation into a machine learning classifier, the machine learning classifier being configured to determine, based on each numeric feature representation, whether the sentence associated with that numeric feature representation is a bad sentence; and removing sentences determined to be bad sentences from the electronic document to create a cleaned document.
    • 提供了一种用于清洁电子文档的方法和系统。 该方法包括:识别电子文档中的至少一个句子; 数字地表示句子的特征以获得与句子相关联的数字特征表示; 将所述数字特征表示输入到机器学习分类器中,所述机器学习分类器被配置为基于每个数字特征表示来确定与所述数字特征表示相关联的句子是否是坏句子; 并且从电子文档中移除确定为不良句子的句子以创建清洁的文档。
    • 14. 发明授权
    • System and method for phrase identification
    • 用于短语识别的系统和方法
    • US08868469B2
    • 2014-10-21
    • US12775547
    • 2010-05-07
    • Liqin XuHyun Chul Lee
    • Liqin XuHyun Chul Lee
    • G06F15/18G06F17/27
    • G06F17/27
    • A phrase identification system and method are provided. The method comprises: identifying one or more phrase candidates in the electronic document; selecting one of the phrase candidates; numerically representing features of the selected phrase candidates to obtain a numeric feature representation associated with that phrase candidate; and inputting the numeric feature representation into a machine learning classifier, the machine learning classifier being configured to determine, based on each numeric feature representation, whether the phrase candidate associated with that numeric feature representation is a phrase.
    • 提供短语识别系统和方法。 该方法包括:识别电子文档中的一个或多个短语候选者; 选择短语候选人之一; 数字地表示所选择的短语候选的特征以获得与该短语候选相关联的数字特征表示; 以及将所述数字特征表示输入到机器学习分类器中,所述机器学习分类器被配置为基于每个数字特征表示来确定与所述数字特征表示相关联的短语候选是短语。
    • 19. 发明授权
    • System and method for matching comment data to text data
    • 将注释数据与文本数据相匹配的系统和方法
    • US08972413B2
    • 2015-03-03
    • US13253157
    • 2011-10-05
    • Hyun Chul LeeLiqin XuKe Zeng
    • Hyun Chul LeeLiqin XuKe Zeng
    • G06F7/04G06F17/30G06F17/24
    • G06F17/30722G06F17/241G06F17/30011G06F17/30342G06F17/30616G06F17/30864
    • Methods and comment association systems for associating one or more comments with one or more primary electronic documents are described. In one aspect, the method comprises: identifying, at a comment association system, one or more key terms from at least a portion of the one or more primary electronic documents; identifying, at the comment association system, one or more comments associated with the identified key terms; determining, at the comment association system, whether an identified comment is sufficiently related to the one or more primary electronic documents by calculating one or more relation score for that identified comment and comparing the relation score to one or more threshold; and if the identified comment is sufficiently related to the one or more primary electronic documents, then associating the identified comment with the one or more primary electronic documents at the comment association system.
    • 描述用于将一个或多个注释与一个或多个主要电子文档相关联的方法和注释关联系统。 一方面,该方法包括:在注释关联系统处从一个或多个主要电子文档的至少一部分识别一个或多个关键术语; 在所述评论关联系统处识别与所识别的关键术语相关联的一个或多个注释; 在所述评论关联系统处,通过计算所识别的评论的一个或多个关系得分并将所述关系得分与一个或多个阈值进行比较来确定所识别的注释是否与所述一个或多个主要电子文档充分相关; 并且如果所识别的注释与所述一个或多个主要电子文档充分相关,则将所识别的注释与所述注释关联系统上的所述一个或多个主要电子文档相关联。
    • 20. 发明授权
    • Systems and methods for ranking document clusters
    • 用于对文档集群进行排序的系统和方法
    • US08612447B2
    • 2013-12-17
    • US13293190
    • 2011-11-10
    • Francisco Javier Estrada GuadarramaDarius BraziunasHyun Chul Lee
    • Francisco Javier Estrada GuadarramaDarius BraziunasHyun Chul Lee
    • G06F17/30
    • G06F17/3053G06F17/30864
    • Document cluster ranking systems and methods of ranking document clusters are described. In some example embodiments, the method comprises: obtaining, at a document cluster ranking system, a value associated with a first feature for each of a plurality of document clusters; based on the values associated with the first feature, automatically generating, at the document cluster ranking system, a plurality of first feature bins, each first feature bin defining a range of values and a bin identifier; and obtaining a score for one of the document clusters, by: i) identifying the first feature bin having a range of values which includes the obtained value associated with the first feature for that one of the document clusters; and ii) determining a score for that document cluster based on the first feature bin identifier for the identified first feature bin.
    • 描述文档集群排名系统和排序文档集群的方法。 在一些示例性实施例中,该方法包括:在文档集群排名系统处获取与多个文档簇中的每一个的第一特征相关联的值; 基于与所述第一特征相关联的值,在所述文档簇排序系统处自动生成多个第一特征区块,每个第一特征区段定义值范围和区块标识符; 以及通过以下方式获得所述文档簇中的一个的分数:i)识别具有包括与所述文档簇中的所述一个的所述第一特征相关联的所获得的值的值范围的所述第一特征块; 以及ii)基于所识别的第一特征仓的第一特征箱标识符来确定该文档簇的得分。