会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明授权
    • Text representation and method
    • 文本表示和方法
    • US07003516B2
    • 2006-02-21
    • US10438486
    • 2003-05-15
    • Peter J. DehlingerShao Chin
    • Peter J. DehlingerShao Chin
    • G06F17/30
    • G06F17/2715G06F17/30705Y10S707/917Y10S707/99933Y10S707/99934Y10S707/99935
    • A computer method for representing a natural-language document in a vector form suitable for text manipulation operations is disclosed. The method involves determining (a) for each of a plurality of terms composed of non-generic words and, optionally, proximately arranged word groups in the document, a selectivity value of the term related to the frequency of occurrence of that term in a library of texts in one field, relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively. The document is represented as a vector of terms, where the coefficient assigned to each term includes a function of the selectivity value determined for that term, and optionally related to the inverse document frequency of that word in one or more libraries of texts. Also disclosed are a computer-readable code for carrying out the method, a computer system that employs the code, and a vector produced by the method.
    • 公开了一种用于以适于文本操作操作的向量形式表示自然语言文档的计算机方法。 该方法包括确定(a)由非通用单词组合的多个项目中的每一个以及可选地在该文档中的近似排列的单词组中的每一个,与该图书馆中该术语的发生频率相关的术语的选择性值 相对于一个或多个其他领域的文本的一个或多个其他文库中相同词语的发生频率,在一个领域中的文本。 文档被表示为术语的向量,其中分配给每个术语的系数包括为该术语确定的选择性值的函数,并且可选地与一个或多个文本库中该单词的逆文档频率相关。 还公开了用于执行该方法的计算机可读代码,采用代码的计算机系统以及由该方法产生的向量。
    • 3. 发明申请
    • Code, method, and system for manipulating texts
    • 用于操纵文本的代码,方法和系统
    • US20050120011A1
    • 2005-06-02
    • US10993462
    • 2004-11-18
    • Peter DehlingerShao Chin
    • Peter DehlingerShao Chin
    • G06F7/00G06F17/27
    • G06F17/2705
    • Disclosed are a computer-readable code, system and method for combining texts to form novel combinations of texts related to a desired target concept, where the concept is represented in the form of a natural-language text or a list of descriptive word and/or word-group terms. The system operates to find primary and secondary groups of texts having highest term match scores with a first and second subset of terms in the concept, respectively. It then generates pairs of texts containing a text from each of the primary and secondary groups of database texts, and selects for presentation to the user, those pairs of texts having highest overlap scores as determined from one or more of (i) term overlap, (ii) term coverage, (iii) feature-specific cross-correlation, (iv) attribute-specific correlation, and (v) citation score of one or both texts in the pair.
    • 公开了一种计算机可读代码,系统和方法,用于组合文本以形成与期望的目标概念相关的文本的新颖组合,其中概念以自然语言文本或描述性词语的列表和/或 字组词汇。 系统操作以分别在概念中找到具有最高术语匹配分数的主要和次要文本组,其具有术语的第一和第二子集。 然后,它产生包含来自数据库文本的主要和次要组中的文本的文本对,并且选择用于呈现给用户,具有最高重叠分数的那些文本对由从(i)术语重叠中的一个或多个确定, (ii)术语覆盖,(iii)特征互相关,(iv)属性特异性相关性和(v)该对中的一个或两个文本的引用得分。
    • 4. 发明申请
    • Text representation and method
    • 文本表示和方法
    • US20040064304A1
    • 2004-04-01
    • US10438486
    • 2003-05-15
    • WORD DATA CORP
    • Peter J. DehlingerShao Chin
    • G06F017/27
    • G06F17/2715G06F17/30705Y10S707/917Y10S707/99933Y10S707/99934Y10S707/99935
    • A computer method for representing a natural-language document in a vector form suitable for text manipulation operations is disclosed. The method involves determining (a) for each of a plurality of terms composed of non-generic words and, optionally, proximately arranged word groups in the document, a selectivity value of the term related to the frequency of occurrence of that term in a library of texts in one field, relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively. The document is represented as a vector of terms, where the coefficient assigned to each term includes a function of the selectivity value determined for that term, and optionally related to the inverse document frequency of that word in one or more libraries of texts. Also disclosed are a computer-readable code for carrying out the method, a computer system that employs the code, and a vector produced by the method.
    • 公开了一种用于以适于文本操作操作的向量形式表示自然语言文档的计算机方法。 该方法包括确定(a)由非通用单词组合的多个项目中的每一个以及可选地在该文档中的近似排列的单词组中的每一个,与该图书馆中该术语的发生频率相关的术语的选择性值 相对于一个或多个其他领域的文本的一个或多个其他文库中相同词语的发生频率,在一个领域中的文本。 文档被表示为术语的向量,其中分配给每个术语的系数包括为该术语确定的选择性值的函数,并且可选地与一个或多个文本库中该单词的逆文档频率相关。 还公开了用于执行该方法的计算机可读代码,采用代码的计算机系统以及由该方法产生的向量。
    • 5. 发明授权
    • Text-classification system and method
    • 文本分类系统和方法
    • US07016895B2
    • 2006-03-21
    • US10374877
    • 2003-02-25
    • Peter J. DehlingerShao Chin
    • Peter J. DehlingerShao Chin
    • G06F17/30G06F7/00
    • G06F17/2715G06F17/30705G06F17/30707Y10S707/917Y10S707/99933Y10S707/99934Y10S707/99935
    • Disclosed are a computer-readable code, system and method for classifying a target document in the form of a digitally encoded natural-language text as belonging to one or more of two or more different classes. Each of a plurality of non-generic words and optionally, words groups characterizing the target document is selected as a descriptive term if the term has an above-threshold selectivity value in at least one library of texts in a field, where the selectivity value of a term is a measure of the field-specificity of that term. There is then determined, for each of the plurality of sample texts having associated classification identifiers, a match score related to the number of descriptive terms present in or derived from that text that match those in the target text. From the selected matched texts, and the associated classification identifiers, a classification determination of the target document is made.
    • 公开了一种计算机可读的代码,系统和方法,用于将数字编码的自然语言文本的形式的目标文档分类为属于两个或多个不同类别中的一个或多个。 如果术语在至少一个字段中的文本库中具有高于阈值的选择性值,那么选择表征目标文档的多个非通用词和可选地,表征目标文档的单词组作为描述性术语,其中, 一个术语是衡量该术语的领域特异性的量度。 然后,对于具有相关联的分类标识符的多个样本文本中的每一个,确定与存在于或从文本中匹配目标文本中的文本的描述性词语的数量相关的匹配分数。 从所选择的匹配文本和相关联的分类标识符中,进行目标文档的分类确定。
    • 6. 发明申请
    • SYSTEM AND METHOD FOR MATCHING EXPERTISE
    • 用于匹配专业的系统和方法
    • US20080183759A1
    • 2008-07-31
    • US12021063
    • 2008-01-28
    • Peter J. Dehlinger
    • Peter J. Dehlinger
    • G06F17/30
    • G06F16/382
    • Disclosed are a method, machine-readable code, and a database for use in identifying, among a group of patent practitioners, one or more practitioners having expertise related to a given invention or technology. In the method, a search query related to the given invention or technology is used to identify one or more texts of patent abstracts or claims or patent class definitions having high term matches with the user-input query. The identified text(s) are linked to patent-class tags associated with the texts, and the identified tags are linked to one or more members of a group of patent practitioners who wrote and/or prosecuted patents having the patent-class assignments.
    • 公开了一种方法,机器可读代码和数据库,用于在一组专利从业者中识别具有与给定发明或技术相关的专业知识的一个或多个从业者。 在该方法中,使用与给定发明或技术相关的搜索查询来识别与用户输入查询具有高度匹配的专利摘要或权利要求或专利类定义的一个或多个文本。 所识别的文本被链接到与文本相关联的专利类标签,并且所识别的标签与一组专利人员的一个或多个成员相关联,所述专利人员撰写和/或起诉具有专利类别分配的专利。
    • 7. 发明授权
    • Text-classification code, system and method
    • 文本分类代码,系统和方法
    • US07024408B2
    • 2006-04-04
    • US10612644
    • 2003-07-01
    • Peter J. DehlingerShao Chin
    • Peter J. DehlingerShao Chin
    • G06F17/30
    • G06F17/2785G06F17/30707Y10S707/917Y10S707/931Y10S707/942Y10S707/99935Y10S707/99936
    • Disclosed are a computer-readable code, system and method for classifying a target document in the form of a digitally encoded natural-language text as belonging to one or more of two or more different classes. For each of a plurality of non-generic words and/or words groups characterizing the target document, there is determined a selectivity value calculated as the frequency of occurrence of that term in a library of texts in one field, relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively, and the document is represented as a vector of terms, where the coefficient assigned to each term is a function of the selectivity value determined for that term. There is then determined, for each of the plurality of sample texts having associated classification identifiers, a match score related to the number of descriptive terms present in or derived from that text that match those in the target text. From the selected matched texts, and the associated classification identifiers, a classification determination of the target document is made.
    • 公开了一种计算机可读的代码,系统和方法,用于将数字编码的自然语言文本的形式的目标文档分类为属于两个或多个不同类别中的一个或多个。 对于表征目标文档的多个非通用单词和/或单词组中的每一个,确定在相对于出现频率的一个字段中的文本库中计算为该术语的出现频率的选择性值 分别在一个或多个其他领域中的一个或多个其他文本库中的相同术语,并且文档被表示为术语的向量,其中分配给每个术语的系数是为该术语确定的选择性值的函数 。 然后,对于具有相关联的分类标识符的多个样本文本中的每一个,确定与存在于或从文本中匹配目标文本中的文本的描述性词语的数量相关的匹配分数。 从所选择的匹配文本和相关联的分类标识符中,进行目标文档的分类确定。