专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明授权

US07386442B2 Code, system and method for representing a natural-language text in a form suitable for text manipulation 失效
标题翻译：用于以适合于文本操作的形式表示自然语言文本的代码，系统和方法
公开(公告)号：US07386442B2
公开(公告)日：2008-06-10
申请号：US10612732
申请日：2003-07-01
申请人： Peter J. Dehlinger , Shao Chin
发明人： Peter J. Dehlinger , Shao Chin
IPC分类号： G06F7/00 , G06F17/20 , G06F17/21 , G06F17/27
CPC分类号： G06F17/2715 , G06F17/30705 , Y10S707/99933 , Y10S707/99934 , Y10S707/99935 , Y10S707/99936
摘要： A computer method, system and code, for representing a natural-language document in a vector form suitable for text manipulation operations are disclosed. The method involves determining (a) for each of a plurality of terms selected from one of (i) non-generic words in the document, (ii) proximately arranged word groups in the document, and (iii) a combination of (i) and (ii), a selectivity value of the term related to the frequency of occurrence of that term in a library of texts in one field, relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively. The document is represented as a vector of terms, where the coefficient assigned to each term includes a function of the selectivity value determined for that term.
摘要翻译：公开了一种用于以适合于文本操作操作的向量形式表示自然语言文档的计算机方法，系统和代码。该方法包括确定（a）从文档中的（i）非通用单词之一中选择的多个项中的每一个，（ii）文档中的近似排列的单词组，以及（iii）组合（i）和（ii）与一个领域的文本图书馆中该术语的发生频率有关的术语的选择性值，相对于一个或多个其他一个或多个文本文库中同一术语的出现频率，更多的其他领域。文档被表示为项的向量，其中分配给每个项的系数包括为该项确定的选择性值的函数。

2. 发明授权

US07003516B2 Text representation and method 失效
标题翻译：文本表示和方法
公开(公告)号：US07003516B2
公开(公告)日：2006-02-21
申请号：US10438486
申请日：2003-05-15
申请人： Peter J. Dehlinger , Shao Chin
发明人： Peter J. Dehlinger , Shao Chin
IPC分类号： G06F17/30
CPC分类号： G06F17/2715 , G06F17/30705 , Y10S707/917 , Y10S707/99933 , Y10S707/99934 , Y10S707/99935
摘要： A computer method for representing a natural-language document in a vector form suitable for text manipulation operations is disclosed. The method involves determining (a) for each of a plurality of terms composed of non-generic words and, optionally, proximately arranged word groups in the document, a selectivity value of the term related to the frequency of occurrence of that term in a library of texts in one field, relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively. The document is represented as a vector of terms, where the coefficient assigned to each term includes a function of the selectivity value determined for that term, and optionally related to the inverse document frequency of that word in one or more libraries of texts. Also disclosed are a computer-readable code for carrying out the method, a computer system that employs the code, and a vector produced by the method.
摘要翻译：公开了一种用于以适于文本操作操作的向量形式表示自然语言文档的计算机方法。该方法包括确定（a）由非通用单词组合的多个项目中的每一个以及可选地在该文档中的近似排列的单词组中的每一个，与该图书馆中该术语的发生频率相关的术语的选择性值相对于一个或多个其他领域的文本的一个或多个其他文库中相同词语的发生频率，在一个领域中的文本。文档被表示为术语的向量，其中分配给每个术语的系数包括为该术语确定的选择性值的函数，并且可选地与一个或多个文本库中该单词的逆文档频率相关。还公开了用于执行该方法的计算机可读代码，采用代码的计算机系统以及由该方法产生的向量。

3. 发明申请

US20050120011A1 Code, method, and system for manipulating texts 审中-公开
标题翻译：用于操纵文本的代码，方法和系统
公开(公告)号：US20050120011A1
公开(公告)日：2005-06-02
申请号：US10993462
申请日：2004-11-18
申请人： Peter Dehlinger , Shao Chin
发明人： Peter Dehlinger , Shao Chin
IPC分类号： G06F7/00 , G06F17/27
CPC分类号： G06F17/2705
摘要： Disclosed are a computer-readable code, system and method for combining texts to form novel combinations of texts related to a desired target concept, where the concept is represented in the form of a natural-language text or a list of descriptive word and/or word-group terms. The system operates to find primary and secondary groups of texts having highest term match scores with a first and second subset of terms in the concept, respectively. It then generates pairs of texts containing a text from each of the primary and secondary groups of database texts, and selects for presentation to the user, those pairs of texts having highest overlap scores as determined from one or more of (i) term overlap, (ii) term coverage, (iii) feature-specific cross-correlation, (iv) attribute-specific correlation, and (v) citation score of one or both texts in the pair.
摘要翻译：公开了一种计算机可读代码，系统和方法，用于组合文本以形成与期望的目标概念相关的文本的新颖组合，其中概念以自然语言文本或描述性词语的列表和/或字组词汇。系统操作以分别在概念中找到具有最高术语匹配分数的主要和次要文本组，其具有术语的第一和第二子集。然后，它产生包含来自数据库文本的主要和次要组中的文本的文本对，并且选择用于呈现给用户，具有最高重叠分数的那些文本对由从（i）术语重叠中的一个或多个确定，（ii）术语覆盖，（iii）特征互相关，（iv）属性特异性相关性和（v）该对中的一个或两个文本的引用得分。

4. 发明申请

US20040064304A1 Text representation and method 失效
标题翻译：文本表示和方法
公开(公告)号：US20040064304A1
公开(公告)日：2004-04-01
申请号：US10438486
申请日：2003-05-15
申请人： WORD DATA CORP
发明人： Peter J. Dehlinger , Shao Chin
IPC分类号： G06F017/27
CPC分类号： G06F17/2715 , G06F17/30705 , Y10S707/917 , Y10S707/99933 , Y10S707/99934 , Y10S707/99935
摘要： A computer method for representing a natural-language document in a vector form suitable for text manipulation operations is disclosed. The method involves determining (a) for each of a plurality of terms composed of non-generic words and, optionally, proximately arranged word groups in the document, a selectivity value of the term related to the frequency of occurrence of that term in a library of texts in one field, relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively. The document is represented as a vector of terms, where the coefficient assigned to each term includes a function of the selectivity value determined for that term, and optionally related to the inverse document frequency of that word in one or more libraries of texts. Also disclosed are a computer-readable code for carrying out the method, a computer system that employs the code, and a vector produced by the method.
摘要翻译：公开了一种用于以适于文本操作操作的向量形式表示自然语言文档的计算机方法。该方法包括确定（a）由非通用单词组合的多个项目中的每一个以及可选地在该文档中的近似排列的单词组中的每一个，与该图书馆中该术语的发生频率相关的术语的选择性值相对于一个或多个其他领域的文本的一个或多个其他文库中相同词语的发生频率，在一个领域中的文本。文档被表示为术语的向量，其中分配给每个术语的系数包括为该术语确定的选择性值的函数，并且可选地与一个或多个文本库中该单词的逆文档频率相关。还公开了用于执行该方法的计算机可读代码，采用代码的计算机系统以及由该方法产生的向量。

5. 发明授权

US07016895B2 Text-classification system and method 失效
标题翻译：文本分类系统和方法
公开(公告)号：US07016895B2
公开(公告)日：2006-03-21
申请号：US10374877
申请日：2003-02-25
申请人： Peter J. Dehlinger , Shao Chin
发明人： Peter J. Dehlinger , Shao Chin
IPC分类号： G06F17/30 , G06F7/00
CPC分类号： G06F17/2715 , G06F17/30705 , G06F17/30707 , Y10S707/917 , Y10S707/99933 , Y10S707/99934 , Y10S707/99935
摘要： Disclosed are a computer-readable code, system and method for classifying a target document in the form of a digitally encoded natural-language text as belonging to one or more of two or more different classes. Each of a plurality of non-generic words and optionally, words groups characterizing the target document is selected as a descriptive term if the term has an above-threshold selectivity value in at least one library of texts in a field, where the selectivity value of a term is a measure of the field-specificity of that term. There is then determined, for each of the plurality of sample texts having associated classification identifiers, a match score related to the number of descriptive terms present in or derived from that text that match those in the target text. From the selected matched texts, and the associated classification identifiers, a classification determination of the target document is made.
摘要翻译：公开了一种计算机可读的代码，系统和方法，用于将数字编码的自然语言文本的形式的目标文档分类为属于两个或多个不同类别中的一个或多个。如果术语在至少一个字段中的文本库中具有高于阈值的选择性值，那么选择表征目标文档的多个非通用词和可选地，表征目标文档的单词组作为描述性术语，其中，一个术语是衡量该术语的领域特异性的量度。然后，对于具有相关联的分类标识符的多个样本文本中的每一个，确定与存在于或从文本中匹配目标文本中的文本的描述性词语的数量相关的匹配分数。从所选择的匹配文本和相关联的分类标识符中，进行目标文档的分类确定。

6. 发明申请

US20080183759A1 SYSTEM AND METHOD FOR MATCHING EXPERTISE 审中-公开
标题翻译：用于匹配专业的系统和方法
公开(公告)号：US20080183759A1
公开(公告)日：2008-07-31
申请号：US12021063
申请日：2008-01-28
申请人： Peter J. Dehlinger
发明人： Peter J. Dehlinger
IPC分类号： G06F17/30
CPC分类号： G06F16/382
摘要： Disclosed are a method, machine-readable code, and a database for use in identifying, among a group of patent practitioners, one or more practitioners having expertise related to a given invention or technology. In the method, a search query related to the given invention or technology is used to identify one or more texts of patent abstracts or claims or patent class definitions having high term matches with the user-input query. The identified text(s) are linked to patent-class tags associated with the texts, and the identified tags are linked to one or more members of a group of patent practitioners who wrote and/or prosecuted patents having the patent-class assignments.
摘要翻译：公开了一种方法，机器可读代码和数据库，用于在一组专利从业者中识别具有与给定发明或技术相关的专业知识的一个或多个从业者。在该方法中，使用与给定发明或技术相关的搜索查询来识别与用户输入查询具有高度匹配的专利摘要或权利要求或专利类定义的一个或多个文本。所识别的文本被链接到与文本相关联的专利类标签，并且所识别的标签与一组专利人员的一个或多个成员相关联，所述专利人员撰写和/或起诉具有专利类别分配的专利。

7. 发明授权

US07024408B2 Text-classification code, system and method 失效
标题翻译：文本分类代码，系统和方法
公开(公告)号：US07024408B2
公开(公告)日：2006-04-04
申请号：US10612644
申请日：2003-07-01
申请人： Peter J. Dehlinger , Shao Chin
发明人： Peter J. Dehlinger , Shao Chin
IPC分类号： G06F17/30
CPC分类号： G06F17/2785 , G06F17/30707 , Y10S707/917 , Y10S707/931 , Y10S707/942 , Y10S707/99935 , Y10S707/99936
摘要： Disclosed are a computer-readable code, system and method for classifying a target document in the form of a digitally encoded natural-language text as belonging to one or more of two or more different classes. For each of a plurality of non-generic words and/or words groups characterizing the target document, there is determined a selectivity value calculated as the frequency of occurrence of that term in a library of texts in one field, relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively, and the document is represented as a vector of terms, where the coefficient assigned to each term is a function of the selectivity value determined for that term. There is then determined, for each of the plurality of sample texts having associated classification identifiers, a match score related to the number of descriptive terms present in or derived from that text that match those in the target text. From the selected matched texts, and the associated classification identifiers, a classification determination of the target document is made.
摘要翻译：公开了一种计算机可读的代码，系统和方法，用于将数字编码的自然语言文本的形式的目标文档分类为属于两个或多个不同类别中的一个或多个。对于表征目标文档的多个非通用单词和/或单词组中的每一个，确定在相对于出现频率的一个字段中的文本库中计算为该术语的出现频率的选择性值分别在一个或多个其他领域中的一个或多个其他文本库中的相同术语，并且文档被表示为术语的向量，其中分配给每个术语的系数是为该术语确定的选择性值的函数。然后，对于具有相关联的分类标识符的多个样本文本中的每一个，确定与存在于或从文本中匹配目标文本中的文本的描述性词语的数量相关的匹配分数。从所选择的匹配文本和相关联的分类标识符中，进行目标文档的分类确定。

8. 发明授权

US07181451B2 Processing input text to generate the selectivity value of a word or word group in a library of texts in a field is related to the frequency of occurrence of that word or word group in library 失效
标题翻译：处理输入文本以生成字段中文本库中的单词或单词组的选择性值与库中该单词或单词组的出现频率有关
公开(公告)号：US07181451B2
公开(公告)日：2007-02-20
申请号：US10261971
申请日：2002-09-30
申请人： Peter J. Dehlinger , Shao Chin
发明人： Peter J. Dehlinger , Shao Chin
IPC分类号： G06F17/30 , G06F7/00 , G06F17/28 , G06F17/00
CPC分类号： G06F17/30705 , G06F17/27 , Y10S707/99931 , Y10S707/99936
摘要： Disclosed is an automated system, machine-readable storage medium embodying computer-executable code, and method for generating descriptive words and optionally, multi-word groups derived from a digitally encoded, natural-language input text that describes a concept, invention, or event in a selected field. The system includes (a) an electronic digital computer, (b) a database of words and optionally, word-groups derived from a plurality of texts, and (c) machine-readable storage medium embodying computer-executable code for accessing the database. The database provides, or can be used to calculate, a selectivity value for each of the words and optionally, word groups contained in or derived from the input text. Words and optionally, word groups having an above-threshold selectivity value are selected as descriptive terms from the input text.
摘要翻译：公开了一种体现计算机可执行代码的自动化系统，机器可读存储介质，以及用于生成描述词和可选地从数字编码的自然语言输入文本导出的多字组的方法，其描述概念，发明或事件在选定的字段。该系统包括（a）电子数字计算机，（b）词汇数据库和可选地从多个文本导出的单词组，以及（c）体现用于访问数据库的计算机可执行代码的机器可读存储介质。该数据库提供或可以用于计算每个单词的选择性值，以及可选地，包含在输入文本中或从输入文本导出的单词组。词和可选地，具有高于阈值选择性值的单词组从输入文本中被选择为描述性词语。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式