会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • System and method for the indexing of organic chemical structures mined from text documents
    • 从文本文件开采有机化学结构索引的系统和方法
    • US07899827B2
    • 2011-03-01
    • US10797359
    • 2004-03-09
    • Stephen BoyerAnna Rosa CodenJames William Cooper
    • Stephen BoyerAnna Rosa CodenJames William Cooper
    • G06F17/30
    • G06F19/707
    • Disclosed is a method, a computer program product and a system for processing documents that contain chemical names. The system has a unit to partition document text and to assign semantic meaning to words; a unit to recognize any substructures present in the chemical name fragments; and a unit to determine structural connectivity information of the chemical name fragments and recognized substructures and to store the determined structural connectivity information in a searchable index. The system further includes a unit to search a text index using at least one of a fragment name and a substructure name and to search the structure index by at least one of fragment connectivity and substructure connectivity. At an intersection of the search results from the structure index and the text index, the system operates to identify at least one document that contains a reference to a corresponding chemical compound.
    • 公开了一种用于处理含有化学名称的文件的方法,计算机程序产品和系统。 该系统具有分隔文档文本并为语义分配语义的单位; 识别化学名称片段中存在的任何亚结构的单元; 以及用于确定化学名称片段和识别的子结构的结构连接性信息并将确定的结构连接性信息存储在可搜索的索引中的单元。 该系统还包括使用片段名称和子结构名称中的至少一个来搜索文本索引的单元,并且通过片段连接性和子结构连接性中的至少一个来搜索结构索引。 在结构索引和文本索引的搜索结果的交集处,系统操作以识别包含对相应化合物的引用的至少一个文档。
    • 7. 发明授权
    • System and method for the recognition of organic chemical names in text documents
    • 用于识别文本文件中有机化学名称的系统和方法
    • US07676358B2
    • 2010-03-09
    • US10670675
    • 2003-09-24
    • Anna Rosa CodenJames William Cooper
    • Anna Rosa CodenJames William Cooper
    • G06F17/28
    • G06F17/278
    • This invention provides a method, a system and a computer program for recognizing technical terms. In the preferred embodiment the technical terms are chemical names, and in a most preferred embodiment the technical terms are organic chemical names. A computer program product stores in a computer readable form a set of computer program instructions for directing at least one computer to process a text document. The set of computer program instructions include instructions for assigning corresponding associated parts of speech to words found in the document. The instructions for assigning include instructions to apply a plurality of regular expressions, rules and a plurality of dictionaries to recognize organic chemical name fragments, to combine recognized organic chemical name fragments into a complete organic chemical name, and to assign the complete organic chemical name with one part of speech. The regular expressions include a plurality of patterns, individual ones of which are comprised of at least one of characters, numbers and punctuation. For example, the punctuation can comprise at least one of parenthesis, square bracket, hyphen, colon and semi-colon, and the characters can comprise at least one of upper case C, O, R, N and H, and further comprise strings of at least one of lower case xy, ene, ine, yl, ane and oic.
    • 本发明提供一种用于识别技术术语的方法,系统和计算机程序。 在优选实施方案中,技术术语是化学名称,并且在最优选的实施方案中,技术术语是有机化学名称。 计算机程序产品以计算机可读形式存储用于指导至少一台计算机处理文本文档的一组计算机程序指令。 该组计算机程序指令包括用于将相应的相关词组分配给文档中找到的单词的指令。 用于分配的指令包括应用多个正则表达式,规则和多个词典来识别有机化学名称片段的指令,将已识别的有机化学名称片段合并成完整的有机化学名称,并将完整的有机化学名称与 一部分讲话。 正则表达式包括多个模式,其中各个模式由字符,数字和标点符号中的至少一个组成。 例如,标点符号可以包括括号,方括号,连字符,冒号和分号中的至少一个,并且字符可以包括大写C,O,R,N和H中的至少一个,并且还包括 小写xy,ene,ine,yl,ane和oic中的至少一个。
    • 8. 发明授权
    • System, method, and program product for identifying and describing topics in a collection of electronic documents
    • 用于识别和描述电子文档集合中的主题的系统,方法和程序产品
    • US06775677B1
    • 2004-08-10
    • US09517540
    • 2000-03-02
    • Rie Kubota AndoBranimir Konstantinov BoguraevRoy Jefferson ByrdJames William CooperMary Susan Neff
    • Rie Kubota AndoBranimir Konstantinov BoguraevRoy Jefferson ByrdJames William CooperMary Susan Neff
    • G06F1700
    • G06F17/3069Y10S707/99933Y10S707/99942Y10S707/99943
    • To identify and describe one or more topics in one or more documents in a document set, a term set process creates a basic term set from the document set where the term set comprises one or more basic terms of one or more words in the document. A document vector process then creates a document vector for each document. The document vector has a document vector direction representing what the document is about. A topic vector process then creates one or more topic vectors from the document vectors. Each topic vector has a topic vector direction representing a topic in the document set. A topic term set process creates a topic term set for each topic vector that comprises one or more of the basic terms describing the topic represented by the topic vector. Each of the basic terms in the topic term set associated with the relevancy of the basic term. A topic-document relevance process creates a topic-document relevance for each topic vector and each document vector. The topic-document relevance representing the relevance of the document to the topic. A topic sentence set process creates a topic sentence set for each topic vector that comprises of one or more topic sentences that describe the topic represented by the topic vector. Each of the topic sentences is then associated with the relevance of the topic sentence to the topic represented by the topic vector.
    • 为了识别和描述文档集中的一个或多个文档中的一个或多个主题,术语集合过程从文档集创建基本术语集合,其中术语集合包括文档中的一个或多个单词的一个或多个基本术语。 文档向量过程然后为每个文档创建文档向量。 文档向量具有表示文档的文档向量方向。 然后,主题向量过程从文档向量创建一个或多个主题向量。 每个主题向量具有表示文档集中的主题的主题向量方向。 主题术语集过程为每个主题向量创建一个主题术语集,其包括描述由主题向量表示的主题的一个或多个基本术语。 与基本术语的相关性相关的主题术语集中的每个基本术语。 主题文档相关性过程为每个主题向量和每个文档向量创建主题文档相关性。 主题 - 文档相关性表示文档与主题的相关性。 主题句集过程为每个主题向量创建一个主题句集,该主题向量包含一个或多个描述由主题向量表示的主题的主题句子。 然后,每个主题句与主题句与由主题向量表示的主题的相关性相关联。
    • 10. 发明授权
    • System and method for implementing cooperative text searching
    • 实现合作文本搜索的系统和方法
    • US06185553B2
    • 2001-02-06
    • US09062272
    • 1998-04-15
    • Roy Jefferson ByrdJames William Cooper
    • Roy Jefferson ByrdJames William Cooper
    • G06F1730
    • G06Q10/10Y10S707/99931Y10S707/99933Y10S707/99934Y10S707/99935Y10S707/99936Y10S707/99937Y10S707/99944
    • Two or more client users, connected by one or more networks to the server, cooperatively search a database. The server has a data structure that has two or more cooperative user identifiers. Each cooperative user identifier represents one of the clients that has indicated a desired to establish a cooperative search. The data structure further has a session identifier that associates two or more of the cooperative user identifiers as session participants in an established a cooperative session. A command process, executing on the server, receives a query from one of the session participants (clients), accesses results of the query from a search engine, and distributes the results to all of the session participants. Queries of the cooperative session are related to indexed terms so that future uses will find this relationship when using similar queries. These are applications in searching for sales and service information.
    • 由一个或多个网络连接到服务器的两个或多个客户端用户协同地搜索数据库。 服务器具有具有两个或多个协作用户标识符的数据结构。 每个协作用户标识符表示已经指示建立合作搜索的客户端之一。 数据结构还具有会话标识符,其将两个或多个协作用户标识符作为会话参与者在建立的协作会话中相关联。 在服务器上执行的命令处理从一个会话参与者(客户机)接收查询,从搜索引擎访问查询的结果,并将结果分发给所有的会话参与者。 合作会话的查询与索引条款相关,以便将来的使用将在使用类似查询时找到此关系。 这些是搜索销售和服务信息的应用程序。