会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明授权
    • Information retrieval utilizing semantic representation of text by
identifying hypernyms and indexing multiple tokenized semantic
structures to a same passage of text
    • 信息检索利用文本的语义表示,通过识别多义词,并将多个标记语义结构索引到同一段文本
    • US6161084A
    • 2000-12-12
    • US366499
    • 1999-08-03
    • John J. MesserlyGeorge E. HeidornStephen D. RichardsonWilliam B. DolanKaren Jensen
    • John J. MesserlyGeorge E. HeidornStephen D. RichardsonWilliam B. DolanKaren Jensen
    • G06F17/27G06F17/30
    • G06F17/30684G06F17/271G06F17/277G06F17/2785Y10S707/99932Y10S707/99935
    • The present invention is directed to performing information retrieval utilizing semantic representation of text. In a preferred embodiment, a tokenizer generates from an input string information retrieval tokens that characterize the semantic relationship expressed in the input string. The tokenizer first creates from the input string a primary logical form characterizing a semantic relationship between selected words in the input string. The tokenizer then identifies hypernyms that each have an "is a" relationship with one of the selected words in the input string. The tokenizer then constructs from the primary logical form one or more alternative logical forms. The tokenizer constructs each alternative logical form by, for each of one or more of the selected words in the input string, replacing the selected word in the primary logical form with an identified hypernym of the selected word. Finally, the tokenizer generates tokens representing both the primary logical form and the alternative logical forms. The tokenizer is preferably used to generate tokens for both constructing an index representing target documents and processing a query against that index.
    • 本发明旨在利用文本的语义表示来执行信息检索。 在优选实施例中,标记器从输入字符串生成表征输入字符串中表达的语义关系的信息检索令牌。 标记器首先从输入字符串创建表示输入字符串中所选择的单词之间的语义关系的主逻辑形式。 然后,标记器识别每个与输入字符串中所选择的一个字符之间具有“是”关系的超文本。 然后,标记器从主逻辑形式构造一个或多个替代的逻辑形式。 令牌化器通过输入字符串中的一个或多个所选择的单词中的每个替换逻辑形式来构造每个备选逻辑形式,用所选择的单词的所识别的超级词替换主逻辑形式中的所选择的单词。 最后,tokenizer生成表示主逻辑表单和替代逻辑表单的令牌。 令牌化器优选地用于生成用于构建表示目标文档的索引并针对该索引处理查询的令牌。
    • 3. 发明授权
    • Determining similarity between words
    • 确定单词之间的相似性
    • US6098033A
    • 2000-08-01
    • US904223
    • 1997-07-31
    • Stephen D. RichardsonWilliam B. Dolan
    • Stephen D. RichardsonWilliam B. Dolan
    • G06F17/27
    • G06F17/277G06F17/2785
    • The present invention provides a facility for determining similarity between two input words utilizing the frequencies with which path patterns occurring between the words occur between words known to be synonyms. A preferred embodiment of the facility utilizes a training phase and a similarity determination phase. In the training phase, the facility first identifies, for a number of pairs of synonyms, the most salient semantic relation paths between each pair of synonyms. The facility then extracts from these semantic relation paths their path patterns, which each comprise a series of directional relation types. The number of times that each path pattern occurs between pairs of synonyms, called the frequency of the path pattern, is counted. In the training phase, the facility identifies the most salient semantic relation paths between the input words, and extracts their path patterns. The facility then averages the frequencies counted in the training phase for the path patterns extracted for the input words in order to obtain a quantitative measure of the similarity between the input words.
    • 本发明提供了一种用于确定两个输入词之间的相似性的设施,该两个输入字利用在已知是同义词的单词之间出现的词之间出现的路径模式的频率。 设施的优选实施例利用训练阶段和相似性确定阶段。 在训练阶段,设施首先识别多对同义词,即每对同义词之间最突出的语义关系路径。 然后,该设施从这些语义关系路径中提取它们的路径模式,每个路径模式包括一系列方向关系类型。 计算每个路径模式发生在同义词对之间的次数,称为路径模式的频率。 在训练阶段,设备识别输入单词之间最突出的语义关系路径,并提取其路径模式。 然后,该设施对为训练阶段计数的针对输入词提取的路径模式的频率进行平均,以便获得输入单词之间的相似性的定量测量。
    • 8. 发明授权
    • Information retrieval utilizing semantic representation of text and based on constrained expansion of query words
    • 使用文本的语义表示并基于查询词的约束扩展的信息检索
    • US06246977B1
    • 2001-06-12
    • US09368071
    • 1999-08-03
    • John J. MesserlyGeorge E. HeidornStephen D. RichardsonWilliam B. DolanKaren Jensen
    • John J. MesserlyGeorge E. HeidornStephen D. RichardsonWilliam B. DolanKaren Jensen
    • G06F1727
    • G06F17/30684G06F17/271G06F17/277G06F17/2785Y10S707/99932Y10S707/99935
    • The present invention is directed to performing information retrieval utilizing semantic representation of text. In a preferred embodiment, a tokenizer generates from an input string information retrieval tokens that characterize the semantic relationship expressed in the input string. The tokenizer first creates from the input string a primary logical form characterizing a semantic relationship between selected words in the input string. The tokenizer then identifies hypemyms that each have an “is a” relationship with one of the selected words in the input string. The tokenizer then constructs from the primary logical form one or more alternative logical forms. The tokenizer constructs each alternative logical form by, for each of one or more of the selected words in the input string, replacing the selected word in the primary logical form with an identified hypernym of the selected word. Finally, the tokenizer generates tokens representing both the primary logical form and the alternative logical forms. The tokenizer is preferably used to generate tokens for both constructing an index representing target documents and processing a query against that index.
    • 本发明旨在利用文本的语义表示来执行信息检索。 在优选实施例中,标记器从输入字符串生成表征输入字符串中表达的语义关系的信息检索令牌。 标记器首先从输入字符串创建表示输入字符串中所选择的单词之间的语义关系的主逻辑形式。 然后,标记器识别每个与输入字符串中所选择的一个字符之间具有“是”关系的次要词。 然后,标记器从主逻辑形式构造一个或多个替代的逻辑形式。 令牌化器通过输入字符串中的一个或多个所选择的单词中的每个替换逻辑形式来构造每个备选逻辑形式,用所选择的单词的所识别的超级词替换主逻辑形式中的所选择的单词。 最后,tokenizer生成表示主逻辑表单和替代逻辑表单的令牌。 令牌化器优选地用于生成用于构建表示目标文档的索引并针对该索引处理查询的令牌。
    • 9. 发明授权
    • Information retrieval utilizing semantic representation of text
    • 利用文本语义表示的信息检索
    • US6076051A
    • 2000-06-13
    • US886814
    • 1997-03-07
    • John J. MesserlyGeorge E. HeidornStephen D. RichardsonWilliam B. DolanKaren Jensen
    • John J. MesserlyGeorge E. HeidornStephen D. RichardsonWilliam B. DolanKaren Jensen
    • G06F17/27G06F17/30
    • G06F17/30684G06F17/271G06F17/277G06F17/2785Y10S707/99932Y10S707/99935
    • The present invention is directed to performing information retrieval utilizing semantic representation of text. In a preferred embodiment, a tokenizer generates from an input string information retrieval tokens that characterize the semantic relationship expressed in the input string. The tokenizer first creates from the input string a primary logical form characterizing a semantic relationship between selected words in the input string. The tokenizer then identifies hypernyms that each have an "is a" relationship with one of the selected words in the input string. The tokenizer then constructs from the primary logical form one or more alternative logical forms. The tokenizer constructs each alternative logical form by, for each of one or more of the selected words in the input string, replacing the selected word in the primary logical form with an identified hypernym of the selected word. Finally, the tokenizer generates tokens representing both the primary logical form and the alternative logical forms. The tokenizer is preferably used to generate tokens for both constructing an index representing target documents and processing a query against that index.
    • 本发明旨在利用文本的语义表示来执行信息检索。 在优选实施例中,标记器从输入字符串生成表征输入字符串中表达的语义关系的信息检索令牌。 标记器首先从输入字符串创建表示输入字符串中所选择的单词之间的语义关系的主逻辑形式。 然后,标记器识别每个与输入字符串中所选择的一个字符之间具有“是”关系的超文本。 然后,标记器从主逻辑形式构造一个或多个替代的逻辑形式。 令牌化器通过输入字符串中的一个或多个所选择的单词中的每个替换逻辑形式来构造每个备选逻辑形式,用所选择的单词的所识别的超级词替换主逻辑形式中的所选择的单词。 最后,tokenizer生成表示主逻辑表单和替代逻辑表单的令牌。 令牌化器优选地用于生成用于构建表示目标文档的索引并针对该索引处理查询的令牌。
    • 10. 发明授权
    • Identifying salient semantic relation paths between two words
    • 识别两个词之间的突出语义关系路径
    • US06070134A
    • 2000-05-30
    • US904418
    • 1997-07-31
    • Stephen D. RichardsonWilliam B. Dolan
    • Stephen D. RichardsonWilliam B. Dolan
    • G06F17/30
    • G06F17/30734
    • The present invention identifies salient semantic relation paths between two words using a knowledge base. For a group of semantic relations occurring in the knowledge base, the facility models with a mathematical function the relation between a frequency of occurrence of unique semantic relations and the number of unique semantic relations that occur at that frequency. This mathematical function has a vertex frequency identifying a transition point in the mathematical function. The facility then determines the level of salience of unique semantic relations of the group such that the level of salience of unique semantic relations increases with the frequency of occurrence of the unique semantic relations approaches the vertex frequency of the mathematical function with which the relation between the frequency of occurrence of the unique semantic relations and the number of unique semantic relations occurring at that frequency is modeled. The facility is then able to determine the level of salience of a particular path between two words by combining the levels of salience determined for the semantic relations in the path.
    • 本发明使用知识库识别两个词之间的突出语义关系路径。 对于在知识库中发生的一组语义关系,设备模型具有数学函数,即唯一语义关系的发生频率与在该频率发生的唯一语义关系的数量之间的关系。 该数学函数具有标识数学函数中的转变点的顶点频率。 该设备然后确定该组的唯一语义关系的显着性水平,使得独特语义关系的显着性水平随着独特语义关系的出现频率而增加接近数学函数的顶点频率, 建立了唯一语义关系的发生频率和在该频率发生的唯一语义关系的数量。 然后,该设施能够通过组合为路径中的语义关系确定的显着性水平来确定两个单词之间的特定路径的显着性水平。