会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 26. 发明申请
    • Systems and methods that rank search results
    • 搜索结果排名的系统和方法
    • US20050234904A1
    • 2005-10-20
    • US10820947
    • 2004-04-08
    • Eric BrillJesper LindMarc SmithWensi XiDuncan Davenport
    • Eric BrillJesper LindMarc SmithWensi XiDuncan Davenport
    • G06F7/00G06F17/30
    • G06F17/30675
    • The present invention provides systems and methods that rank search results. Such ranking typically includes determining a relevance of individual search results via one or more feature-based relevance functions. These functions can be tailored to users and/or applications, and typically are based on scoped information (e.g., lexical), digital artifact author related attributes, digital artifact source repository attributes, and/or relationships between features, for example. In addition, relevance functions can be generated via training sets (e.g., machine learning) or initial guesses that are iteratively refined over time. Upon determining relevance, search results can be ordered with respect to one another, based on respective relevances. Additionally, thresholding can be utilized to mitigate returning results likely to be non-relevant to the query, user and/or application.
    • 本发明提供了对搜索结果进行排序的系统和方法。 这种排名通常包括通过一个或多个基于特征的相关性功能来确定各个搜索结果的相关性。 这些功能可以针对用户和/或应用而定制,并且通常基于例如范围限定的信息(例如,词汇),数字人工制品相关属性,数字工件源存储库属性和/或特征之间的关系。 此外,可以通过随时间迭代地改进的训练集(例如,机器学习)或初始猜测来生成相关函数。 在确定相关性之后,可以基于相应的相关性来相对于彼此订购搜索结果。 此外,可以利用阈值来减轻可能与查询,用户和/或应用程序不相关的返回结果。
    • 27. 发明申请
    • Building and using subwebs for focused search
    • 建立和使用子网进行重点搜索
    • US20050165753A1
    • 2005-07-28
    • US10778498
    • 2004-02-13
    • Harr ChenRaman ChandrasekarSimon CorstonEric Brill
    • Harr ChenRaman ChandrasekarSimon CorstonEric Brill
    • G06N99/00G06F17/00G06F17/30G06F7/00
    • G06F17/30867Y10S707/99933
    • A system that facilitates performance of a focused search over a collection of sites comprises a subweb that corresponds to a topic and/or user characteristic(s) that are of interest to the user. The subweb includes a plurality of domains and/or paths (e.g. sites) that are related to the topic and/or the user characteristic(s). Each of the sites within the subweb is assigned a weight that indicates relevance of the site to the desirable topic and/or user characteristic(s). A search engine employs the subweb to facilitate focusing a search over a collection of sites. The search engine receives a query, and utilizes the subweb to focus a search over the selection of sites corresponding to the topic and/or user characteristic(s) represented by the subweb. The results from the search are returned to the user based at least in part upon the relevance weights assigned to the sites within the subweb.
    • 有助于在站点集合上进行聚焦搜索的性能的系统包括对应于用户感兴趣的主题和/或用户特征的子网。 子网包括与主题和/或用户特征相关的多个域和/或路径(例如站点)。 子网站中的每个站点都被分配一个权重,指示站点与期望主题和/或用户特征的相关性。 搜索引擎使用子网站来促进将搜索集中在一系列网站上。 搜索引擎接收查询,并利用子网将搜索集中在与由子网站所代表的主题和/或用户特征相对应的站点的选择上。 至少部分地基于分配给子网站内的站点的相关性权重将搜索结果返回给用户。
    • 28. 发明申请
    • Cost-benefit approach to automatically composing answers to questions by extracting information from large unstructured corpora
    • 通过从大型非结构化语料库中提取信息来自动构成问题答案的成本效益方法
    • US20050033711A1
    • 2005-02-10
    • US10635274
    • 2003-08-06
    • Eric HorvitzDavid AzariSusan DumaisEric Brill
    • Eric HorvitzDavid AzariSusan DumaisEric Brill
    • G06F17/30G06F17/00G06F7/00G06N5/02
    • G06F17/30684G06F17/30687Y10S707/99933
    • The present invention relates to a system and methodology to facilitate extraction of information from a large unstructured corpora such as from the World Wide Web and/or other unstructured sources. Information in the form of answers to questions can be automatically composed from such sources via probabilistic models and cost-benefit analyses to guide resource-intensive information-extraction procedures employed by a knowledge-based question answering system. The analyses can leverage predictions of the ultimate quality of answers generated by the system provided by Bayesian or other statistical models. Such predictions, when coupled with a utility model can provide the system with the ability to make decisions about the number of queries issued to a search engine (or engines), given the cost of queries and the expected value of query results in refining an ultimate answer. Given a preference model, information extraction actions can be taken with the highest expected utility. In this manner, the accuracy of answers to questions can be balanced with the cost of information extraction and analysis to compose the answers.
    • 本发明涉及一种便利从诸如万维网和/或其他非结构化来源的大型非结构化语料库提取信息的系统和方法。 通过概率模型和成本效益分析,可以通过这些来源自动构成问题答案形式的信息,以指导基于知识的问答系统采用的资源密集型信息提取程序。 分析可以利用由贝叶斯或其他统计模型提供的系统生成的答案的最终质量的预测。 当与实用新型相结合时,这种预测可以为系统提供对发出给搜索引擎(或引擎)的查询数量的决定的能力,考虑到查询的成本和查询结果的期望值来提炼最终的 回答。 给定一个偏好模型,可以采用最高预期效用的信息提取动作。 以这种方式,可以将问题答案的准确性与信息提取和分析的成本进行平衡,以构成答案。