会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Adaptive web crawling using a statistical model
    • 使用统计模型的自适应网络爬网
    • US07328401B2
    • 2008-02-05
    • US11022054
    • 2004-12-22
    • Kenji C ObataDmitriy Meyerzon
    • Kenji C ObataDmitriy Meyerzon
    • G06F7/00G06F15/16G06F17/00
    • G06F17/30864Y10S707/99931Y10S707/99933
    • A computer based system and method of retrieving information pertaining to documents on a computer network is disclosed. The method includes selecting a set of documents to be accessed during a Web crawl by utilizing a statistical model to determine which previously retrieved documents are most likely to have changed since last accessed. The statistical model is continuously improving its accuracy by training internal probability distributions to reflect the actual experience with change rate patterns of the documents accessed. The decision made whether to access the document is based on the probability of change compared against a desired synchronization level, random selections, maximum limits on the amount of time since the document was last accessed, and other criterion. Once the decision to access is made, the document is checked for changes and this information is used to train the statistical model.
    • 公开了一种基于计算机的系统和检索与计算机网络上的文件有关的信息的方法。 该方法包括通过利用统计模型来选择要在Web爬行期间访问的一组文档,以确定先前检索到的文档最近可能自上次访问以来发生变化。 统计模型通过训练内部概率分布来不断提高其准确性,以反映所访问文件的变化率模式的实际经验。 是否访问文档的决定是基于与所需同步级别进行比较的更改概率,随机选择,自上次访问文档以来的时间量的最大限制以及其他标准。 一旦作出决定,将对文件进行更改检查,并将此信息用于训练统计模型。
    • 2. 发明授权
    • Method for normalizing document metadata to improve search results using an alias relationship directory service
    • 使用别名关系目录服务来归一化文档元数据以改进搜索结果的方法
    • US07228301B2
    • 2007-06-05
    • US10609315
    • 2003-06-27
    • Dmitriy MeyerzonKenji C. Obata
    • Dmitriy MeyerzonKenji C. Obata
    • G06F7/00
    • G06F17/30067Y10S707/99933Y10S707/99935Y10S707/99942Y10S707/99943
    • The present invention provides methods, systems, and computer program products for normalizing document search terms through use of an alias database, as may be found in an alias relationship file, such as a directory service. A gatherer module receives as input (or crawls through) several documents in series or in parallel and can recognize data segments as related to one of the aliases in the alias relationship file. The gatherer then associates the document appropriately so that a search engine may find all documents associated with a search term, regardless of whether the term has undergone several name changes (various aliases) over the course of time. Accordingly, a user may then search for a person's name, and receive as a search result all documents listing the person's name, as well as documents listing, for example, only the person's email address.
    • 本发明提供了用于通过使用别名数据库对文档搜索项进行归一化的方法,系统和计算机程序产品,如在别名关系文件(例如目录服务)中可以找到的。 采集器模块以串联或并行方式接收多个文档的输入(或爬网),并且可以识别与别名关系文件中的其中一个别名相关的数据段。 收集者然后适当地关联文档,使得搜索引擎可以查找与搜索词相关联的所有文档,而不管该术语是否在时间上经历了多个名称改变(各种别名)。 因此,用户可以搜索个人的姓名,并且作为搜索结果接收列出该人的姓名的所有文件,以及例如仅列出该人的电子邮件地址的文档。
    • 3. 发明授权
    • Search results ranking using editing distance and document information
    • 使用编辑距离和文档信息搜索结果排名
    • US08812493B2
    • 2014-08-19
    • US12101951
    • 2008-04-11
    • Vladimir TankovichHang LiDmitriy MeyerzonJun Xu
    • Vladimir TankovichHang LiDmitriy MeyerzonJun Xu
    • G06F7/00
    • G06F17/2211G06F17/30864
    • Architecture for extracting document information from documents received as search results based on a query string, and computing an edit distance between the data string and the query string. The edit distance is employed in determining relevance of the document as part of result ranking by detecting near-matches of a whole query or part of the query. The edit distance evaluates how close the query string is to a given data stream that includes document information such as TAUC (title, anchor text, URL, clicks) information, etc. The architecture includes the index-time splitting of compound terms in the URL to allow the more effective discovery of query terms. Additionally, index-time filtering of anchor text is utilized to find the top N anchors of one or more of the document results. The TAUC information can be input to a neural network (e.g., 2-layer) to improve relevance metrics for ranking the search results.
    • 用于基于查询字符串从作为搜索结果接收的文档提取文档信息的结构,以及计算数据串和查询字符串之间的编辑距离。 编辑距离用于通过检测整个查询或部分查询的近似匹配来确定文档作为结果排名的一部分的相关性。 编辑距离评估查询字符串与包含诸如TAUC(标题,锚文本,URL,点击)信息等文档信息的给定数据流的距离。该体系结构包括索引时间分割URL中的复合术语 以便更有效地发现查询条款。 另外,使用锚文本的索引时间过滤来查找一个或多个文档结果的前N个锚点。 可以将TAUC信息输入到神经网络(例如,2层),以改进用于对搜索结果排序的相关性度量。
    • 4. 发明授权
    • Name search using a ranking function
    • 使用排序功能命名搜索
    • US08645417B2
    • 2014-02-04
    • US12141082
    • 2008-06-18
    • Dirk H. GroeneveldDmitriy MeyerzonDavid MowattJessica A. Alspaugh
    • Dirk H. GroeneveldDmitriy MeyerzonDavid MowattJessica A. Alspaugh
    • G06F7/00G06F17/30
    • G06F17/30657G06F17/30864
    • An approach is described for performing a name search using a name search operation and a ranking operation. The name search operation may take text as input and apply a fuzzy matching operation and a lookup operation to generate a collection of candidate names with respective probability scores. In other cases, speech or handwriting recognition may generate the collection of candidate names and probability scores. The ranking operation may then rank these candidate names using a ranking function. The ranking function may rank the candidate names based on the probability scores associated with the names and at least one other factor. One such factor may reflect whether information provided by a user matches profile information associated with a candidate name under consideration. Another factor may reflect an extent of a nexus between the user and a person associated with the candidate name. Other types of factors can be used.
    • 描述了使用名称搜索操作和排序操作执行姓名搜索的方法。 名称搜索操作可以将文本作为输入并应用模糊匹配操作和查找操作以生成具有相应概率得分的候选名称的集合。 在其他情况下,语音或手写识别可能产生候选名称和概率分数的集合。 然后,排序操作可以使用排序函数对这些候选名称进行排名。 排名函数可以基于与名称和至少一个其他因素相关联的概率分数对候选名称进行排名。 一个这样的因素可以反映用户提供的信息是否匹配与考虑的候选名称相关联的简档信息。 另一个因素可能反映了用户与与候选人名称相关联的人之间的关联程度。 可以使用其他类型的因素。
    • 6. 发明申请
    • DISCOVERING EXPERTISE USING DOCUMENT METADATA IN PART TO RANK AUTHORS
    • 发现使用文件元数据的部分作者
    • US20120310928A1
    • 2012-12-06
    • US13150710
    • 2011-06-01
    • Aninda RayDmitriy Meyerzon
    • Aninda RayDmitriy Meyerzon
    • G06F17/30G06F7/00
    • G06F17/30979
    • Expertise mining features are provided based in part on the use of an expertise mining algorithm and expertise mining queries. A method of an embodiment operates to provide an expanded feedback query based in part on search results using an expertise mining query and a number of author-ranking heuristics used to rank authors and/or co-authors (e.g., primary authors, secondary authors, etc.) as part of an expertise mining operation. A search system of an embodiment includes an author ranker component to rank authors based in part on an expertise mining query and author-ranking heuristics, and a query expander component to provide expanded queries as part of identifying relevant search results. Other embodiments are also disclosed.
    • 专业挖掘功能部分基于专业挖掘算法和专业挖掘查询的使用而提供。 实施例的方法用于使用专业知识挖掘查询和用于对作者和/或共同作者进行排名的多个作者排名启发法(例如,主要作者,次要作者, 等等)作为专业挖掘操作的一部分。 实施例的搜索系统包括作者角色组件,其部分地基于专业挖掘查询和作者排名启发式排序作者,以及查询扩展器组件,用于提供扩展查询作为标识相关搜索结果的一部分。 还公开了其他实施例。
    • 7. 发明授权
    • Techniques to perform relative ranking for search results
    • 执行搜索结果相对排名的技术
    • US08266144B2
    • 2012-09-11
    • US13175043
    • 2011-07-01
    • Vladimir TankovichDmitriy MeyerzonMichael TaylorStephen Robertson
    • Vladimir TankovichDmitriy MeyerzonMichael TaylorStephen Robertson
    • G06F17/30
    • G06F17/3053
    • Techniques to perform relative ranking for search results are described. An apparatus may include an enhanced search component operative to receive a search query and provide ranked search results responsive to the search query. The enhanced search component may comprise a resource search module operative to search for resources using multiple search terms from the search query, and output a set of resources having some or all of the search terms. The enhanced search component may also comprise a proximity generation module communicatively coupled to the resource search module, the proximity generation module operative to receive the set of resources, retrieve search term position information for each resource, and generate a proximity feature value based on the search term position information. The enhanced search component may further comprise a resource ranking module communicatively coupled to the resource search module and the proximity generation module, the resource ranking module to receive the proximity feature values, and rank the resources based in part on the proximity feature values. Other embodiments are described and claimed.
    • 描述了对搜索结果执行相对排名的技术。 装置可以包括增强的搜索组件,其操作以接收搜索查询并且响应于搜索查询提供排名的搜索结果。 增强搜索组件可以包括资源搜索模块,其可操作以使用来自搜索查询的多个搜索项来搜索资源,并且输出具有部分或全部搜索项的一组资源。 增强搜索组件还可以包括通信地耦合到资源搜索模块的邻近生成模块,用于接收资源集合的邻近生成模块,检索每个资源的搜索项位置信息,以及基于搜索生成接近特征值 期限位置信息。 增强搜索组件还可以包括资源排序模块,其通信地耦合到资源搜索模块和邻近生成模块,用于接收邻近特征值的资源排名模块,以及部分地基于邻近特征值对资源进行排名。 描述和要求保护其他实施例。
    • 10. 发明授权
    • Techniques to perform relative ranking for search results
    • 执行搜索结果相对排名的技术
    • US07974974B2
    • 2011-07-05
    • US12051847
    • 2008-03-20
    • Vladimir TankovichDmitriy MeyerzonMichael TaylorStephen Robertson
    • Vladimir TankovichDmitriy MeyerzonMichael TaylorStephen Robertson
    • G06F17/30
    • G06F17/3053
    • Techniques to perform relative ranking for search results are described. An apparatus may include an enhanced search component operative to receive a search query and provide ranked search results responsive to the search query. The enhanced search component may comprise a resource search module operative to search for resources using multiple search terms from the search query, and output a set of resources having some or all of the search terms. The enhanced search component may also comprise a proximity generation module communicatively coupled to the resource search module, the proximity generation module operative to receive the set of resources, retrieve search term position information for each resource, and generate a proximity feature value based on the search term position information. The enhanced search component may further comprise a resource ranking module communicatively coupled to the resource search module and the proximity generation module, the resource ranking module to receive the proximity feature values, and rank the resources based in part on the proximity feature values. Other embodiments are described and claimed.
    • 描述了对搜索结果执行相对排名的技术。 装置可以包括增强的搜索组件,其操作以接收搜索查询并且响应于搜索查询提供排名的搜索结果。 增强搜索组件可以包括资源搜索模块,其可操作以使用来自搜索查询的多个搜索项来搜索资源,并且输出具有部分或全部搜索项的一组资源。 增强搜索组件还可以包括通信地耦合到资源搜索模块的邻近生成模块,用于接收资源集合的邻近生成模块,检索每个资源的搜索项位置信息,以及基于搜索生成接近特征值 期限位置信息。 增强搜索组件还可以包括资源排序模块,其通信地耦合到资源搜索模块和邻近生成模块,用于接收邻近特征值的资源排名模块,以及部分地基于邻近特征值对资源进行排名。 描述和要求保护其他实施例。