Quick Patent Search - quickly search global patents, free patent database for commercial use - IPRDB

1. Granted patent

US07328401B2 Adaptive web crawling using a statistical model (expired)
Publication No.: US07328401B2
Publication date: 2008-02-05
Application No.: US11022054
Filing date: 2004-12-22
Applicants: Kenji C Obata, Dmitriy Meyerzon
Inventors: Kenji C Obata, Dmitriy Meyerzon
IPC classification: G06F7/00, G06F15/16, G06F17/00
CPC classification: G06F17/30864, Y10S707/99931, Y10S707/99933
Abstract: A computer-based system and method of retrieving information pertaining to documents on a computer network is disclosed. The method includes selecting a set of documents to be accessed during a Web crawl by utilizing a statistical model to determine which previously retrieved documents are most likely to have changed since last accessed. The statistical model continuously improves its accuracy by training internal probability distributions to reflect the actual experience with change rate patterns of the documents accessed. The decision whether to access a document is based on the probability of change compared against a desired synchronization level, random selections, maximum limits on the amount of time since the document was last accessed, and other criteria. Once the decision to access is made, the document is checked for changes and this information is used to train the statistical model.
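The access decision described in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the per-document change counts with a Laplace-style prior, the threshold reading of "synchronization level", and the names `DocStats`, `should_access`, `train`, `explore_rate` and `max_age_days` are all assumptions made for illustration.

```python
import random
from dataclasses import dataclass


@dataclass
class DocStats:
    # Pseudo-counts of 1 act as a simple prior (an assumption, not from the patent).
    changed: int = 1
    unchanged: int = 1
    days_since_access: float = 0.0

    def change_probability(self) -> float:
        """Estimated chance that the document changed since the last access."""
        return self.changed / (self.changed + self.unchanged)


def should_access(doc, sync_level=0.5, explore_rate=0.05,
                  max_age_days=30.0, rng=random):
    """Decide whether to fetch a document during this crawl."""
    if doc.days_since_access >= max_age_days:
        return True                      # hard limit on staleness
    if rng.random() < explore_rate:
        return True                      # occasional random selection
    return doc.change_probability() >= sync_level


def train(doc, did_change):
    """Update the document's statistics after it has been fetched and compared."""
    if did_change:
        doc.changed += 1
    else:
        doc.unchanged += 1
    doc.days_since_access = 0.0
```

Each fetch feeds `train`, so documents that change often drift above the threshold and are revisited more frequently, while the random and maximum-age rules keep rarely changing documents from being ignored indefinitely.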

2. Granted patent

US07228301B2 Method for normalizing document metadata to improve search results using an alias relationship directory service (in force)
Publication No.: US07228301B2
Publication date: 2007-06-05
Application No.: US10609315
Filing date: 2003-06-27
Applicants: Dmitriy Meyerzon, Kenji C. Obata
Inventors: Dmitriy Meyerzon, Kenji C. Obata
IPC classification: G06F7/00
CPC classification: G06F17/30067, Y10S707/99933, Y10S707/99935, Y10S707/99942, Y10S707/99943
Abstract: The present invention provides methods, systems, and computer program products for normalizing document search terms through use of an alias database, as may be found in an alias relationship file, such as a directory service. A gatherer module receives as input (or crawls through) several documents in series or in parallel and can recognize data segments as related to one of the aliases in the alias relationship file. The gatherer then associates the document appropriately so that a search engine may find all documents associated with a search term, regardless of whether the term has undergone several name changes (various aliases) over the course of time. Accordingly, a user may then search for a person's name, and receive as a search result all documents listing the person's name, as well as documents listing, for example, only the person's email address.
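As a rough illustration of the alias idea in this abstract, the Python sketch below maps every alias to one identity at indexing time, so that a search on any alias returns all documents tied to that identity. The alias table, the identity keys (`u001`, `u002`), the substring matching, and the function names are hypothetical; a real gatherer would read aliases from a directory service.

```python
from collections import defaultdict

# Hypothetical alias relationship data: alias -> canonical identity.
ALIASES = {
    "jane doe": "u001",
    "jane.doe@example.com": "u001",
    "sam lee": "u002",
}


def build_index(documents, aliases):
    """Associate each document with every identity whose alias it mentions."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        lowered = text.lower()
        for alias, identity in aliases.items():
            if alias in lowered:         # naive substring match, for illustration only
                index[identity].add(doc_id)
    return index


def search(term, index, aliases):
    """Resolve the search term to an identity, then return its documents."""
    identity = aliases.get(term.lower())
    return sorted(index.get(identity, set()))


docs = {
    "d1": "Quarterly report by Jane Doe",
    "d2": "Contact: jane.doe@example.com",
    "d3": "Meeting notes from Sam Lee",
}
index = build_index(docs, ALIASES)
results = search("Jane Doe", index, ALIASES)
```

Here `results` contains both `d1` and `d2`, even though `d2` lists only the email address.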

3. Granted patent

US08812493B2 Search results ranking using editing distance and document information (in force)
Publication No.: US08812493B2
Publication date: 2014-08-19
Application No.: US12101951
Filing date: 2008-04-11
Applicants: Vladimir Tankovich, Hang Li, Dmitriy Meyerzon, Jun Xu
Inventors: Vladimir Tankovich, Hang Li, Dmitriy Meyerzon, Jun Xu
IPC classification: G06F7/00
CPC classification: G06F17/2211, G06F17/30864
Abstract: Architecture for extracting document information from documents received as search results based on a query string, and computing an edit distance between the data string and the query string. The edit distance is employed in determining relevance of the document as part of result ranking by detecting near-matches of a whole query or part of the query. The edit distance evaluates how close the query string is to a given data stream that includes document information such as TAUC (title, anchor text, URL, clicks) information, etc. The architecture includes the index-time splitting of compound terms in the URL to allow the more effective discovery of query terms. Additionally, index-time filtering of anchor text is utilized to find the top N anchors of one or more of the document results. The TAUC information can be input to a neural network (e.g., 2-layer) to improve relevance metrics for ranking the search results.
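To make the edit-distance idea concrete, here is a small Python sketch that computes a standard Levenshtein distance and uses it, at word level, to order titles by closeness to a query. The patent abstract does not specify its exact distance variant or how it feeds the neural network, so the plain Levenshtein choice and the helper `rank_by_title` are assumptions.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (strings or token lists)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        curr = [i]
        for j, y in enumerate(b, 1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def rank_by_title(query, titles):
    """Order titles by word-level edit distance to the query (closest first)."""
    q = query.lower().split()
    return sorted(titles, key=lambda t: edit_distance(q, t.lower().split()))


titles = ["Name search using a ranking function", "Adaptive web crawling"]
ranked = rank_by_title("adaptive web crawling", titles)
```

Working on word tokens rather than characters lets a title that contains the whole query with one extra word score a distance of 1, which is the kind of near-match the abstract mentions.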

4. Granted patent

US08645417B2 Name search using a ranking function (in force)
Publication No.: US08645417B2
Publication date: 2014-02-04
Application No.: US12141082
Filing date: 2008-06-18
Applicants: Dirk H. Groeneveld, Dmitriy Meyerzon, David Mowatt, Jessica A. Alspaugh
Inventors: Dirk H. Groeneveld, Dmitriy Meyerzon, David Mowatt, Jessica A. Alspaugh
IPC classification: G06F7/00, G06F17/30
CPC classification: G06F17/30657, G06F17/30864
Abstract: An approach is described for performing a name search using a name search operation and a ranking operation. The name search operation may take text as input and apply a fuzzy matching operation and a lookup operation to generate a collection of candidate names with respective probability scores. In other cases, speech or handwriting recognition may generate the collection of candidate names and probability scores. The ranking operation may then rank these candidate names using a ranking function. The ranking function may rank the candidate names based on the probability scores associated with the names and at least one other factor. One such factor may reflect whether information provided by a user matches profile information associated with a candidate name under consideration. Another factor may reflect an extent of a nexus between the user and a person associated with the candidate name. Other types of factors can be used.
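The two-stage flow in this abstract (fuzzy matching to score candidates, then ranking with an additional factor) might look roughly like the Python sketch below. The similarity ratio from the standard library's `difflib.SequenceMatcher` stands in for the probability score, and the `colleagues` set with a fixed `nexus_boost` stands in for the user-to-person nexus factor; both are illustrative assumptions rather than the ranking function claimed in the patent.

```python
from difflib import SequenceMatcher


def rank_names(text, directory, colleagues, nexus_boost=0.2):
    """Rank directory names by fuzzy similarity plus a bonus for known contacts."""
    scored = []
    for name in directory:
        # Similarity in [0, 1], used here as a stand-in for a probability score.
        score = SequenceMatcher(None, text.lower(), name.lower()).ratio()
        if name in colleagues:
            score += nexus_boost         # hypothetical nexus factor
        scored.append((score, name))
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for _, name in scored]


directory = ["John Smith", "Jon Smyth", "Ann Lee"]
plain = rank_names("Jon Smith", directory, colleagues=set())
personal = rank_names("Jon Smith", directory, colleagues={"Jon Smyth"})
```

With no nexus information the closest spelling comes first; once "Jon Smyth" is marked as a colleague of the searching user, the boost moves that candidate ahead.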

5. Patent application

US20130110860A1 USER PIPELINE CONFIGURATION FOR RULE-BASED QUERY TRANSFORMATION, GENERATION AND RESULT DISPLAY (in force)
Publication No.: US20130110860A1
Publication date: 2013-05-02
Application No.: US13287717
Filing date: 2011-11-02
Applicants: Viktoriya Taranov, Pedro Dantas DeRose, Victor Poznanski, Yauhen Shnitko, Puneet Narula, Dmitriy Meyerzon
Inventors: Viktoriya Taranov, Pedro Dantas DeRose, Victor Poznanski, Yauhen Shnitko, Puneet Narula, Dmitriy Meyerzon
IPC classification: G06F17/30
CPC classification: G06F17/30448
Abstract: A query pipeline for an enterprise search system is configurable by a user of the system. A user may create rules for custom query transformation and parallel query generation, federation of queries, mixing of results and application of display layouts to the received search results. A user interface (UI) assists a user in configuring the search pipeline. For example, a user may enter condition-action rules for queries that affect how a query is transformed, how parallel queries are generated, how queries are federated, how search results are ranked and displayed, how rules are ordered and the like.

6. Patent application

US20120310928A1 DISCOVERING EXPERTISE USING DOCUMENT METADATA IN PART TO RANK AUTHORS (in force)
Publication No.: US20120310928A1
Publication date: 2012-12-06
Application No.: US13150710
Filing date: 2011-06-01
Applicants: Aninda Ray, Dmitriy Meyerzon
Inventors: Aninda Ray, Dmitriy Meyerzon
IPC classification: G06F17/30, G06F7/00
CPC classification: G06F17/30979
Abstract: Expertise mining features are provided based in part on the use of an expertise mining algorithm and expertise mining queries. A method of an embodiment operates to provide an expanded feedback query based in part on search results using an expertise mining query and a number of author-ranking heuristics used to rank authors and/or co-authors (e.g., primary authors, secondary authors, etc.) as part of an expertise mining operation. A search system of an embodiment includes an author ranker component to rank authors based in part on an expertise mining query and author-ranking heuristics, and a query expander component to provide expanded queries as part of identifying relevant search results. Other embodiments are also disclosed.

7. Granted patent

US08266144B2 Techniques to perform relative ranking for search results (in force)
Publication No.: US08266144B2
Publication date: 2012-09-11
Application No.: US13175043
Filing date: 2011-07-01
Applicants: Vladimir Tankovich, Dmitriy Meyerzon, Michael Taylor, Stephen Robertson
Inventors: Vladimir Tankovich, Dmitriy Meyerzon, Michael Taylor, Stephen Robertson
IPC classification: G06F17/30
CPC classification: G06F17/3053
Abstract: Techniques to perform relative ranking for search results are described. An apparatus may include an enhanced search component operative to receive a search query and provide ranked search results responsive to the search query. The enhanced search component may comprise a resource search module operative to search for resources using multiple search terms from the search query, and output a set of resources having some or all of the search terms. The enhanced search component may also comprise a proximity generation module communicatively coupled to the resource search module, the proximity generation module operative to receive the set of resources, retrieve search term position information for each resource, and generate a proximity feature value based on the search term position information. The enhanced search component may further comprise a resource ranking module communicatively coupled to the resource search module and the proximity generation module, the resource ranking module to receive the proximity feature values, and rank the resources based in part on the proximity feature values. Other embodiments are described and claimed.
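One plausible way to turn search term positions into a proximity feature value is sketched below in Python. The abstract does not state the formula, so the choice here (number of query terms divided by the length of the shortest window that contains all of them) and the names `min_span` and `proximity_feature` are assumptions for illustration only.

```python
def min_span(positions):
    """Length of the shortest window containing every term, or None.

    positions maps each query term to the list of word offsets where it occurs.
    """
    if not positions or any(not plist for plist in positions.values()):
        return None
    events = sorted((pos, term)
                    for term, plist in positions.items() for pos in plist)
    need = len(positions)
    counts = {}
    best = None
    left = 0
    for right_pos, term in events:
        counts[term] = counts.get(term, 0) + 1
        # Shrink the window from the left while it still covers all terms.
        while len(counts) == need:
            left_pos, left_term = events[left]
            width = right_pos - left_pos + 1
            if best is None or width < best:
                best = width
            counts[left_term] -= 1
            if counts[left_term] == 0:
                del counts[left_term]
            left += 1
    return best


def proximity_feature(positions):
    """1.0 when all terms are adjacent, approaching 0 as they spread apart."""
    span = min_span(positions)
    return 0.0 if span is None else len(positions) / span


close = proximity_feature({"search": [2, 40], "ranking": [3, 90]})
far = proximity_feature({"search": [0], "ranking": [9]})
```

A ranking module could then combine this value with other features, so that a resource in which the query terms appear side by side scores higher than one where they are scattered.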

8. Granted patent

US08224847B2 Relevant individual searching using managed property and ranking features (in force)
Publication No.: US08224847B2
Publication date: 2012-07-17
Application No.: US12608181
Filing date: 2009-10-29
Applicants: Boxin Li, Dmitriy Meyerzon, Jessica Alspaugh, Victor Poznanski
Inventors: Boxin Li, Dmitriy Meyerzon, Jessica Alspaugh, Victor Poznanski
IPC classification: G06F17/00
CPC classification: G06F17/30699
Abstract: Embodiments are configured to provide information relevant to individuals of interest to a searching user. In an embodiment, a method includes identifying relevant individuals of a network using a relevance model that includes the use of a number of managed properties and ranking features to identify relevant individuals of a defined network. The relevance model of one embodiment is defined by a schema that includes a textual matching ranking feature, a social distance ranking feature, a levels-to-top ranking feature, and a proximity ranking feature.

9. Patent application

US20120041960A9 RANKING FUNCTIONS USING DOCUMENT USAGE STATISTICS (under examination, published)
Publication No.: US20120041960A9
Publication date: 2012-02-16
Application No.: US12359939
Filing date: 2009-01-26
Applicants: Dmitriy Meyerzon, Hugo Zaragoza, Kyle Peltonen, Andrew DeBruyne
Inventors: Dmitriy Meyerzon, Hugo Zaragoza, Kyle Peltonen, Andrew DeBruyne
IPC classification: G06F17/30
CPC classification: G06F17/30864, G06F17/3053, G06F17/30675, Y10S707/99932, Y10S707/99933, Y10S707/99937, Y10S707/99938
Abstract: Methods of providing a document relevance score to a document on a network are disclosed. Computer readable medium having stored thereon computer-executable instructions for performing a method of providing a document relevance score to a document on a network are also disclosed. Further, computing systems containing at least one application module, wherein the at least one application module comprises application code for performing methods of providing a document relevance score to a document on a network are disclosed.

10. Granted patent

US07974974B2 Techniques to perform relative ranking for search results (in force)
Publication No.: US07974974B2
Publication date: 2011-07-05
Application No.: US12051847
Filing date: 2008-03-20
Applicants: Vladimir Tankovich, Dmitriy Meyerzon, Michael Taylor, Stephen Robertson
Inventors: Vladimir Tankovich, Dmitriy Meyerzon, Michael Taylor, Stephen Robertson
IPC classification: G06F17/30
CPC classification: G06F17/3053
Abstract: Techniques to perform relative ranking for search results are described. An apparatus may include an enhanced search component operative to receive a search query and provide ranked search results responsive to the search query. The enhanced search component may comprise a resource search module operative to search for resources using multiple search terms from the search query, and output a set of resources having some or all of the search terms. The enhanced search component may also comprise a proximity generation module communicatively coupled to the resource search module, the proximity generation module operative to receive the set of resources, retrieve search term position information for each resource, and generate a proximity feature value based on the search term position information. The enhanced search component may further comprise a resource ranking module communicatively coupled to the resource search module and the proximity generation module, the resource ranking module to receive the proximity feature values, and rank the resources based in part on the proximity feature values. Other embodiments are described and claimed.
