会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明授权
    • Query modification
    • 查询修改
    • US08819000B1
    • 2014-08-26
    • US13461315
    • 2012-05-01
    • Anurag AcharyaAlexandre A. Verstak
    • Anurag AcharyaAlexandre A. Verstak
    • G06F17/30
    • G06F17/30G06F17/30672G06F17/30864
    • Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for query modification. In one aspect, a method includes receiving an original query including a first limitation. First search results responsive to a modified query are obtained, where the first limitation has been omitted from the modified query. One or more common characteristics shared by two or more resources are identified. Each of the two or more resources corresponds to a different highly-ranked result of the first search results. A second modified query including the original query and a second limitation representing the one or more common characteristics is generated. Second search results responsive to the second modified query are obtained. The second search results are provided in a response to the original query.
    • 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于查询修改。 一方面,一种方法包括接收包括第一限制的原始查询。 获得响应于修改查询的第一搜索结果,其中已经从修改的查询中省略了第一个限制。 识别由两个或多个资源共享的一个或多个共同特征。 两个或更多个资源中的每一个对应于第一搜索结果的不同高度排名的结果。 生成包括原始查询和表示一个或多个共同特征的第二限制的第二修改查询。 获得响应于第二修改查询的第二搜索结果。 响应于原始查询提供第二个搜索结果。
    • 8. 发明授权
    • Scheduler for search engine crawler
    • 搜索引擎抓取器的计划程序
    • US08042112B1
    • 2011-10-18
    • US10882956
    • 2004-06-30
    • Huican ZhuMaximilian IbelAnurag AcharyaHoward Bradley Gobioff
    • Huican ZhuMaximilian IbelAnurag AcharyaHoward Bradley Gobioff
    • G06F9/46G06F7/00
    • G06F17/30864
    • A search engine crawler includes a distributed set of schedulers that are associated with one or more segments of document identifiers (e.g., URLs) corresponding to documents on a network (e.g., WWW). Each scheduler handles the scheduling of document identifiers (for crawling) for a subset of the known document identifiers. Using a starting set of document identifiers, such as the document identifiers crawled (or scheduled for crawling) during the most recent completed crawl, the scheduler removes from the starting set those document identifiers that have been unreachable in each of the last X crawls. Other filtering mechanisms may also be used to filter out some of the document identifiers in the starting set. The resulting list of document identifiers is written to a scheduled output file for use in a next crawl cycle.
    • 搜索引擎爬行器包括与一个或多个文档标识符(例如,URL)相关联的分布式的一组调度器,对应于网络上的文档(例如,WWW)。 每个调度器处理已知文档标识符的子集的文档标识符(用于爬行)的调度。 使用文档标识符的起始集合,例如在最近完成的爬网期间爬行(或计划进行爬网)的文档标识符,调度程序从起始设置中删除那些在最后一次X爬网中的每一个中都无法访问的文档标识符。 其他过滤机制也可用于过滤出起始集中的一些文档标识符。 生成的文档标识符列表将写入一个预定的输出文件,以供下一个爬网周期使用。
    • 10. 发明授权
    • Search engine cache control
    • 搜索引擎缓存控制
    • US07840557B1
    • 2010-11-23
    • US10845283
    • 2004-05-12
    • Benjamin T. SmithAnurag Acharya
    • Benjamin T. SmithAnurag Acharya
    • G06F7/00G06F17/30
    • G06F12/0875
    • A search query containing at least one term is received at a search controller from a query server and preferably normalized and hashed into a representation of the search query. The representation of the search query is transmitted towards a cache containing multiple query result entries. Each query result entry contains a list of documents associated with the previously searched search query. The cache is then searched and query result entries for the search query are sent to the search controller from the cache. Subsequently, it is determined whether the query result entries are current versions for the search query. If the query result entries are not the current versions, then current versions of the query result entries are obtained.
    • 包含至少一个术语的搜索查询在搜索控制器处从查询服务器接收,并且优选地被标准化并被散列成搜索查询的表示。 搜索查询的表示被发送到包含多个查询结果条目的高速缓存。 每个查询结果条目包含与先前搜索的搜索查询相关联的文档列表。 然后搜索缓存,并将搜索查询的查询结果条目从缓存发送到搜索控制器。 随后,确定查询结果条目是否是用于搜索查询的当前版本。 如果查询结果条目不是当前版本,则获取当前版本的查询结果条目。