    • 2. Granted invention patent
    • Scheduling resource crawls
    • US08868541B2
    • 2014-10-21
    • US13011426
    • 2011-01-21
    • Zhen Lin, Keith Stevens
    • Zhen Lin, Keith Stevens
    • G06F17/30
    • G06F17/30864
    • Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for scheduling resource crawls. In one aspect, a framework is provided for scheduling resource crawls such that a crawl scheduler determines the health of a document, i.e., whether it can be crawled, the popularity of the document, and the frequency of “interesting,” i.e., substantive, content changes, and based on this information, estimates an appropriate crawl interval for each web resource to improve crawl resource utilization.
    • 4. Invention patent application
    • SCHEDULING RESOURCE CRAWLS
    • US20130144858A1
    • 2013-06-06
    • US13011426
    • 2011-01-21
    • Zhen Lin, Keith Stevens
    • Zhen Lin, Keith Stevens
    • G06F17/30
    • G06F17/30864
    • Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for scheduling resource crawls. In one aspect, a framework is provided for scheduling resource crawls such that a crawl scheduler determines the health of a document, i.e., whether it can be crawled, the popularity of the document, and the frequency of “interesting,” i.e., substantive, content changes, and based on this information, estimates an appropriate crawl interval for each web resource to improve crawl resource utilization.
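The abstract names three inputs to the crawl scheduler (document health, popularity, and the frequency of substantive content changes) but gives no formula for combining them. The following is a minimal sketch of one way such an estimate could look. Everything in it is an assumption made for illustration, not the patented method: the `ResourceStats` fields, the interval bounds, and the rule that divides the average time between substantive changes by `1 + popularity`.

```python
from dataclasses import dataclass

# Assumed bounds; the abstract does not specify any limits.
MIN_INTERVAL_HOURS = 1.0
MAX_INTERVAL_HOURS = 24.0 * 30  # 30 days


@dataclass
class ResourceStats:
    """Hypothetical per-resource signals, named after the abstract's three inputs."""
    crawlable: bool            # "health": can the document be crawled at all?
    popularity: float          # assumed to be normalized to [0, 1]
    substantive_changes: int   # number of "interesting" content changes observed
    observation_hours: float   # time span over which those changes were observed


def estimate_crawl_interval(stats: ResourceStats) -> float:
    """Return an estimated crawl interval in hours (illustrative heuristic only)."""
    # Unhealthy documents are retried as rarely as the bounds allow.
    if not stats.crawlable:
        return MAX_INTERVAL_HOURS

    # Base interval: average time between substantive changes.
    if stats.substantive_changes <= 0 or stats.observation_hours <= 0:
        base = MAX_INTERVAL_HOURS
    else:
        base = stats.observation_hours / stats.substantive_changes

    # More popular documents get shorter intervals (at most halved here).
    popularity = min(max(stats.popularity, 0.0), 1.0)
    scaled = base / (1.0 + popularity)

    # Clamp to the assumed bounds.
    return min(max(scaled, MIN_INTERVAL_HOURS), MAX_INTERVAL_HOURS)


if __name__ == "__main__":
    page = ResourceStats(crawlable=True, popularity=1.0,
                         substantive_changes=10, observation_hours=240.0)
    print(estimate_crawl_interval(page))
```

In this sketch a document that changes substantively about once a day and has maximum popularity is scheduled roughly twice a day, while a document that cannot be crawled falls back to the longest interval, so crawl capacity goes to resources where a fetch is likely to find new content.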