会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明授权
    • Apparatus and methods for classification of web sites
    • 网站分类的设备和方法
    • US07792951B2
    • 2010-09-07
    • US10315705
    • 2002-12-10
    • Nagui HalimZhen LiuMark Steven SquillanteHonghui XiaShun-Zheng YuLi Zhang
    • Nagui HalimZhen LiuMark Steven SquillanteHonghui XiaShun-Zheng YuLi Zhang
    • G06F15/173
    • G06F17/3071
    • Apparatus and methods for classifying web sites are provided. With the apparatus and methods, traffic data is obtained for a plurality of web sites. This patterns, or templates, for each web site are generated based on this traffic data and the patterns are clustered into classes of web sites using a clustering algorithm. The clusters, or classes, are then profiled to generate a template for each class. The template for each class is generated by first shifting the patterns for each web site that is part of the class to compensate for effects like time zone differences, if any, and then identifying a pattern that is most similar to all of the patterns in the class. Once the template for each class is generated, this template is then used with traffic data from a new web site to classify the new web site into one of the existing classes. In other words, when traffic data for a new web site is received, a pattern for the traffic data of the new web site is generated and compared to the templates for the various classes. If a matching class template is identified, the new web site is classified into the corresponding class. If the pattern for the new web site does not match any of the existing templates, a new template and class may be generated based on the pattern for the new web site.
    • 提供了分类网站的装置和方法。 利用该装置和方法,获得多个网站的交通数据。 基于该流量数据生成每个网站的这种模式或模板,并且使用聚类算法将模式聚类成网站类。 然后,对集群或类进行概要分析以为每个类生成一个模板。 每个类的模板是通过首先移动作为类的一部分的每个网站的模式来生成的,以补偿诸如时​​区差异的效果(如果有的话),然后识别最相似于所有模式中的模式 类。 一旦生成了每个类的模板,该模板随后与来自新网站的流量数据一起使用,将新网站分类到现有的一个类中。 换句话说,当接收到新的网站的交通数据时,生成用于新网站的交通数据的模式,并与各种类别的模板进行比较。 如果识别出匹配的类模板,则将新的网站分类到相应的类中。 如果新网站的模式与任何现有模板不匹配,则可能会根据新网站的模式生成新的模板和类。
    • 6. 发明授权
    • Computer resource proportional utilization and response time scheduling
    • 计算机资源比例利用和响应时间调度
    • US06263359B1
    • 2001-07-17
    • US08862044
    • 1997-05-22
    • Liana Liyow FongMark Steven SquillanteRoger Eldred Hough
    • Liana Liyow FongMark Steven SquillanteRoger Eldred Hough
    • G06F900
    • G06F9/4881
    • A method of scheduling jobs to be executed by a resource in a computer system wherein the jobs are grouped in “classes.” The job classes vying for the resource's attention are arranged in a hierarchy. Each job class has a time-function value that controls when the job class is selected by the resource if processing time becomes available. Within a particular level of the hierarchy, scheduling priorities are defined by one or more time-based functions, each of which may be constant or dynamically varying. When constant time-based functions are used, each job class has a schedule value that remains fixed with time. When dynamic time-based functions are used, job class “time-function values” are modified to alter the timing by which the job class(es) acquire the resource.
    • 调度由计算机系统中的资源执行的作业的方法,其中作业被分组为“类”。 争取资源注意力的工作阶层排列在一个层次结构中。 每个作业类都有一个时间函数值,用于在处理时间变得可用时控制资源选择何时作业类。 在层级的特定级别内,调度优先级由一个或多个基于时间的函数定义,每个函数可以是常数或动态变化的。 当使用基于时间的常量函数时,每个作业类都有一个随时间保持固定的调度值。 当使用基于动态时间的函数时,修改作业类“时间函数值”以改变作业类获取资源的时间。
    • 7. 发明授权
    • Method and apparatus for web crawler data collection
    • 网页抓取器数据采集的方法和装置
    • US07454410B2
    • 2008-11-18
    • US10434653
    • 2003-05-09
    • Mark Steven SquillanteJoel Leonard WolfPhilip Shi-Lung Yu
    • Mark Steven SquillanteJoel Leonard WolfPhilip Shi-Lung Yu
    • G06F17/30
    • G06F17/30864Y10S707/99932Y10S707/99933Y10S707/99935
    • A Web crawler data collection method is provided for collecting information associated with a plurality of queries, which is used to calculate estimates of return probabilities, clicking probabilities and incorrect response probabilities. The estimated return probabilities relate to a probability that a search engine will return a particular Web page in a particular position of a particular query result page. The estimated clicking probabilities relate to a frequency with which a client selects a returned Web page in a particular position of a particular query result. The estimated incorrect response probabilities relate to the probability that a query to a stale version of a particular Web page yields an incorrect or vacuous response. Further, information may be collected regarding the characteristics and update time distributions of a plurality of Web pages.
    • 提供了一种网络爬虫数据收集方法,用于收集与多个查询相关联的信息,其用于计算返回概率的估计,点击概率和不正确的响应概率。 估计的返回概率与搜索引擎将在特定查询结果页的特定位置返回特定网页的概率有关。 估计的点击概率与客户端在特定查询结果的特定位置中选择返回的网页的频率有关。 估计的不正确的响应概率与对特定网页的陈旧版本的查询产生不正确或空虚的响应的概率有关。 此外,可以收集关于多个网页的特征和更新时间分布的信息。