会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Updating search engine document index based on calculated age of changed portions in a document
    • 根据文档中已更改部分的计算年龄更新搜索引擎文档索引
    • US08423885B1
    • 2013-04-16
    • US13209593
    • 2011-08-15
    • Joachim KupkeJeff Cox
    • Joachim KupkeJeff Cox
    • G06F17/00G06F17/30
    • G06F17/30864
    • A system receives a document that includes new content and aged content, and compares the document with a prior version of the document that includes the aged content but not the new content. The system also separates the new content and the aged content based on the comparison, determines ages associated with the new content and the aged content, and determines whether the ages of the new content and the aged content are greater than or equal to an age threshold. The system further calculates a checksum of the document based on the aged content when the age of the aged content is greater than or equal to the age threshold, and the age of the new content is less than the age threshold, and stores the calculated checksum.
    • 系统接收包含新内容和老化内容的文档,并将文档与包含老化内容但不包含新内容的文档的先前版本进行比较。 该系统还基于比较分离新内容和老化内容,确定与新内容相关联的年龄和老化内容,并确定新内容和老年内容的年龄是否大于或等于年龄阈值 。 当老龄化的年龄大于或等于年龄阈值时,该系统还基于老化内容计算文档的校验和,并且新内容的年龄小于年龄阈值,并存储所计算的校验和 。
    • 2. 发明授权
    • Updating search engine document index based on calculated age of changed portions in a document
    • 根据文档中已更改部分的计算年龄更新搜索引擎文档索引
    • US08001462B1
    • 2011-08-16
    • US12363529
    • 2009-01-30
    • Joachim KupkeJeff Cox
    • Joachim KupkeJeff Cox
    • G06F17/00G06F17/30
    • G06F17/30864
    • A system receives a document that includes new content and aged content, and compares the document with a prior version of the document that includes the aged content but not the new content. The system also separates the new content and the aged content based on the comparison, determines ages associated with the new content and the aged content, and determines whether the ages of the new content and the aged content are greater than or equal to an age threshold. The system further calculates a checksum of the document based on the aged content when the age of the aged content is greater than or equal to the age threshold, and the age of the new content is less than the age threshold, and stores the calculated checksum.
    • 系统接收包含新内容和老化内容的文档,并将文档与包含老化内容但不包含新内容的文档的先前版本进行比较。 该系统还基于比较分离新内容和老化内容,确定与新内容相关联的年龄和老化内容,并确定新内容和老年内容的年龄是否大于或等于年龄阈值 。 当老龄化的年龄大于或等于年龄阈值时,该系统还基于老化内容计算文档的校验和,并且新内容的年龄小于年龄阈值,并且存储所计算的校验和 。
    • 3. 发明授权
    • Detection of proxy pad sites
    • 代理服务器站点检测
    • US08874565B1
    • 2014-10-28
    • US12345188
    • 2008-12-29
    • Rupesh KapoorDavid Michael ProudfootJoachim Kupke
    • Rupesh KapoorDavid Michael ProudfootJoachim Kupke
    • G06F17/30
    • G06F17/30613G06F17/3053G06F17/3071G06F17/30864
    • A system may identify a set of first documents associated with an organization, and identify clusters to which the first documents belong. Each of a number of the identified clusters may include a group of documents that includes one of the first documents and one or more second documents associated with one or more different organizations. The system may determine a quality score for each of the documents in each of the identified clusters, and determine, for each of the number of the identified clusters, whether the quality score of the one of the first documents in the identified cluster is higher than the quality score of the one or more second documents in the identified cluster. The system may generate a proxy pad score based on the determinations, and store the proxy pad score.
    • 系统可以标识与组织相关联的一组第一文档,并且识别第一文档所属的群集。 多个所识别的集群中的每一个可以包括一组文档,其包括第一文档之一和与一个或多个不同组织相关联的一个或多个第二文档。 所述系统可以确定每个所识别的集群中的每个文档的质量得分,并且对于所识别的集群中的每一个,确定所识别的集群中的所述第一文档之一的质量得分是否高于 所识别的群集中的一个或多个第二个文档的质量得分。 该系统可以基于确定产生代理贴片分数,并存储代理贴片分数。
    • 4. 发明授权
    • Detection of bounce pad sites
    • 检测弹跳垫位置
    • US08521746B1
    • 2013-08-27
    • US13226565
    • 2011-09-07
    • Rupesh KapoorDavid Michael ProudfootJoachim Kupke
    • Rupesh KapoorDavid Michael ProudfootJoachim Kupke
    • G06F17/30
    • G06F17/30864
    • A system may identify a set of related documents, identify one or more documents in the set of related documents that are sources of redirects, and identify organizations that are targets of the redirects. The system may also determine a redirect score based on the number of the identified documents that are sources of the redirects, determine a spam score based on a number of the organizations that are targets of the redirects, determine whether to classify the set of related documents as a bounce pad based on the redirect score and the spam score, and storing information associated with the result of the determination of whether to classify the set of related documents as a bounce pad.
    • 系统可以识别一组相关文档,识别作为重定向源的相关文档集合中的一个或多个文档,并且识别作为重定向目标的组织。 系统还可以基于作为重定向源的所识别的文档的数量来确定重定向分数,基于作为重定向的目标的组织的数量确定垃圾邮件分数,确定是否对该组相关文档进行分类 作为基于重定向分数和垃圾邮件分数的反弹垫,并且存储与确定是否将该组相关文档分类为反弹垫的结果相关联的信息。
    • 5. 发明授权
    • Detection of bounce pad sites
    • 检测弹跳垫位置
    • US08037073B1
    • 2011-10-11
    • US12345203
    • 2008-12-29
    • Rupesh KapoorDavid Michael ProudfootJoachim Kupke
    • Rupesh KapoorDavid Michael ProudfootJoachim Kupke
    • G06F17/30
    • G06F17/30864
    • A system may identify a set of related documents, identify one or more documents in the set of related documents that are sources of redirects, and identify organizations that are targets of the redirects. The system may also determine a redirect score based on the number of the identified documents that are sources of the redirects, determine a spam score based on a number of the organizations that are targets of the redirects, determine whether to classify the set of related documents as a bounce pad based on the redirect score and the spam score, and storing information associated with the result of the determination of whether to classify the set of related documents as a bounce pad.
    • 系统可以识别一组相关文档,识别作为重定向源的相关文档集合中的一个或多个文档,并且识别作为重定向目标的组织。 系统还可以基于作为重定向源的所识别的文档的数量来确定重定向分数,基于作为重定向的目标的组织的数量来确定垃圾邮件分数,确定是否对该组相关文档进行分类 作为基于重定向分数和垃圾邮件分数的反弹垫,并且存储与确定是否将该组相关文档分类为反弹垫的结果相关联的信息。
    • 6. 发明授权
    • Clustering by previous representative
    • 以前的代表聚集
    • US07836108B1
    • 2010-11-16
    • US12059628
    • 2008-03-31
    • Joachim KupkeDavid Michael Proudfoot
    • Joachim KupkeDavid Michael Proudfoot
    • G06F17/30G06F12/00
    • G06F17/30864G06F17/30705
    • A method may include identifying documents in a current clustering operation, assigning the identified documents to one or more clusters, selecting a current representative document for each of the one or more clusters, determining whether the current representative document has been re-crawled, determining a previous representative document with which the current representative document was previously associated in a prior clustering operation, if it is determined that the current representative document has not been re-crawled, determining one of the one or more clusters to which the previous representative document has been assigned in the current clustering operation, combining one of the one or more clusters associated with the current representative document that has not been re-crawled with the one of the one or more clusters associated with the previous representative document into a combined cluster, and storing information regarding the combined cluster.
    • 方法可以包括在当前聚类操作中识别文档,将所识别的文档分配给一个或多个聚类,为一个或多个聚类中的每一个选择当前代表性文档,确定当前代表文档是否已被重新爬行,确定 如果确定当前代表性文档未被重新爬行,则确定当前代表性文档之前与之前相关联的代表性文档的前一代表性文档,确定先前代表文档已经被分配到的一个或多个聚类中的一个 在当前的聚类操作中分配,将与未被重新爬行的当前代表性文档相关联的一个或多个集群中的一个与与先前代表性文档相关联的一个或多个集群中的一个集成到组合集群中,并存储 关于组合集群的信息。