会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 21. 发明授权
    • Method and system for quantifying the quality of search results based on cohesion
    • 基于凝聚力量化搜索结果质量的方法和系统
    • US07720870B2
    • 2010-05-18
    • US11959182
    • 2007-12-18
    • Luciano BarbosaFlavio JunqueiraVassilis PlachourasRicardo Baeza-Yates
    • Luciano BarbosaFlavio JunqueiraVassilis PlachourasRicardo Baeza-Yates
    • G06F17/30
    • G06F17/3069G06F17/30864
    • A method and system for quantifying the quality of search results from a search engine based on cohesion. The method and system include modeling a set of search engine search results as a cluster and measuring the cohesion of the cluster. In an embodiment, the cohesion of the cluster is the average similarity between the cluster elements to a centroid vector. The centroid vector is the average of the weights of the vectors of the cluster. The similarity between the centroid vector and the cluster's elements is the cosine similarity measure. Each document in the set of search results is represented by a vector where each cell of the vector represents a stemmed word. Each cell has a cell value which is the frequency of the corresponding stemmed word in a document multiplied by a weight that takes into account the location of the stemmed word within the document.
    • 一种用于量化基于内聚的搜索引擎的搜索结果的质量的方法和系统。 该方法和系统包括将一组搜索引擎搜索结果建模为群集并测量群集的内聚。 在一个实施例中,聚类的内聚性是聚类元素与质心向量之间的平均相似度。 质心矢量是聚类向量权重的平均值。 质心向量与簇的元素之间的相似度是余弦相似性度量。 搜索结果集中的每个文档由向量表示,其中向量的每个单元表示一个被干扰的单词。 每个单元格具有一个单元格值,该单元格值是文档中相应词干词的频率乘以一个考虑到文档中的词干词的位置的权重。
    • 23. 发明申请
    • SYSTEM AND METHODOLOGY FOR A MULTI-SITE SEARCH ENGINE
    • 多站点搜索引擎的系统和方法
    • US20100094853A1
    • 2010-04-15
    • US12250929
    • 2008-10-14
    • LUCA TELLOLIFlavio JunqueriaAristides GionisVassilis PlachourasRicardo Baeza-Yates
    • LUCA TELLOLIFlavio JunqueriaAristides GionisVassilis PlachourasRicardo Baeza-Yates
    • G06F7/06G06F17/30
    • G06F17/30864
    • Techniques for query processing in a multi-site search engine are described. During an indexing phase, each site of a multi-site search engine indexes a set of assigned web resources and each site calculates, for each term in the set of assigned web resources, a site-specific upper bound ranking score on the contribution of the term to the search engine ranking function for a query containing the term. During a propagation phase, all sites exchange their site-specific upper bound ranking scores with each other. In response to a site receiving a query, the site determines the set of locally matching resources and compares the ranking score of a locally matching resource with the site-specific upper bound ranking scores for the terms of the query that were received during the propagation phase and determines whether to communicate the query to other sites. By exchanging appropriately defined site-specific upper bound ranking scores, the site initially receiving the query can determine whether the locally matching resources would be identical to the resources obtained from a single-site search system without having to communicate the query to each of the other sites.
    • 描述了在多站点搜索引擎中查询处理的技术。 在索引阶段期间,多站点搜索引擎的每个站点对一组分配的web资源进行索引,并且每个站点针对所分配的web资源集合中的每个术语来计算一个站点特定的上限排名分数 用于包含该术语的查询的搜索引擎排名函数。 在传播阶段,所有的站点彼此交换其站点特定的上限排名得分。 响应于接收到查询的站点,站点确定本地匹配资源的集合,并将本地匹配资源的排名得分与在传播阶段期间接收到的查询的项的站点特定上限排名得分进行比较 并确定是否将查询传递给其他站点。 通过交换适当定义的站点特定上限排名得分,最初接收查询的站点可以确定本地匹配资源是否与从单站点搜索系统获得的资源相同,而不必将查询传递给其他每个 网站。
    • 27. 发明申请
    • SYSTEM FOR REFRESHING CACHE RESULTS
    • 刷新缓存结果的系统
    • US20090204753A1
    • 2009-08-13
    • US12028373
    • 2008-02-08
    • William Havinden Bridge, JR.Flavio P. JunqueiraVassilis Plachouras
    • William Havinden Bridge, JR.Flavio P. JunqueiraVassilis Plachouras
    • G06F12/00
    • G06F12/123G06F12/122
    • A system and method for refreshing a cache based on query responses provided by a searching system in response to queries, includes providing a cache entry for each unique query, if space is available in the cache, and assigning a temperature value to each cache entry based on a frequency of occurrence of the corresponding query An age value is assigned to each cache entry based on a time of last refresh or creation of the corresponding query response. The age of the cache entries is periodically updated, and the temperature of a cache entry is updated when a corresponding query reoccurs. If system resources are available, the query response of a cache entry is refreshed based on the temperature and age of the cache entry. If resources are not available, the refreshing is limited.
    • 一种用于基于由搜索系统响应于查询而提供的查询响应来刷新高速缓存的系统和方法,包括为每个唯一查询提供高速缓存条目,如果高速缓存中有可用的空间,以及为每个高速缓存条目分配温度值 在相应查询的发生频率上根据上次刷新的时间或相应的查询响应的创建将年龄值分配给每个高速缓存条目。 周期性地更新缓存条目的时代,并且当对应的查询重新出现时,缓存条目的温度被更新。 如果系统资源可用,则缓存条目的查询响应将根据缓存条目的温度和年龄进行刷新。 如果资源不可用,刷新是有限的。
    • 28. 发明申请
    • METHOD AND SYSTEM FOR QUANTIFYING THE QUALITY OF SEARCH RESULTS BASED ON COHESION
    • 基于联合搜索结果质量的方法和系统
    • US20090157652A1
    • 2009-06-18
    • US11959182
    • 2007-12-18
    • Luciano BarbosaFlavio JunqueiraVassilis PlachourasRicardo Baeza-Yates
    • Luciano BarbosaFlavio JunqueiraVassilis PlachourasRicardo Baeza-Yates
    • G06F7/00
    • G06F17/3069G06F17/30864
    • A method and system for quantifying the quality of search results from a search engine based on cohesion. The method and system include modeling a set of search engine search results as a cluster and measuring the cohesion of the cluster. In an embodiment, the cohesion of the cluster is the average similarity between the cluster elements to a centroid vector. The centroid vector is the average of the weights of the vectors of the cluster. The similarity between the centroid vector and the cluster's elements is the cosine similarity measure. Each document in the set of search results is represented by a vector where each cell of the vector represents a stemmed word. Each cell has a cell value which is the frequency of the corresponding stemmed word in a document multiplied by a weight that takes into account the location of the stemmed word within the document.
    • 一种用于量化基于内聚的搜索引擎的搜索结果的质量的方法和系统。 该方法和系统包括将一组搜索引擎搜索结果建模为群集并测量群集的内聚。 在一个实施例中,聚类的内聚性是聚类元素与质心向量之间的平均相似度。 质心矢量是聚类向量权重的平均值。 质心向量与簇的元素之间的相似度是余弦相似性度量。 搜索结果集中的每个文档由向量表示,其中向量的每个单元表示一个被干扰的单词。 每个单元格具有一个单元格值,该单元格值是文档中相应词干词的频率乘以一个考虑到文档中的词干词的位置的权重。