会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明申请
    • METHOD AND SYSTEM FOR WEB EXTRACTION
    • 网络提取的方法和系统
    • US20120005207A1
    • 2012-01-05
    • US12828305
    • 2010-07-01
    • Pankaj GulhaneSrinivasan Hanumantha Rao SengameduAshwin TengliRajeev Rastogi
    • Pankaj GulhaneSrinivasan Hanumantha Rao SengameduAshwin TengliRajeev Rastogi
    • G06F17/30
    • G06F16/9535
    • A method includes generating, a plurality of sets of pairs of records from a set of records, for each attribute-position pair in the set of records. Each attribute-position pair being indicative of a position of an attribute in a record. Further, the method includes forming, electronically, a plurality of groups, each group comprising two attribute-position pairs having different attributes. Further, the method also includes determining, electronically for each group, number of pairs of records that are common in the two attribute-position pairs of that group. Furthermore, the method includes extracting results based on a first group of the plurality of groups if the number of pairs of records that are common in the two attribute-position pairs of the first group is greater than a second threshold, is highest among the plurality of groups, and no group having three or more attribute-position pairs with different attributes is possible.
    • 一种方法包括针对该组记录中的每个属性位置对,从一组记录生成多组记录对。 每个属性位置对指示记录中属性的位置。 此外,该方法包括以电子方式形成多个组,每个组包括具有不同属性的两个属性位置对。 此外,该方法还包括以电子方式确定每组的在该组的两个属性位置对中共有的记录对数。 此外,该方法包括:如果第一组的两个属性位置对中共同的记录对数大于第二阈值,则基于多个组中的第一组来提取结果,在多个组中是最高的 的组,并且没有具有三个或更多个具有不同属性的属性位置对的组是可能的。
    • 4. 发明申请
    • METHOD AND SYSTEM FOR DETERMINING SIMILARITY SCORE
    • 用于确定相似度的方法和系统
    • US20110225173A1
    • 2011-09-15
    • US12721577
    • 2010-03-11
    • Pankaj GulhaneSrinivasan Hanumantha Rao SengameduAshwin TengliRajeev Rastogi
    • Pankaj GulhaneSrinivasan Hanumantha Rao SengameduAshwin TengliRajeev Rastogi
    • G06F17/30
    • G06K9/3266G06K9/723G06K2209/01
    • A method includes generating, electronically, one or more matching patterns for one or more pairs of attribute values. Each pair includes two attribute values. The two attribute values include a first attribute value from a first record and a second attribute value from a second record. The first attribute value and the second attribute value satisfy a first criterion. Further, the method includes identifying, electronically, matching segment between the first attribute value and the second attribute value of a first pair. The method also includes repeating identifying for each pair. Moreover, the method includes computing a similarity score for the first pair using one of the first pair and the matching segment based on the one or more matching patterns and matching segments of the one or more pairs satisfying a second criterion. The method also includes repeating computing for each pair.
    • 一种方法包括以电子方式生成一对或多对属性值的一个或多个匹配模式。 每对包含两个属性值。 两个属性值包括来自第一记录的第一属性值和来自第二记录的第二属性值。 第一属性值和第二属性值满足第一标准。 此外,该方法包括识别电子地匹配第一属性值与第一对的第二属性值之间的片段。 该方法还包括每对重复识别。 此外,该方法包括基于一个或多个匹配模式和满足第二标准的一个或多个对中的匹配片段,使用第一对和匹配片段中的一个来计算第一对的相似性得分。 该方法还包括对每对重复计算。
    • 5. 发明授权
    • Method and system for determining similarity score
    • 确定相似度得分的方法和系统
    • US08620930B2
    • 2013-12-31
    • US12721577
    • 2010-03-11
    • Pankaj GulhaneSrinivasan Hanumantha Rao SengameduAshwin TengliRajeev Rastogi
    • Pankaj GulhaneSrinivasan Hanumantha Rao SengameduAshwin TengliRajeev Rastogi
    • G06F7/00
    • G06K9/3266G06K9/723G06K2209/01
    • A method includes generating, electronically, one or more matching patterns for one or more pairs of attribute values. Each pair includes two attribute values. The two attribute values include a first attribute value from a first record and a second attribute value from a second record. The first attribute value and the second attribute value satisfy a first criterion. Further, the method includes identifying, electronically, matching segment between the first attribute value and the second attribute value of a first pair. The method also includes repeating identifying for each pair. Moreover, the method includes computing a similarity score for the first pair using one of the first pair and the matching segment based on the one or more matching patterns and matching segments of the one or more pairs satisfying a second criterion. The method also includes repeating computing for each pair.
    • 一种方法包括以电子方式生成一对或多对属性值的一个或多个匹配模式。 每对包含两个属性值。 两个属性值包括来自第一记录的第一属性值和来自第二记录的第二属性值。 第一属性值和第二属性值满足第一标准。 此外,该方法包括识别电子地匹配第一属性值与第一对的第二属性值之间的片段。 该方法还包括对每对重复识别。 此外,该方法包括基于一个或多个匹配模式和满足第二标准的一个或多个对中的匹配片段,使用第一对和匹配片段中的一个来计算第一对的相似性得分。 该方法还包括对每对重复计算。
    • 10. 发明授权
    • Sketch-based multi-query processing over data streams
    • 基于草图的数据流多查询处理
    • US07328220B2
    • 2008-02-05
    • US11025211
    • 2004-12-29
    • Alin DobraJohannes GehrkeRajeev RastogiMinos Garofalakis
    • Alin DobraJohannes GehrkeRajeev RastogiMinos Garofalakis
    • G06F17/00
    • G06F17/30516G06F17/3046Y10S707/99936Y10S707/99942
    • A method of efficiently providing estimated answers to workloads of aggregate, multi-join SQL-like queries over a number of input data-streams. The method only examines each data elements once and uses a limited amount of computer memory. The method uses join graphs and atomic sketches that are essentially pseudo-random summaries formed using random binary variables. The estimated answer is the product of all the atomic sketches for all the vertices in the query join graph. A query workload is processed efficiently by identifying and sharing atomic sketches common to distinct queries, while ensuring that the join graphs remain well formed. The method may automatically minimize either the average query error or the maximum query error over the workload.
    • 一种有效提供对多个输入数据流的聚合,多连接SQL类查询的工作负载的估计答案的方法。 该方法仅检查每个数据元素一次并使用有限数量的计算机存储器。 该方法使用连接图和原子素描,它们本质上是使用随机二进制变量形成的伪随机摘要。 估计答案是查询连接图中所有顶点的所有原子草图的乘积。 通过识别和共享不同查询共同的原子草图,同时确保连接图形式保持良好,可以有效地处理查询工作负载。 该方法可以自动最小化平均查询错误或工作负载上的最大查询错误。