    • 37. Invention publication (published application)
    • DATA CLUSTERING, SEGMENTATION, AND PARALLELIZATION
    • Geneva, Republic of Serbia
    • EP2780835A1
    • 2014-09-24
    • EP12795220.8
    • 2012-11-15
    • Ab Initio Technology LLC
    • ANDERSON, Arlen
    • G06F17/30
    • G06F16/285; G06F16/20; G06F16/24534; G06F16/3338
    • Received data records, each including one or more values in one or more fields, are processed to identify one or more data clusters. The processing includes: identifying (110) tokens that each include at least one value or fragment of a value in a field or a combination of fields; generating (120) a network representing the identified tokens, with nodes of the network representing tokens and edges of the network each representing a variant relationship between tokens; and generating a graphical representation of the network with different subsets of nodes distinguished based at least in part on values associated with nodes, where a value associated with a particular node quantifies a count of a number of instances of the token represented by that particular node appearing within the received data records.
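    • Received data records, each including one or more values in one or more fields, are processed to identify a matching data cluster. The processing includes: for a selected data record, generating a query from the one or more values; using the query to identify one or more candidate data records from the received data records; determining whether the selected data record satisfies a cluster membership criterion for at least one candidate data cluster among one or more existing data clusters that contain the candidate records; and selecting the matching data cluster from the one or more candidate data clusters based at least in part on a growth criterion for the candidate data clusters, or initializing the matching data cluster with the selected data record if the selected data record does not satisfy the cluster membership criterion for any existing data cluster, or based on a result of the growth criterion.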
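
The matching steps in the abstract above (build a query from a record, look up candidates, test membership, apply a growth criterion, otherwise start a new cluster) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the method claimed in the application: the function names, the whitespace tokenization, the Jaccard similarity with a 0.5 threshold, and the size cap used as a growth criterion are all hypothetical stand-ins, since the abstract does not specify any of them.

```python
def tokenize(record):
    """Build the query for a record: lower-cased words from all field values."""
    return {word for value in record.values() for word in str(value).lower().split()}


def jaccard(a, b):
    """Token-set overlap in [0, 1]; an assumed stand-in similarity measure."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def assign_clusters(records, threshold=0.5, max_size=3):
    """Return one cluster id per record, processing records in order."""
    clusters = []       # cluster id -> list of member token sets
    token_index = {}    # token -> ids of clusters with a member holding it
    labels = []
    for record in records:
        query = tokenize(record)
        # Candidate clusters: those containing a record that shares a query token.
        candidates = set()
        for token in query:
            candidates |= token_index.get(token, set())
        best_id, best_score = None, -1.0
        for cid in sorted(candidates):
            members = clusters[cid]
            score = max(jaccard(query, member) for member in members)
            is_member = score >= threshold        # assumed membership criterion
            may_grow = len(members) < max_size    # assumed growth criterion
            if is_member and may_grow and score > best_score:
                best_id, best_score = cid, score
        if best_id is None:
            # No candidate passed both criteria: start a new cluster.
            clusters.append([])
            best_id = len(clusters) - 1
        clusters[best_id].append(query)
        for token in query:
            token_index.setdefault(token, set()).add(best_id)
        labels.append(best_id)
    return labels


if __name__ == "__main__":
    sample = [
        {"name": "John Smith", "city": "Boston"},
        {"name": "Jon Smith", "city": "Boston"},
        {"name": "Mary Jones", "city": "Denver"},
    ]
    print(assign_clusters(sample))
```

The token index keeps candidate lookup proportional to the number of query tokens rather than the number of existing clusters, which is one plausible reason to generate a query first instead of comparing each record against every cluster.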