    • 1. Invention application
    • System and Process for Record Duplication Analysis
    • US20110004626A1
    • 2011-01-06
    • US12498186
    • 2009-07-06
    • Frank NAEYMI-RAD; Regis CHARLOT; David HAINES; Matthew C. CARDWELL; Michael Decaro
    • Frank NAEYMI-RAD; Regis CHARLOT; David HAINES; Matthew C. CARDWELL; Michael Decaro
    • G06F17/30; G06N5/02
    • G06N7/005
    • A system and process for record duplication analysis that relies on a multi-membership Bayesian analysis to determine the probability that records within a data set are matches. The Bayesian calculation may rely on objective data describing the data set as well as subjective assessments of the data set. In addition, a system and process for record duplication analysis may rely on the predetermination of probabilistic patterns, where the system only searches for patterns exceeding a chosen threshold. Work flow may include selecting which fields within each record should be analyzed, normalizing the values within those fields and removing default data, calculating possible patterns and their match probabilities, analyzing record pairs to determine which have patterns exceeding a chosen threshold to determine the presence of duplicates, and merging duplicates, closing transactions reflecting non-duplicates, identifying records having insufficient data to determine the existence or lack of a match, and/or rolling back accidental merges.
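The workflow in the abstract (normalize fields and drop default data, predetermine which agreement patterns exceed a threshold, then scan record pairs for those patterns) can be illustrated with a minimal Python sketch. This is not the patent's method: the abstract does not specify how its "multi-membership Bayesian analysis" is computed, so a plain Bayes' rule calculation with independent fields stands in for it here. The field names, the per-field probabilities, the prior, the list of default values, and all function names are hypothetical and chosen only for illustration.

```python
from itertools import combinations, product

# Hypothetical per-field probabilities (not taken from the patent):
# m = P(field agrees | records match), u = P(field agrees | records differ)
FIELD_PROBS = {
    "name": (0.95, 0.01),
    "dob": (0.98, 0.003),
    "zip": (0.90, 0.05),
}
PRIOR = 0.01  # assumed prior probability that a random pair is a duplicate
DEFAULTS = {"", "unknown", "n/a", "01/01/1900"}  # assumed placeholder values


def normalize(value):
    """Lowercase, collapse whitespace, and map default data to None."""
    if value is None:
        return None
    cleaned = " ".join(str(value).lower().split())
    return None if cleaned in DEFAULTS else cleaned


def pattern_probability(pattern):
    """Posterior P(match | pattern) by Bayes' rule, assuming independent fields.

    pattern maps field -> True (agree), False (disagree), or None (missing).
    Missing fields contribute no evidence.
    """
    like_match, like_non = 1.0, 1.0
    for field, agrees in pattern.items():
        if agrees is None:
            continue
        m, u = FIELD_PROBS[field]
        like_match *= m if agrees else 1.0 - m
        like_non *= u if agrees else 1.0 - u
    numerator = PRIOR * like_match
    return numerator / (numerator + (1.0 - PRIOR) * like_non)


def precompute_patterns(threshold):
    """Enumerate every agreement pattern and keep those at or above threshold."""
    fields = list(FIELD_PROBS)
    kept = {}
    for outcome in product((True, False, None), repeat=len(fields)):
        p = pattern_probability(dict(zip(fields, outcome)))
        if p >= threshold:
            kept[outcome] = p
    return kept


def find_duplicates(records, threshold=0.9):
    """Return (i, j, probability) for record pairs whose pattern was kept."""
    kept = precompute_patterns(threshold)
    fields = list(FIELD_PROBS)
    norm = [{f: normalize(r.get(f)) for f in fields} for r in records]
    hits = []
    for i, j in combinations(range(len(norm)), 2):
        outcome = tuple(
            None if norm[i][f] is None or norm[j][f] is None
            else norm[i][f] == norm[j][f]
            for f in fields
        )
        if outcome in kept:
            hits.append((i, j, kept[outcome]))
    return hits


records = [
    {"name": "Ann  Lee", "dob": "1980-02-03", "zip": "10001"},
    {"name": "ann lee", "dob": "1980-02-03", "zip": "N/A"},
    {"name": "Bob Ray", "dob": "1975-07-09", "zip": "10001"},
]
hits = find_duplicates(records)
```

Because the qualifying patterns are computed once up front, the pair scan reduces to a dictionary lookup per pair. In this sketch, merging, closing non-duplicate transactions, flagging records with insufficient data, and rollback are left out; they would act on the returned list.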