    • Granted invention patent
    • Detecting correlation from data
    • US07647293B2
    • 2010-01-12
    • US10864463
    • 2004-06-10
    • Paul Geoffrey Brown, Peter Jay Haas, Ihab F. Ilyas, Volker G. Markl
    • Paul Geoffrey Brown, Peter Jay Haas, Ihab F. Ilyas, Volker G. Markl
    • G06F17/30
    • G06F17/30536, G06F17/30471, Y10S707/99932
    • A system and method are provided for discovering dependencies between pairs of relational database columns and for applying those discoveries to query optimization. Column pairs are generated while, simultaneously, pairs that do not satisfy specified heuristic constraints are pruned and pairs with trivial instances of correlation are eliminated. For each candidate column pair that remains, a random sample of data values is collected. The candidate pair is first tested for the existence of a soft functional dependency (FD); if no dependency is found, it is statistically tested for correlation using a robust chi-squared statistic. Column pairs for which either a soft FD or a statistical correlation exists are prioritized for recommendation to a query optimizer, based on any of: strength of dependency, degree of correlation, or adjustment factor. Statistics for the recommended column pairs are tracked to improve selectivity estimates. Additionally, a dependency graph is provided that represents correlations and dependencies as edges and column pairs as nodes.
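
The two-stage test described in the abstract (soft FD first, then a chi-squared test) might be sketched as follows. This is a minimal illustration, not the patented method: the function names, the 0.95 FD threshold, and the 0.01 significance level are all hypothetical. The soft FD strength is assumed to be the ratio of distinct values of the first column to distinct value pairs in the sample. A plain Pearson chi-squared statistic stands in for the "robust" statistic the abstract mentions, and the critical value uses the Wilson-Hilferty approximation rather than an exact quantile.

```python
import math
from collections import Counter
from statistics import NormalDist


def soft_fd_strength(pairs):
    """Ratio |distinct a| / |distinct (a, b)| over the sample.

    A value of 1.0 means every sampled a maps to exactly one b.
    (Assumed measure; the abstract does not define the strength.)
    """
    distinct_a = {a for a, _ in pairs}
    distinct_ab = set(pairs)
    return len(distinct_a) / len(distinct_ab)


def chi_squared(pairs):
    """Plain Pearson chi-squared statistic and degrees of freedom
    for the contingency table built from the sampled pairs."""
    n = len(pairs)
    row = Counter(a for a, _ in pairs)
    col = Counter(b for _, b in pairs)
    cell = Counter(pairs)
    stat = 0.0
    for a, row_total in row.items():
        for b, col_total in col.items():
            expected = row_total * col_total / n
            observed = cell.get((a, b), 0)
            stat += (observed - expected) ** 2 / expected
    dof = (len(row) - 1) * (len(col) - 1)
    return stat, dof


def chi2_critical(dof, alpha):
    """Approximate upper-alpha chi-squared quantile (Wilson-Hilferty)."""
    z = NormalDist().inv_cdf(1 - alpha)
    k = 2 / (9 * dof)
    return dof * (1 - k + z * math.sqrt(k)) ** 3


def classify(pairs, fd_threshold=0.95, alpha=0.01):
    """Label a sampled column pair; both thresholds are hypothetical."""
    if soft_fd_strength(pairs) >= fd_threshold:
        return "soft_fd"
    stat, dof = chi_squared(pairs)
    if dof > 0 and stat > chi2_critical(dof, alpha):
        return "correlated"
    return "independent"


# Sample where a fully determines b.
fd_sample = [(i % 10, (i % 10) // 2) for i in range(200)]
# Sample where b usually equals a, but not always.
corr_sample = [(i % 4, i % 4 if i % 5 else (i % 4 + 1) % 4) for i in range(400)]
# Sample where every (a, b) combination is equally frequent.
indep_sample = [(i % 4, (i // 4) % 4) for i in range(400)]

for name, sample in [("fd", fd_sample), ("corr", corr_sample), ("indep", indep_sample)]:
    print(name, classify(sample))
```

The sketch assumes `pairs` is already the random sample and that trivial cases (for example a key column, which would score 1.0 on the FD ratio) were eliminated beforehand, as the abstract describes.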