    • 4. Invention application
    • METHOD AND SYSTEM FOR DATA PROVENANCE MANAGEMENT IN MULTI-LAYER SYSTEMS
    • US20120203782A1
    • 2012-08-09
    • US13022334
    • 2011-02-07
    • Chris Olston, Anish Das Sarma
    • G06F17/30
    • G06F17/30557
    • Method, system, and programs for heterogeneous data management. Information from multiple data sources is first obtained. Data/metadata from each of the data sources is modeled based on the source and/or granularity information of the data/metadata to generate data/metadata models. The data/metadata from the multiple data sources are integrated, by applying one or more processes to the data/metadata from different data sources based on the data/metadata models, to generate integrated data/metadata. A provenance representation for the integrated data/metadata is created, tracing the sources, granularities, and/or processes applied, and is archived to enable a query associated with the integrated data/metadata.
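
The flow in this abstract (model each item by source and granularity, integrate by applying a process, archive a provenance record, then query it) could be sketched roughly as follows. This is only an illustration under assumptions: the names `Item`, `Provenance`, `integrate`, `archive`, and `sources_of`, and the sample feeds, are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    """A data item modeled by its source and granularity (hypothetical model)."""
    value: str
    source: str
    granularity: str


@dataclass
class Provenance:
    """Records which inputs and which process produced an integrated value."""
    inputs: list
    process: str


def integrate(items, process_name, combine):
    """Apply one process to the items and build a provenance record for the result."""
    result = combine([item.value for item in items])
    return result, Provenance(inputs=list(items), process=process_name)


# Archive mapping each integrated value to its provenance record.
archive = {}

items = [
    Item("Paris", source="feed_a", granularity="row"),
    Item("paris", source="feed_b", granularity="table"),
]
value, prov = integrate(
    items,
    "normalize_and_merge",
    lambda vals: sorted({v.lower() for v in vals})[0],
)
archive[value] = prov


def sources_of(integrated_value):
    """Provenance query: which sources contributed to this integrated value?"""
    return sorted({item.source for item in archive[integrated_value].inputs})
```

Here `sources_of("paris")` traces the merged value back to both feeds, and `archive["paris"].process` names the process that was applied.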
    • 6. Granted invention patent
    • Leveraging constraints for deduplication
    • US08204866B2
    • 2012-06-19
    • US11804400
    • 2007-05-18
    • Surajit Chaudhuri, Venkatesh Ganti, Shriraghav Kaushik, Anish Das Sarma
    • G06F17/30
    • G06F17/30489
    • A deduplication algorithm that improves the accuracy of data deduplication by using aggregate and/or groupwise constraints. Rather than imposing these constraints inflexibly as hard constraints, deduplication satisfies as many of them as possible. Additionally, textual similarity between tuples is leveraged to restrict the search space. The algorithm begins with a coarse initial partition of data records and continues by raising the similarity threshold until the threshold splits a given partition. This sequence of splits defines a rich space of alternatives. Over this space, an algorithm finds a partition of the input that maximizes constraint satisfaction. In the context of groupwise aggregation constraints for deduplication, all SQL (Structured Query Language) aggregates are allowed, including summation.
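
As a rough illustration of the idea in this abstract, the sketch below builds partitions at increasing similarity thresholds and keeps the one that satisfies a groupwise SUM constraint for the most records. It is a simplification under stated assumptions, not the patented algorithm: it tries one global threshold at a time instead of exploring splits per partition, it uses token-level Jaccard similarity as a stand-in for textual similarity, and the records, `billed_totals`, and all function names are hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Token-level Jaccard similarity between two strings (assumed measure)."""
    ta, tb = set(a.split()), set(b.split())
    return len(ta & tb) / len(ta | tb)


def partition(records, threshold):
    """Group records linked by similarity >= threshold (single linkage)."""
    parent = list(range(len(records)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(len(records)), 2):
        if jaccard(records[i][0], records[j][0]) >= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(records)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


def satisfied_records(records, groups, constraint):
    """Count records that fall in groups whose amounts satisfy the constraint."""
    return sum(len(g) for g in groups if constraint([records[i][1] for i in g]))


def best_partition(records, thresholds, constraint):
    """Raise the threshold step by step; keep the partition that satisfies most."""
    candidates = [partition(records, t) for t in sorted(thresholds)]
    return max(candidates, key=lambda g: satisfied_records(records, g, constraint))


# Hypothetical data: (name, amount) tuples and totals each group should sum to.
records = [("acme corp ny", 100), ("acme corp", 50), ("acme inc", 70), ("zeta llc", 30)]
billed_totals = {150, 70, 30}


def sum_matches_bill(amounts):
    """Groupwise aggregate constraint: SUM(amount) must equal a billed total."""
    return sum(amounts) in billed_totals


best = best_partition(records, [0.2, 0.5, 0.7], sum_matches_bill)
```

With this data the coarse partition at 0.2 merges all three "acme" records and violates the constraint, while the split at 0.5 leaves every group matching a billed total, so `best` groups records 0 and 1 together and keeps records 2 and 3 separate. The constraint is soft: a partition that violates it for some groups is still a valid candidate, it just scores lower.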
    • 7. Invention application
    • METHOD AND SYSTEM FOR DISCOVERING DYNAMIC RELATIONS AMONG ENTITIES
    • US20120143875A1
    • 2012-06-07
    • US12958151
    • 2010-12-01
    • Anish Das Sarma, Alpa Jain, Cong Yu
    • G06F17/30
    • G06F16/3346, G06F16/288
    • Method, system, and programs for detecting dynamic relationships and discovering dynamic events. Data from a first data source is first received. At least one dynamic relation candidate is identified, and each dynamic relation candidate involves multiple entities. The at least one dynamic relation candidate is identified based on temporal properties with respect to the entities exhibited in the data from the first data source. Dynamic relations are then extracted by corroborating the temporal properties of the entities involved in the at least one dynamic relation candidate with those of the same entities exhibited in data from a second data source. Then, a dynamic event that gives rise to the dynamic relations among different entities is detected.
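
One way to picture the three steps in this abstract (identify candidates from a first source, corroborate them against a second source, then detect the event behind them) is the sketch below. It assumes, purely for illustration, that the "temporal property" is the set of days on which an entity shows a burst of activity; the abstract does not specify this. The entity names, both data sources, and all function names are hypothetical.

```python
from itertools import combinations


def co_bursts(bursts):
    """Map each entity pair to the days on which both entities burst."""
    pairs = {}
    for a, b in combinations(sorted(bursts), 2):
        shared = bursts[a] & bursts[b]
        if shared:
            pairs[(a, b)] = shared
    return pairs


def dynamic_relations(first, second):
    """Keep candidate pairs from the first source that the second corroborates."""
    candidates = co_bursts(first)
    confirmed = co_bursts(second)
    relations = {}
    for pair, days in candidates.items():
        common = days & confirmed.get(pair, set())
        if common:
            relations[pair] = common
    return relations


def events(relations):
    """Group corroborated relations by day: each day is a candidate event."""
    by_day = {}
    for pair, days in relations.items():
        for day in days:
            by_day.setdefault(day, set()).update(pair)
    return by_day


# Hypothetical burst days per entity in two independent sources.
query_log = {
    "actor_x": {"2010-03-07"},
    "film_y": {"2010-03-07", "2010-05-01"},
    "city_z": {"2010-05-01"},
}
news = {
    "actor_x": {"2010-03-07"},
    "film_y": {"2010-03-07"},
    "city_z": {"2010-06-10"},
}

relations = dynamic_relations(query_log, news)
detected = events(relations)
```

In this data the pair `("city_z", "film_y")` is a candidate in the first source but is dropped because the second source does not corroborate it, so only `("actor_x", "film_y")` survives, tied to a single event day.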
    • 10. Granted invention patent
    • Method and system for data provenance management in multi-layer systems
    • US08819064B2
    • 2014-08-26
    • US13022334
    • 2011-02-07
    • Chris Olston, Anish Das Sarma
    • G06F17/30
    • G06F17/30557
    • Method, system, and programs for heterogeneous data management. Information from multiple data sources is first obtained. Data/metadata from each of the data sources is modeled based on the source and/or granularity information of the data/metadata to generate data/metadata models. The data/metadata from the multiple data sources are integrated, by applying one or more processes to the data/metadata from different data sources based on the data/metadata models, to generate integrated data/metadata. A provenance representation for the integrated data/metadata is created, tracing the sources, granularities, and/or processes applied, and is archived to enable a query associated with the integrated data/metadata.