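    • 1. Published invention application
    • Title: DATA ARCHIVE VAULT IN BIG DATA PLATFORM
    • Publication number: EP3128445A1
    • Publication date: 2017-02-08
    • Application number: EP16001738.0
    • Filing date: 2016-08-04
    • Applicant: SAP SE
    • Inventors: Herbst, Axel; Bolik, Veit; Roeher, Mathias
    • IPC: G06F17/30
    • CPC: G06F16/219; G06F16/2272; G06F16/258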
    • Embodiments relate to data archiving utilizing an existing big data platform (e.g., HADOOP) as a cost-effective target infrastructure for storage. Particular embodiments construct a logical structure (hereafter, "vault") in the big data platform so that the source, type, and context of the data are maintained, and metadata can be added to aid searching for snapshots according to a given time, version, and other considerations. A vaulting process transforms relationally stored data into an object view to allow for object-based retrieval or object-wise operations (such as destruction for legal data privacy reasons), and provides references so that unstructured data (e.g., sensor data, documents, streams) can also be stored as attachments. A legacy archive extractor provides extraction services for existing archives, so that extracted information is stored in the same vault. This allows for cross queries over legacy data and data from other sources, facilitating the application of new analysis techniques by data scientists.
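The vaulting idea in the abstract can be illustrated with a minimal in-memory sketch. It is not the method claimed in the application and does not use HADOOP: the names `VaultEntry`, `Vault`, `to_object_view`, and the sample table layout are hypothetical, chosen only to show relational rows being grouped into an object view, tagged with source, type, and snapshot-time metadata, searched by time, and destroyed object-wise.

```python
from collections import defaultdict
from dataclasses import dataclass, field
from datetime import datetime


@dataclass
class VaultEntry:
    """One snapshot of one business object (hypothetical structure)."""
    object_id: str
    source: str              # originating system
    object_type: str         # kind of business object
    snapshot_time: datetime  # metadata used for time-based search
    records: dict            # table name -> list of row dicts
    attachments: list = field(default_factory=list)  # references to unstructured data


class Vault:
    """In-memory stand-in for the logical vault structure."""

    def __init__(self):
        self._entries = defaultdict(list)

    def store(self, entry):
        # Keep the snapshots of each object ordered by time.
        snapshots = self._entries[entry.object_id]
        snapshots.append(entry)
        snapshots.sort(key=lambda e: e.snapshot_time)

    def snapshot_at(self, object_id, when):
        """Return the latest snapshot taken at or before `when`, or None."""
        candidates = [e for e in self._entries.get(object_id, [])
                      if e.snapshot_time <= when]
        return candidates[-1] if candidates else None

    def destroy(self, object_id):
        """Object-wise destruction: drop every snapshot of one object."""
        return len(self._entries.pop(object_id, []))


def to_object_view(tables, key, source, object_type, snapshot_time):
    """Group rows from several relational tables by a shared key column."""
    grouped = defaultdict(lambda: defaultdict(list))
    for table_name, rows in tables.items():
        for row in rows:
            grouped[row[key]][table_name].append(row)
    return [
        VaultEntry(str(object_id), source, object_type, snapshot_time, dict(records))
        for object_id, records in grouped.items()
    ]
```

With two hypothetical tables `order_header` and `order_item` that share an `order_id` column, `to_object_view(tables, "order_id", "erp", "sales_order", taken_at)` yields one entry per order. After storing the entries, `snapshot_at` answers time-based lookups and `destroy` removes a single order without touching the others.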