会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Method and architecture for automated optimization of ETL throughput in data warehousing applications
    • 在数据仓库应用程序中自动优化ETL吞吐量的方法和架构
    • US06208990B1
    • 2001-03-27
    • US09116426
    • 1998-07-15
    • Sankaran SureshJyotindra Pramathnath GautamGirish PanchaFrank Joseph DeRoseMohan Sankaran
    • Sankaran SureshJyotindra Pramathnath GautamGirish PanchaFrank Joseph DeRoseMohan Sankaran
    • G06F1730
    • G06F17/30563Y10S707/99936Y10S707/99943Y10S707/99945
    • A computer software architecture to automatically optimize the throughput of the data extraction/transformation/loading (ETL) process in data warehousing applications. This architecture has a componentized aspect and a pipeline-based aspect. The componentized aspect refers to the fact that every transformation used in this architecture is built up with transformation components selected from an extensible set of transformation components. Besides simplifying source code maintenance and adjustment for the data warehouse users, these transformation components also provide these users the building blocks to effectively construct pertinent and functionally sophisticated transformations in a pipelined manner. Within a pipeline, each transformation component automatically stages or streams its data to optimize ETL throughput. Furthermore, each transformation either pushes data to another transformation component, pulls data from another transformation component, or performs a push/pull operation on the data. Thereby, the pipelining; staging/streaming; and pushing/pulling features of the transformation components effectively optimizes the throughput of the ETL process.
    • 一种计算机软件架构,用于自动优化数据仓库应用程序中数据提取/转换/加载(ETL)流程的吞吐量。 该架构具有组件化方面和基于流水线的方面。 组件化方面是指在该架构中使用的每个变换都是由可扩展的转换组件集合中选择的转换组件构建的。 除了简化数据仓库用户的源代码维护和调整外,这些转换组件还为这些用户提供了以流水线方式有效构建相关和功能复杂的转换的构建块。 在管道中,每个转换组件自动对其数据进行排序或流式传输,以优化ETL吞吐量。 此外,每个变换将数据推送到另一个变换组件,从另一变换组件中提取数据,或对数据执行推/拉操作。 因此,流水线; 分段/流式传输 并且转换组件的推/拉功能有效地优化了ETL过程的吞吐量。