专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明申请

US20100312775A1 MANAGING UNCERTAIN DATA USING MONTE CARLO TECHNIQUES 有权
标题翻译：使用蒙特卡罗技术管理不确定的数据
公开(公告)号：US20100312775A1
公开(公告)日：2010-12-09
申请号：US12477856
申请日：2009-06-03
申请人： Peter Jay Haas , Ravindranath Jampani , Chistopher Matthew Jermaine , Luis Leopoldo Perez , Mingxi Wu , Fei Xu
发明人： Peter Jay Haas , Ravindranath Jampani , Chistopher Matthew Jermaine , Luis Leopoldo Perez , Mingxi Wu , Fei Xu
IPC分类号： G06F17/30
CPC分类号： G06F17/30536
摘要： According to one embodiment of the present invention, a method for managing uncertain data is provided. The method includes specifying data uncertainty using at least one variable generation (VG) function, wherein the VG function generates pseudorandom samples of uncertain data values. A random database based on the VG function is specified. and multiple Monte Carlo instantiations of the random database are generated. Using a Monte Carlo method, a query is repeatedly executed over the multiple Monte Carlo instantiations to output a Monte Carlo method result and associated query-results. The Monte Carlo method result may then be used to estimate statistical properties of a probability distribution of the query-result.
摘要翻译：根据本发明的一个实施例，提供了一种用于管理不确定数据的方法。该方法包括使用至少一个可变生成（VG）函数来指定数据不确定性，其中VG功能产生不确定数据值的伪随机样本。指定了基于VG功能的随机数据库。并生成随机数据库的多个蒙特卡罗实例。使用蒙特卡罗方法，通过多个蒙特卡洛实例重复执行查询，以输出蒙特卡罗方法结果和关联的查询结果。然后可以使用蒙特卡罗方法结果来估计查询结果的概率分布的统计特性。

2. 发明授权

US08234295B2 Managing uncertain data using Monte Carlo techniques 有权
标题翻译：使用蒙特卡罗技术管理不确定的数据
公开(公告)号：US08234295B2
公开(公告)日：2012-07-31
申请号：US12477856
申请日：2009-06-03
申请人： Peter Jay Haas , Ravindranath Jampani , Chistopher Matthew Jermaine , Luis Leopoldo Perez , Mingxi Wu , Fei Xu
发明人： Peter Jay Haas , Ravindranath Jampani , Chistopher Matthew Jermaine , Luis Leopoldo Perez , Mingxi Wu , Fei Xu
IPC分类号： G06F17/30
CPC分类号： G06F17/30536
摘要： According to one embodiment of the present invention, a method for managing uncertain data is provided. The method includes specifying data uncertainty using at least one variable generation (VG) function, wherein the VG function generates pseudorandom samples of uncertain data values. A random database based on the VG function is specified. and multiple Monte Carlo instantiations of the random database are generated. Using a Monte Carlo method, a query is repeatedly executed over the multiple Monte Carlo instantiations to output a Monte Carlo method result and associated query-results. The Monte Carlo method result may then be used to estimate statistical properties of a probability distribution of the query-result.
摘要翻译：根据本发明的一个实施例，提供了一种用于管理不确定数据的方法。该方法包括使用至少一个可变生成（VG）函数来指定数据不确定性，其中VG功能产生不确定数据值的伪随机样本。指定了基于VG功能的随机数据库。并生成随机数据库的多个蒙特卡罗实例。使用蒙特卡罗方法，通过多个蒙特卡洛实例重复执行查询，以输出蒙特卡罗方法结果和关联的查询结果。然后可以使用蒙特卡罗方法结果来估计查询结果的概率分布的统计特性。

3. 发明申请

US20120254238A1 MANAGING UNCERTAIN DATA USING MONTE CARLO TECHNIQUES 审中-公开
标题翻译：使用蒙特卡罗技术管理不确定的数据
公开(公告)号：US20120254238A1
公开(公告)日：2012-10-04
申请号：US13495610
申请日：2012-06-13
申请人： Peter Jay Haas , Ravindranath Jampani , Christopher Matthew Jermaine , Luis Leopoldo Perez , Mingxi Wu , Fei Xu
发明人： Peter Jay Haas , Ravindranath Jampani , Christopher Matthew Jermaine , Luis Leopoldo Perez , Mingxi Wu , Fei Xu
IPC分类号： G06F17/30
CPC分类号： G06F17/30536
摘要： According to one embodiment of the present invention, a method for managing uncertain data is provided. The method includes specifying data uncertainty using at least one variable generation (VG) function. The VG function generates pseudorandom samples of uncertain data values. A random database based on the VG function is specified and multiple Monte Carlo instantiations of the random database are generated. Using a Monte Carlo method, a query is repeatedly executed over the multiple Monte Carlo instantiations to output a Monte Carlo method result and associated query-results. The Monte Carlo method result may then be used to estimate statistical properties of a probability distribution of the query-result.
摘要翻译：根据本发明的一个实施例，提供了一种用于管理不确定数据的方法。该方法包括使用至少一个变量生成（VG）函数来指定数据不确定性。 VG函数生成不确定数据值的伪随机样本。指定基于VG功能的随机数据库，并生成随机数据库的多个蒙特卡罗实例。使用蒙特卡罗方法，通过多个蒙特卡洛实例重复执行查询，以输出蒙特卡罗方法结果和关联的查询结果。然后可以使用蒙特卡罗方法结果来估计查询结果的概率分布的统计特性。

4. 发明授权

US09063987B2 Managing uncertain data using Monte Carlo techniques 有权
标题翻译：使用蒙特卡罗技术管理不确定的数据
公开(公告)号：US09063987B2
公开(公告)日：2015-06-23
申请号：US13495610
申请日：2012-06-13
申请人： Peter J Haas , Ravindranath Jampani , Christopher M Jermaine , Luis L Perez , Mingxi Wu , Fei Xu
发明人： Peter J Haas , Ravindranath Jampani , Christopher M Jermaine , Luis L Perez , Mingxi Wu , Fei Xu
IPC分类号： G06F17/30
CPC分类号： G06F17/30536
摘要： According to one embodiment of the present invention, a method for managing uncertain data is provided. The method includes specifying data uncertainty using at least one variable generation (VG) function. The VG function generates pseudorandom samples of uncertain data values. A random database based on the VG function is specified and multiple Monte Carlo instantiations of the random database are generated. Using a Monte Carlo method, a query is repeatedly executed over the multiple Monte Carlo instantiations to output a Monte Carlo method result and associated query-results. The Monte Carlo method result may then be used to estimate statistical properties of a probability distribution of the query-result.
摘要翻译：根据本发明的一个实施例，提供了一种用于管理不确定数据的方法。该方法包括使用至少一个变量生成（VG）函数来指定数据不确定性。 VG函数生成不确定数据值的伪随机样本。指定基于VG功能的随机数据库，并生成随机数据库的多个蒙特卡罗实例。使用蒙特卡罗方法，通过多个蒙特卡洛实例重复执行查询，以输出蒙特卡罗方法结果和关联的查询结果。然后可以使用蒙特卡罗方法结果来估计查询结果的概率分布的统计特性。

5. 发明申请

US20130275363A1 META-DATA DRIVEN DATA INGESTION USING MAPREDUCE FRAMEWORK 有权
标题翻译：使用MAPREDUCF框架的元数据驱动数据采集
公开(公告)号：US20130275363A1
公开(公告)日：2013-10-17
申请号：US13466981
申请日：2012-05-08
申请人： Mingxi Wu , Songting Chen
发明人： Mingxi Wu , Songting Chen
IPC分类号： G06F17/30
CPC分类号： G06F9/46
摘要： A generic approach for automatically ingesting data into an HDFS (Hadoop File System) based data warehouse includes a datahub server, a generic pipelined data loading framework, and a meta-data model that, together, address data loading efficiency, data source heterogeneities, and data warehouse schema evolvement. The loading efficiency is achieved via the MapReduce scale-out solution. The meta-data model is comprised of configuration files and a catalog. The configuration file is setup per ingestion task. The catalog manages the data warehouse schema. When a scheduled data loading task is executed, the configuration files and the catalog collaboratively drive the datahub server to load the heterogeneous data to their destination schemas automatically.
摘要翻译：将数据自动摄取到基于HDFS（Hadoop文件系统）的数据仓库中的通用方法包括数据存储服务器，通用流水线数据加载框架和元数据模型，它们一起处理数据加载效率，数据源异构性和数据仓库架构发展。负载效率通过MapReduce横向扩展解决方案实现。元数据模型由配置文件和目录组成。配置文件是每次摄取任务设置的。目录管理数据仓库模式。执行计划的数据加载任务时，配置文件和目录协同驱动数据存储服务器，将异构数据自动加载到目标模式。

6. 发明授权

US08949175B2 Meta-data driven data ingestion using MapReduce framework 有权
标题翻译：使用MapReduce框架进行元数据驱动的数据采集
公开(公告)号：US08949175B2
公开(公告)日：2015-02-03
申请号：US13466981
申请日：2012-05-08
申请人： Mingxi Wu , Songting Chen
发明人： Mingxi Wu , Songting Chen
IPC分类号： G06F17/30
CPC分类号： G06F9/46
摘要： A generic approach for automatically ingesting data into an HDFS (Hadoop File System) based data warehouse includes a datahub server, a generic pipelined data loading framework, and a meta-data model that, together, address data loading efficiency, data source heterogeneities, and data warehouse schema evolvement. The loading efficiency is achieved via the MapReduce scale-out solution. The meta-data model is comprised of configuration files and a catalog. The configuration file is setup per ingestion task. The catalog manages the data warehouse schema. When a scheduled data loading task is executed, the configuration files and the catalog collaboratively drive the datahub server to load the heterogeneous data to their destination schemas automatically.
摘要翻译：将数据自动摄取到基于HDFS（Hadoop文件系统）的数据仓库中的通用方法包括数据存储服务器，通用流水线数据加载框架和元数据模型，它们一起处理数据加载效率，数据源异构性和数据仓库架构发展。负载效率通过MapReduce横向扩展解决方案实现。元数据模型由配置文件和目录组成。配置文件是每次摄取任务设置的。目录管理数据仓库模式。执行计划的数据加载任务时，配置文件和目录协同驱动数据存储服务器，将异构数据自动加载到目标模式。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式