专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明申请

US20120291040A1 AUTOMATIC LOAD BALANCING FOR HETEROGENEOUS CORES 有权
标题翻译：自动负载平衡异常角
公开(公告)号：US20120291040A1
公开(公告)日：2012-11-15
申请号：US13105250
申请日：2011-05-11
申请人： Mauricio Breternitz , Patryk Kaminski , Keith Lowery , Anton Chernoff
发明人： Mauricio Breternitz , Patryk Kaminski , Keith Lowery , Anton Chernoff
IPC分类号： G06F9/46
CPC分类号： G06F9/5083
摘要： A system and method for efficient automatic scheduling of the execution of work units between multiple heterogeneous processor cores. A processing node includes a first processor core with a general-purpose micro-architecture and a second processor core with a single instruction multiple data micro-architecture. A computer program comprises one or more compute kernels, or function calls. A compiler computes pre-runtime information of the given function call. A runtime scheduler produces one or more work units by matching each of the one or more kernels with an associated record of data. The scheduler assigns work units either to the first or to the second processor core based at least in part on the computed pre-runtime information. In addition, the scheduler is able to change an original assignment for a waiting work unit based on dynamic runtime behavior of other work units corresponding to a same kernel as the waiting work unit.
摘要翻译：一种用于在多个异构处理器内核之间高效自动调度工作单元执行的系统和方法。处理节点包括具有通用微架构的第一处理器核心和具有单个指令多数据微架构的第二处理器核心。计算机程序包括一个或多个计算内核或函数调用。编译器计算给定函数调用的运行前信息。运行时调度器通过将一个或多个内核中的每一个与相关联的数据记录进行匹配来生成一个或多个工作单元。至少部分地基于所计算的运行前信息，调度器将工作单元分配给第一或第二处理器核。此外，调度器能够基于与等待工作单元相同的内核的其他工作单元的动态运行时行为来改变等待工作单元的原始分配。

2. 发明授权

US08683468B2 Automatic kernel migration for heterogeneous cores 有权
标题翻译：异构核心的自动内核迁移
公开(公告)号：US08683468B2
公开(公告)日：2014-03-25
申请号：US13108438
申请日：2011-05-16
申请人： Mauricio Breternitz , Patryk Kaminski , Keith Lowery , Anton Chernoff , Dz-Ching Ju
发明人： Mauricio Breternitz , Patryk Kaminski , Keith Lowery , Anton Chernoff , Dz-Ching Ju
IPC分类号： G06F9/46
CPC分类号： G06F9/4856 , G06F9/5066
摘要： A system and method for automatically migrating the execution of work units between multiple heterogeneous cores. A computing system includes a first processor core with a single instruction multiple data micro-architecture and a second processor core with a general-purpose micro-architecture. A compiler predicts execution of a function call in a program migrates at a given location to a different processor core. The compiler creates a data structure to support moving live values associated with the execution of the function call at the given location. An operating system (OS) scheduler schedules at least code before the given location in program order to the first processor core. In response to receiving an indication that a condition for migration is satisfied, the OS scheduler moves the live values to a location indicated by the data structure for access by the second processor core and schedules code after the given location to the second processor core.
摘要翻译：一种用于在多个异构核心之间自动迁移工作单元执行的系统和方法。计算系统包括具有单指令多数据微架构的第一处理器核心和具有通用微架构的第二处理器核心。编译器预测程序中的函数调用的执行在给定位置迁移到不同的处理器核心。编译器创建一个数据结构，以支持在给定位置移动与执行函数调用相关联的实时值。操作系统（OS）调度器将程序顺序之前的给定位置之前的至少代码调度到第一处理器核心。响应于接收到满足迁移条件的指示，OS调度器将活动值移动到由数据结构指示的位置，以供第二处理器核心访问，并且将给定位置之后的代码调度到第二处理器核心。

3. 发明申请

US20120331278A1 BRANCH REMOVAL BY DATA SHUFFLING 审中-公开
标题翻译：分支由数据取出拆卸
公开(公告)号：US20120331278A1
公开(公告)日：2012-12-27
申请号：US13167517
申请日：2011-06-23
申请人： Mauricio Breternitz , Patryk Kaminski , Keith Lowery
发明人： Mauricio Breternitz , Patryk Kaminski , Keith Lowery
IPC分类号： G06F9/38
CPC分类号： G06F9/5027 , G06F8/451 , G06F9/5044
摘要： A system and method for automatically optimizing parallel execution of multiple work units in a processor by reducing a number of branch instructions. A computing system includes a first processor core with a general-purpose micro-architecture and a second processor core with a same instruction multiple data (SIMD) micro-architecture. A compiler detects and evaluates branches within function calls with one or more records of data used to determine one or more outcomes. Multiple compute sub-kernels are generated, each comprising code from the function corresponding to a unique outcome of the branch. Multiple work units are produced by assigning one or more records of data corresponding to a given outcome of the branch to one of the multiple compute sub-kernels associated with the given outcome. The branch is removed. An operating system scheduler schedules each of the one or more compute sub-kernels to the first processor core or to the second processor core.
摘要翻译：一种用于通过减少多个分支指令来自动优化处理器中的多个工作单元的并行执行的系统和方法。计算系统包括具有通用微架构的第一处理器核和具有相同指令多数据（SIMD）微架构的第二处理器核。编译器使用用于确定一个或多个结果的一个或多个数据记录来检测和评估函数调用中的分支。生成多个计算子内核，每个子内核包含来自与分支的唯一结果相对应的函数的代码。通过将与分支的给定结果相对应的数据的一个或多个记录分配给与给定结果相关联的多个计算子核之一来生成多个工作单元。分支被删除。操作系统调度器将一个或多个计算子内核中的每一个调度到第一处理器核或第二处理器核。

4. 发明授权

US08782645B2 Automatic load balancing for heterogeneous cores 有权
标题翻译：异构核心的自动负载平衡
公开(公告)号：US08782645B2
公开(公告)日：2014-07-15
申请号：US13105250
申请日：2011-05-11
申请人： Mauricio Breternitz , Patryk Kaminski , Keith Lowery , Anton Chernoff
发明人： Mauricio Breternitz , Patryk Kaminski , Keith Lowery , Anton Chernoff
IPC分类号： G06F9/46 , G06F9/50
CPC分类号： G06F9/5083
摘要： A system and method for efficient automatic scheduling of the execution of work units between multiple heterogeneous processor cores. A processing node includes a first processor core with a general-purpose micro-architecture and a second processor core with a single instruction multiple data micro-architecture. A computer program comprises one or more compute kernels, or function calls. A compiler computes pre-runtime information of the given function call. A runtime scheduler produces one or more work units by matching each of the one or more kernels with an associated record of data. The scheduler assigns work units either to the first or to the second processor core based at least in part on the computed pre-runtime information. In addition, the scheduler is able to change an original assignment for a waiting work unit based on dynamic runtime behavior of other work units corresponding to a same kernel as the waiting work unit.
摘要翻译：一种用于在多个异构处理器内核之间高效自动调度工作单元执行的系统和方法。处理节点包括具有通用微架构的第一处理器核心和具有单个指令多数据微架构的第二处理器核心。计算机程序包括一个或多个计算内核或函数调用。编译器计算给定函数调用的运行前信息。运行时调度器通过将一个或多个内核中的每一个与相关联的数据记录进行匹配来生成一个或多个工作单元。至少部分地基于所计算的运行前信息，调度器将工作单元分配给第一或第二处理器核。此外，调度器能够基于与等待工作单元相同的内核的其他工作单元的动态运行时行为来改变等待工作单元的原始分配。

5. 发明申请

US20120297163A1 AUTOMATIC KERNEL MIGRATION FOR HETEROGENEOUS CORES 有权
标题翻译：自动KERNEL移动异构牙
公开(公告)号：US20120297163A1
公开(公告)日：2012-11-22
申请号：US13108438
申请日：2011-05-16
申请人： Mauricio Breternitz , Patryk Kaminski , Keith Lowery , Anton Chernoff , Dz-Ching Ju
发明人： Mauricio Breternitz , Patryk Kaminski , Keith Lowery , Anton Chernoff , Dz-Ching Ju
IPC分类号： G06F9/315 , G06F15/80
CPC分类号： G06F9/4856 , G06F9/5066
摘要： A system and method for automatically migrating the execution of work units between multiple heterogeneous cores. A computing system includes a first processor core with a single instruction multiple data micro-architecture and a second processor core with a general-purpose micro-architecture. A compiler predicts execution of a function call in a program migrates at a given location to a different processor core. The compiler creates a data structure to support moving live values associated with the execution of the function call at the given location. An operating system (OS) scheduler schedules at least code before the given location in program order to the first processor core. In response to receiving an indication that a condition for migration is satisfied, the OS scheduler moves the live values to a location indicated by the data structure for access by the second processor core and schedules code after the given location to the second processor core.
摘要翻译：一种用于在多个异构核心之间自动迁移工作单元执行的系统和方法。计算系统包括具有单指令多数据微架构的第一处理器核心和具有通用微架构的第二处理器核心。编译器预测程序中的函数调用的执行在给定位置迁移到不同的处理器核心。编译器创建一个数据结构，以支持在给定位置移动与执行函数调用相关联的实时值。操作系统（OS）调度器将程序顺序之前的给定位置之前的至少代码调度到第一处理器核心。响应于接收到满足迁移条件的指示，OS调度器将活动值移动到由数据结构指示的位置，以供第二处理器核心访问，并且将给定位置之后的代码调度到第二处理器核心。

6. 发明申请

US20140372782A1 COMBINED DYNAMIC AND STATIC POWER AND PERFORMANCE OPTIMIZATION ON DATA CENTERS 有权
标题翻译：数据中心的组合动态和静态功能和性能优化
公开(公告)号：US20140372782A1
公开(公告)日：2014-12-18
申请号：US13917417
申请日：2013-06-13
申请人： Mauricio Breternitz , Leonardo Piga , Patryk Kaminski
发明人： Mauricio Breternitz , Leonardo Piga , Patryk Kaminski
IPC分类号： G06F1/28
CPC分类号： G06F1/3206 , G06F1/324 , G06F1/3296 , Y02D10/126
摘要： Various datacenter or other computing center control apparatus and methods are disclosed. In one aspect, a method of computing is provided that includes defining plural processor performance bins where each processor performance bin has a processor performance state. At least one processor is assigned to each of the plural processor performance bins. Processor performance metrics of at least one of the processors are monitored while the at least one of the processors executes an incoming task. Processor power is modeled based on the monitored performance metrics. Future incoming tasks are assigned to one of the processor performance bins based on the modeled processor power.
摘要翻译：公开了各种数据中心或其他计算中心控制装置和方法。在一个方面，提供了一种计算方法，其包括定义多个处理器性能箱，其中每个处理器性能仓具有处理器性能状态。至少一个处理器分配给多个处理器性能仓中的每一个。在处理器中的至少一个处理器执行传入任务时，监视至少一个处理器的处理器性能度量。处理器电源是基于监视的性能指标进行建模的。基于建模的处理器功率，将来的进入任务分配给一个处理器性能箱。

7. 发明授权

US09274585B2 Combined dynamic and static power and performance optimization on data centers 有权
标题翻译：数据中心的动态和静态功耗与性能优化相结合
公开(公告)号：US09274585B2
公开(公告)日：2016-03-01
申请号：US13917417
申请日：2013-06-13
申请人： Mauricio Breternitz , Leonardo Piga , Patryk Kaminski
发明人： Mauricio Breternitz , Leonardo Piga , Patryk Kaminski
IPC分类号： G06F1/32
CPC分类号： G06F1/3206 , G06F1/324 , G06F1/3296 , Y02D10/126
摘要： Various datacenter or other computing center control apparatus and methods are disclosed. In one aspect, a method of computing is provided that includes defining plural processor performance bins where each processor performance bin has a processor performance state. At least one processor is assigned to each of the plural processor performance bins. Processor performance metrics of at least one of the processors are monitored while the at least one of the processors executes an incoming task. Processor power is modeled based on the monitored performance metrics. Future incoming tasks are assigned to one of the processor performance bins based on the modeled processor power.
摘要翻译：公开了各种数据中心或其他计算中心控制装置和方法。在一个方面，提供了一种计算方法，其包括定义多个处理器性能箱，其中每个处理器性能仓具有处理器性能状态。至少一个处理器分配给多个处理器性能仓中的每一个。在处理器中的至少一个处理器执行传入任务时，监视至少一个处理器的处理器性能度量。处理器电源是基于监视的性能指标进行建模的。基于建模的处理器功率，将来的进入任务分配给一个处理器性能箱。

8. 发明申请

US20140047341A1 SYSTEM AND METHOD FOR CONFIGURING CLOUD COMPUTING SYSTEMS 有权
标题翻译：用于配置云计算系统的系统和方法
公开(公告)号：US20140047341A1
公开(公告)日：2014-02-13
申请号：US13568368
申请日：2012-08-07
申请人： Mauricio Breternitz , Keith A. Lowery , Patryk Kaminski , Anton Chernoff
发明人： Mauricio Breternitz , Keith A. Lowery , Patryk Kaminski , Anton Chernoff
IPC分类号： G06F15/177 , G06F3/01
CPC分类号： G06F17/30002 , G06F9/505 , G06F9/5072
摘要： The present disclosure relates to a method, system, and apparatus for configuring a computing system, such as a cloud computing system. A method includes, based on user selections received via a user interface, configuring a cluster of nodes by selecting the cluster of nodes from a plurality of available nodes, selecting a workload container module from a plurality of available workload container modules for operation on each node of the selected cluster of nodes, and selecting a workload for execution with the workload container on the cluster of nodes. Each node of the cluster of nodes includes at least one processing device and memory, and the cluster of nodes is operative to share processing of a workload.
摘要翻译：本公开涉及用于配置诸如云计算系统之类的计算系统的方法，系统和装置。一种方法包括：基于经由用户接口接收的用户选择，通过从多个可用节点中选择节点簇来配置节点簇，从多个可用工作负载容器模块中选择工作负载容器模块，以在每个节点上进行操作的选定的节点集群，并选择一个工作负载以便与节点集群上的工作负载容器一起执行。节点集群的每个节点包括至少一个处理设备和存储器，并且节点集群可操作地共享工作负载的处理。

9. 发明授权

US09152532B2 System and method for configuring a cloud computing system with a synthetic test workload 有权
标题翻译：用于配置具有合成测试工作负载的云计算系统的系统和方法
公开(公告)号：US09152532B2
公开(公告)日：2015-10-06
申请号：US13568459
申请日：2012-08-07
申请人： Mauricio Breternitz , Keith A. Lowery , Patryk Kaminski , Anton Chernoff
发明人： Mauricio Breternitz , Keith A. Lowery , Patryk Kaminski , Anton Chernoff
IPC分类号： G06F11/00 , G06F11/34 , G06F11/26 , G06F9/50
CPC分类号： G06F11/3495 , G06F9/5072 , G06F11/26 , G06F11/3414 , G06F11/3433
摘要： The present disclosure relates to a method and system for configuring a computing system, such as a cloud computing system. A method includes selecting, based on a user selection received via a user interface, a workload for execution on a cluster of nodes of the computing system. The workload is selected from a plurality of available workloads including an actual workload and a synthetic test workload. The method further includes configuring the cluster of nodes of the computing system to execute the selected workload such that processing of the selected workload is distributed across the cluster of nodes. The synthetic test workload may be generated by a code synthesizer based on a set of user-defined workload parameters provided via a user interface that identify execution characteristics of the synthetic test workload.
摘要翻译：本公开涉及用于配置诸如云计算系统之类的计算系统的方法和系统。一种方法包括：基于经由用户接口接收到的用户选择来选择用于在所述计算系统的节点簇上执行的工作负载。从多个可用工作负载中选择工作负载，包括实际工作负载和合成测试工作负载。该方法还包括配置计算系统的节点集群以执行所选择的工作负载，使得所选择的工作负载的处理分布在节点簇上。综合测试工作量可以由代码合成器基于经由识别合成测试工作负载的执行特性的用户界面提供的一组用户定义的工作负载参数来生成。

10. 发明授权

US08887056B2 System and method for configuring cloud computing systems 有权
标题翻译：用于配置云计算系统的系统和方法
公开(公告)号：US08887056B2
公开(公告)日：2014-11-11
申请号：US13568368
申请日：2012-08-07
申请人： Mauricio Breternitz , Keith A. Lowery , Patryk Kaminski , Anton Chernoff
发明人： Mauricio Breternitz , Keith A. Lowery , Patryk Kaminski , Anton Chernoff
IPC分类号： G06F15/177 , G06F3/00 , G06F17/30 , G06F9/50
CPC分类号： G06F17/30002 , G06F9/505 , G06F9/5072
摘要： The present disclosure relates to a method, system, and apparatus for configuring a computing system, such as a cloud computing system. A method includes, based on user selections received via a user interface, configuring a cluster of nodes by selecting the cluster of nodes from a plurality of available nodes, selecting a workload container module from a plurality of available workload container modules for operation on each node of the selected cluster of nodes, and selecting a workload for execution with the workload container on the cluster of nodes. Each node of the cluster of nodes includes at least one processing device and memory, and the cluster of nodes is operative to share processing of a workload.
摘要翻译：本公开涉及用于配置诸如云计算系统之类的计算系统的方法，系统和装置。一种方法包括：基于经由用户接口接收的用户选择，通过从多个可用节点中选择节点簇来配置节点簇，从多个可用工作负载容器模块中选择工作负载容器模块，以在每个节点上进行操作的选定的节点集群，并选择一个工作负载以便与节点集群上的工作负载容器一起执行。节点集群的每个节点包括至少一个处理设备和存储器，并且节点集群可操作地共享工作负载的处理。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式