专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明授权

US08261249B2 Distributed schemes for deploying an application in a large parallel system 有权
标题翻译：在大型并行系统中部署应用程序的分布式方案
公开(公告)号：US08261249B2
公开(公告)日：2012-09-04
申请号：US11971006
申请日：2008-01-08
申请人： Charles Jens Archer , Thomas Michael Gooding , Ruth Janine Poole , Albert Sidelnik
发明人： Charles Jens Archer , Thomas Michael Gooding , Ruth Janine Poole , Albert Sidelnik
IPC分类号： G06F9/45 , G06F9/46 , G06F15/16 , G06F15/173
CPC分类号： G06F9/5072
摘要： Embodiments of the invention provide a method for deploying and running an application on a massively parallel computer system, while minimizing the costs associated with latency, bandwidth, and limited memory resources. The executable code of a program may be divided into multiple code fragments and distributed to different compute nodes of a parallel computing system. During program execution, one compute node may fetch code fragments from other compute nodes as necessary.
摘要翻译：本发明的实施例提供了一种用于在大规模并行计算机系统上部署和运行应用程序的方法，同时最小化与等待时间，带宽和有限的存储器资源相关联的成本。程序的可执行代码可以被划分为多个代码片段并被分配到并行计算系统的不同计算节点。在程序执行期间，一个计算节点可以根据需要从其他计算节点获取代码段。

2. 发明授权

US08112658B2 Row fault detection system 失效
标题翻译：行故障检测系统
公开(公告)号：US08112658B2
公开(公告)日：2012-02-07
申请号：US12197563
申请日：2008-08-25
申请人： Charles Jens Archer , Kurt Walter Pinnow , Joseph D. Ratterman , Brian Edward Smith
发明人： Charles Jens Archer , Kurt Walter Pinnow , Joseph D. Ratterman , Brian Edward Smith
IPC分类号： G06F11/00
CPC分类号： G06F15/16
摘要： An apparatus, program product and method check for nodal faults in a row of nodes by causing each node in the row to concurrently communicate with its adjacent neighbor nodes in the row. The communications are analyzed to determine a presence of a faulty node or connection.
摘要翻译：通过使该行中的每个节点与该行中的相邻邻居节点同时进行通信，从而对一行节点中的节点故障进行设备，程序产品和方法的检查。分析通信以确定故障节点或连接的存在。

3. 发明授权

US08090704B2 Database retrieval with a non-unique key on a parallel computer system 失效
标题翻译：在并行计算机系统上使用非唯一的密钥进行数据库检索
公开(公告)号：US08090704B2
公开(公告)日：2012-01-03
申请号：US11830463
申请日：2007-07-30
申请人： Charles Jens Archer , Amanda Peters , Gary Ross Ricard , Albert Sidelnik , Brian Edward Smith
发明人： Charles Jens Archer , Amanda Peters , Gary Ross Ricard , Albert Sidelnik , Brian Edward Smith
IPC分类号： G06F17/30
CPC分类号： G06F17/30445
摘要： An apparatus and method retrieves a database record from an in-memory database of a parallel computer system using a non-unique key. The parallel computer system performs a simultaneous search on each node of the computer system using the non-unique key and then utilizes a global combining network to combine the local results from the searches of each node to efficiently and quickly search the entire database.
摘要翻译：装置和方法使用非唯一的密钥从并行计算机系统的存储器内数据库检索数据库记录。并行计算机系统使用非唯一密钥对计算机系统的每个节点进行同时搜索，然后利用全局组合网络将来自每个节点的搜索的本地结果组合以有效且快速地搜索整个数据库。

4. 发明授权

US08031614B2 Method and apparatus for routing data in an inter-nodal communications lattice of a massively parallel computer system by dynamic global mapping of contended links 失效
标题翻译：用于通过竞争链接的动态全局映射在大型并行计算机系统的节点间通信网格中路由数据的方法和装置
公开(公告)号：US08031614B2
公开(公告)日：2011-10-04
申请号：US11539248
申请日：2006-10-06
申请人： Charles Jens Archer , Roy Glenn Musselman , Amanda Peters , Kurt Walter Pinnow , Brent Allen Swartz , Brian Paul Wallenfelt
发明人： Charles Jens Archer , Roy Glenn Musselman , Amanda Peters , Kurt Walter Pinnow , Brent Allen Swartz , Brian Paul Wallenfelt
IPC分类号： G01R31/08
CPC分类号： H04L45/125 , H04L45/02 , H04L45/48
摘要： A massively parallel nodal computer system periodically collects and broadcasts usage data for an internal communications network. A node sending data over the network makes a global routing determination using the network usage data. Preferably, network usage data comprises an N-bit usage value for each output buffer associated with a network link. An optimum routing is determined by summing the N-bit values associated with each link through which a data packet must pass, and comparing the sums associated with different possible routes.
摘要翻译：大规模并行节点计算机系统周期性收集和广播内部通信网络的使用数据。通过网络发送数据的节点使用网络使用数据进行全局路由确定。优选地，网络使用数据包括与网络链路相关联的每个输出缓冲器的N位使用值。通过对与数据分组必须通过的每个链路相关联的N比特值进行求和并且比较与不同可能路由相关联的和来确定最佳路由。

5. 发明申请

US20110191633A1 PARALLEL DEBUGGING IN A MASSIVELY PARALLEL COMPUTING SYSTEM 有权
标题翻译：并行调试在大规模并行计算系统中
公开(公告)号：US20110191633A1
公开(公告)日：2011-08-04
申请号：US12697721
申请日：2010-02-01
申请人： Charles Jens Archer , Todd Alan Inglett
发明人： Charles Jens Archer , Todd Alan Inglett
IPC分类号： G06F11/263
CPC分类号： G06F11/263
摘要： A method and apparatus is described for parallel debugging on the data nodes of a parallel computer system. A data template associated with the debugger can be used as a reference to the common data on the nodes. The application or data contained on the compute nodes diverges from the data template at the service node during the course of program execution, so that pieces of the data are different at each of the nodes at some time of interest. For debugging, the compute nodes search their own memory image for checksum matches with the template and produces new data blocks with checksums that didn't exist in the data template, and a template of references to the original data blocks in the template. Examples herein include an application of the rsync protocol, compression and network broadcast to improve debugging in a massively parallel computer environment.
摘要翻译：描述了用于并行计算机系统的数据节点上的并行调试的方法和装置。与调试器相关联的数据模板可以用作对节点上的公共数据的引用。包含在计算节点上的应用程序或数据在程序执行过程中从服务节点处的数据模板发散，使得在某些感兴趣的时间点，每个节点上的数据片段不同。为了进行调试，计算节点搜索自己的内存映像以与模板进行校验和匹配，并生成新的数据块，其中包含数据模板中不存在校验和的新数据块，以及模板中原始数据块的引用模板。本文的示例包括rsync协议，压缩和网络广播的应用，以改进大规模并行计算机环境中的调试。

6. 发明申请

US20100318835A1 BISECTIONAL FAULT DETECTION SYSTEM 失效
标题翻译：双向故障检测系统
公开(公告)号：US20100318835A1
公开(公告)日：2010-12-16
申请号：US12196931
申请日：2008-08-22
申请人： Charles Jens Archer , Kurt Walter Pinnow , Joseph D. Ratterman , Brian Edward Smith
发明人： Charles Jens Archer , Kurt Walter Pinnow , Joseph D. Ratterman , Brian Edward Smith
IPC分类号： G06F11/07 , G06F11/20
CPC分类号： G06F11/2236
摘要： An apparatus, program product and method logically divide a group of nodes and causes node pairs comprising a node from each section to communicate. Results from the communications may be analyzed to determine performance characteristics, such as bandwidth and proper connectivity.
摘要翻译：装置，程序产品和方法在逻辑上划分一组节点并且使得包括来自每个部分的节点的节点对进行通信。可以分析来自通信的结果以确定性能特征，例如带宽和适当的连接性。

7. 发明授权

US07835284B2 Method and apparatus for routing data in an inter-nodal communications lattice of a massively parallel computer system by routing through transporter nodes 失效
标题翻译：用于通过传送节点路由在大型并行计算机系统的节点间通信网格中路由数据的方法和装置
公开(公告)号：US07835284B2
公开(公告)日：2010-11-16
申请号：US11539300
申请日：2006-10-06
申请人： Charles Jens Archer , Roy Glenn Musselman , Amanda Peters , Kurt Walter Pinnow , Brent Allen Swartz , Brian Paul Wallenfelt
发明人： Charles Jens Archer , Roy Glenn Musselman , Amanda Peters , Kurt Walter Pinnow , Brent Allen Swartz , Brian Paul Wallenfelt
IPC分类号： G01R31/08 , G06F11/00 , H04L12/50 , H04Q11/00
CPC分类号： H04L45/00 , H04L45/42
摘要： A massively parallel computer system contains an inter-nodal communications network of node-to-node links. An automated routing strategy routes packets through one or more intermediate nodes of the network to reach a destination. Some packets are constrained to be routed through respective designated transporter nodes, the automated routing strategy determining a path from a respective source node to a respective transporter node, and from a respective transporter node to a respective destination node. Preferably, the source node chooses a routing policy from among multiple possible choices, and that policy is followed by all intermediate nodes. The use of transporter nodes allows greater flexibility in routing.
摘要翻译：大型并行计算机系统包含节点到节点链路的节点间通信网络。自动路由策略通过网络的一个或多个中间节点路由分组以到达目的地。一些分组被限制为通过相应的指定的传送器节点路由，自动路由策略确定从相应的源节点到相应的传输节点以及从相应的传输节点到相应目的地节点的路径。优选地，源节点从多个可能的选择中选择路由策略，并且该策略遵循所有中间节点。运输节点的使用允许更大的路由灵活性。

8. 发明授权

US07627783B2 Template based parallel checkpointing in a massively parallel computer system 失效
标题翻译：在大规模并行计算机系统中基于模板的并行检查点
公开(公告)号：US07627783B2
公开(公告)日：2009-12-01
申请号：US12104224
申请日：2008-04-16
申请人： Charles Jens Archer , Todd Alan Inglett
发明人： Charles Jens Archer , Todd Alan Inglett
IPC分类号： G06F11/00
CPC分类号： G06F11/1438 , G06F11/1451
摘要： A method and apparatus for a template based parallel checkpoint save for a massively parallel super computer system using a parallel variation of the rsync protocol, and network broadcast. In preferred embodiments, the checkpoint data for each node is compared to a template checkpoint file that resides in the storage and that was previously produced. Embodiments herein greatly decrease the amount of data that must be transmitted and stored for faster checkpointing and increased efficiency of the computer system. Embodiments are directed to a parallel computer system with nodes arranged in a cluster with a high speed interconnect that can perform broadcast communication. The checkpoint contains a set of actual small data blocks with their corresponding checksums from all nodes in the system. The data blocks may be compressed using conventional non-lossy data compression algorithms to further reduce the overall checkpoint size.
摘要翻译：一种用于基于模板的并行检查点的方法和装置，用于使用rsync协议的并行变体和网络广播的大规模并行超级计算机系统。在优选实施例中，将每个节点的检查点数据与驻留在存储器中并且之前产生的模板检查点文件进行比较。本文的实施方式大大减少了必须发送和存储的数据量，以便更快地检查点和提高计算机系统的效率。实施例涉及具有布置在具有可执行广播通信的高速互连的集群中的节点的并行计算机系统。检查点包含一系列具有系统中所有节点的相应校验和的实际小数据块。可以使用常规的非有损数据压缩算法来压缩数据块，以进一步减少总体检查点大小。

9. 发明授权

US07506197B2 Multi-directional fault detection system 失效
标题翻译：多方向故障检测系统
公开(公告)号：US07506197B2
公开(公告)日：2009-03-17
申请号：US11052661
申请日：2005-02-07
申请人： Charles Jens Archer , Kurt Walter Pinnow , Joseph D. Ratterman , Brian Edward Smith
发明人： Charles Jens Archer , Kurt Walter Pinnow , Joseph D. Ratterman , Brian Edward Smith
IPC分类号： G06F11/00
CPC分类号： G06F11/2242
摘要： An apparatus, program product and method checks for nodal faults in a group of nodes comprising a center node and all adjacent nodes. The center node concurrently communicates with the immediately adjacent nodes in three dimensions. The communications are analyzed to determine a presence of a faulty node or connection.
摘要翻译：装置，程序产品和方法检查包括中心节点和所有相邻节点的一组节点中的节点故障。中心节点同时与三个相邻的节点进行通信。分析通信以确定故障节点或连接的存在。

10. 发明申请

US20090067334A1 MECHANISM FOR PROCESS MIGRATION ON A MASSIVELY PARALLEL COMPUTER 失效
标题翻译：一个大规模并行计算机进程迁移的机制
公开(公告)号：US20090067334A1
公开(公告)日：2009-03-12
申请号：US11853927
申请日：2007-09-12
申请人： Charles Jens Archer , David L. Darrington , Patrick Joseph McCarthy , Amanda Peters , Albert Sidelnik
发明人： Charles Jens Archer , David L. Darrington , Patrick Joseph McCarthy , Amanda Peters , Albert Sidelnik
IPC分类号： G08C15/00
CPC分类号： G06F9/4856 , G06F9/461 , G06F9/546
摘要： Embodiments off the invention provide a mechanism for process migration on a massively parallel computer system. In particular, embodiments of the invention may be used to update process state data for a migrated compute node, such as MPI (or other communication library) state data, across a full collection of compute nodes present in a given parallel system executing a parallel task. Migrating a process form one compute node to another may be useful to address a variety of sub-optimal operating conditions. For example, one or more processes may be migrated to cure network congestion resulting from a poorly mapped task or when a compute node is predicted to experience a hardware failure.
摘要翻译：本发明的实施例提供了一种用于大规模并行计算机系统上的过程迁移的机制。特别地，可以使用本发明的实施例来跨越在执行并行任务的给定并行系统中存在的计算节点的整个集合来更新用于迁移的计算节点（例如MPI（或其他通信库））状态数据的进程状态数据。将一个计算节点迁移到另一个计算节点可能有助于解决各种次优的运行条件。例如，可以迁移一个或多个进程以修复由映射不良的任务引起的网络拥塞，或者当预测计算节点经历硬件故障时。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式