会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Message passing with queues and channels
    • 消息传递与队列和通道
    • US08543722B2
    • 2013-09-24
    • US12750363
    • 2010-03-30
    • Gabor J. DozsaPhilip HeidelbergerSameer KumarJoseph D. RattermanBurkhard Steinmacher-BurowRobert W. Wisniewski
    • Gabor J. DozsaPhilip HeidelbergerSameer KumarJoseph D. RattermanBurkhard Steinmacher-BurowRobert W. Wisniewski
    • G06F15/16
    • G06F9/546
    • In an embodiment, a send thread receives an identifier that identifies a destination node and a pointer to data. The send thread creates a first send request in response to the receipt of the identifier and the data pointer. The send thread selects a selected channel from among a plurality of channels. The selected channel comprises a selected hand-off queue and an identification of a selected message unit. Each of the channels identifies a different message unit. The selected hand-off queue is randomly accessible. If the selected hand-off queue contains an available entry, the send thread adds the first send request to the selected hand-off queue. If the selected hand-off queue does not contain an available entry, the send thread removes a second send request from the selected hand-off queue and sends the second send request to the selected message unit.
    • 在一个实施例中,发送线程接收标识目的地节点的标识符和指向数据的指针。 响应于接收到标识符和数据指针,发送线程创建第一个发送请求。 发送线程从多个信道中选择所选择的信道。 所选择的信道包括所选择的切换队列和所选择的消息单元的标识。 每个通道标识不同的消息单元。 所选择的切换队列是随机访问的。 如果所选择的切换队列包含可用条目,则发送线程将第一个发送请求添加到所选择的切换队列。 如果所选择的切换队列不包含可用条目,则发送线程从所选择的切换队列中移除第二发送请求,并将第二发送请求发送到所选择的消息单元。
    • 4. 发明申请
    • MESSAGE PASSING WITH A LIMITED NUMBER OF DMA BYTE COUNTERS
    • 消息传递与有限数量的DMA字节计数器
    • US20090007141A1
    • 2009-01-01
    • US11768813
    • 2007-06-26
    • Michael BlocksomeDong ChenMark E. GiampapaPhilip HeidelbergerSameer KumarJeffrey J. Parker
    • Michael BlocksomeDong ChenMark E. GiampapaPhilip HeidelbergerSameer KumarJeffrey J. Parker
    • G06F9/44
    • G06F15/17356G06F9/546
    • A method for passing messages in a parallel computer system constructed as a plurality of compute nodes interconnected as a network where each compute node includes a DMA engine but includes only a limited number of byte counters for tracking a number of bytes that are sent or received by the DMA engine, where the byte counters may be used in shared counter or exclusive counter modes of operation. The method includes using rendezvous protocol, a source compute node deterministically sending a request to send (RTS) message with a single RTS descriptor using an exclusive injection counter to track both the RTS message and message data to be sent in association with the RTS message, to a destination compute node such that the RTS descriptor indicates to the destination compute node that the message data will be adaptively routed to the destination node. Using one DMA FIFO at the source compute node, the RTS descriptors are maintained for rendezvous messages destined for the destination compute node to ensure proper message data ordering thereat. Using a reception counter at a DMA engine, the destination compute node tracks reception of the RTS and associated message data and sends a clear to send (CTS) message to the source node in a rendezvous protocol form of a remote get to accept the RTS message and message data and processing the remote get (CTS) by the source compute node DMA engine to provide the message data to be sent.
    • 一种在并行计算机系统中传送消息的方法,该并行计算机系统被构造为作为网络互连的多个计算节点,其中每个计算节点包括DMA引擎,但是仅包括有限数量的字节计数器,用于跟踪由 DMA引擎,其中可以在共享计数器或专用计数器操作模式中使用字节计数器。 该方法包括使用会合协议,源计算节点使用专用注入计数器确定性地发送具有单个RTS描述符的请求(RTS)消息以跟踪要与RTS消息相关联地发送的RTS消息和消息数据, 到目的地计算节点,使得RTS描述符向目标计算节点指示消息数据将自适应地路由到目的地节点。 在源计算节点使用一个DMA FIFO,将为发往目的地计算节点的会合消息保留RTS描述符,以确保正确的消息数据顺序。 在DMA引擎上使用接收计数器,目的地计算节点跟踪RTS和相关联的消息数据的接收,并以远程获取的会合协议形式向源节点发送明确发送(CTS)消息以接受RTS消息 和消息数据,并由源计算节点DMA引擎处理远程获取(CTS)以提供要发送的消息数据。
    • 5. 发明申请
    • Message Passing with Queues and Channels
    • 消息传递与队列和通道
    • US20110265098A1
    • 2011-10-27
    • US12764315
    • 2010-04-21
    • Gabor J. DozsaPhilip HeidelbergerSameer KumarJoseph D. RattermanBurkhard Steinmacher-Burow
    • Gabor J. DozsaPhilip HeidelbergerSameer KumarJoseph D. RattermanBurkhard Steinmacher-Burow
    • G06F9/54
    • G06F9/546G06F2209/548
    • In an embodiment, a reception thread receives a source node identifier, a type, and a data pointer from an application and, in response, creates a receive request. If the source node identifier specifies a source node, the reception thread adds the receive request to a fast-post queue. If a message received from a network does not match a receive request on a posted queue, a polling thread adds a receive request that represents the message to an unexpected queue. If the fast-post queue contains the receive request, the polling thread removes the receive request from the fast-post queue. If the receive request that was removed from the fast-post queue does not match the receive request on the unexpected queue, the polling thread adds the receive request that was removed from the fast-post queue to the posted queue. The reception thread and the polling thread execute asynchronously from each other.
    • 在一个实施例中,接收线程从应用接收源节点标识符,类型和数据指针,作为响应,创建接收请求。 如果源节点标识符指定源节点,则接收线程将接收请求添加到快速发送队列。 如果从网络接收到的消息与发布的队列上的接收请求不匹配,轮询线程将将消息的接收请求添加到意外队列。 如果快速发送队列包含接收请求,轮询线程将从快速发送队列中删除接收请求。 如果从快速发布队列中删除的接收请求与意外队列中的接收请求不匹配,轮询线程将从快速发布队列中删除的接收请求添加到发布的队列。 接收线程和轮询线程彼此异步执行。
    • 6. 发明授权
    • Message passing with a limited number of DMA byte counters
    • 消息传递有限数量的DMA字节计数器
    • US08032892B2
    • 2011-10-04
    • US11768813
    • 2007-06-26
    • Michael BlocksomeDong ChenMark E. GiampapaPhilip HeidelbergerSameer KumarJeffrey J. Parker
    • Michael BlocksomeDong ChenMark E. GiampapaPhilip HeidelbergerSameer KumarJeffrey J. Parker
    • G06F9/44G06F9/46G06F13/00G06F15/167
    • G06F15/17356G06F9/546
    • A method for passing messages in a parallel computer system constructed as a plurality of compute nodes interconnected as a network where each compute node includes a DMA engine but includes only a limited number of byte counters for tracking a number of bytes that are sent or received by the DMA engine, where the byte counters may be used in shared counter or exclusive counter modes of operation. The method includes using rendezvous protocol, a source compute node deterministically sending a request to send (RTS) message with a single RTS descriptor using an exclusive injection counter to track both the RTS message and message data to be sent in association with the RTS message, to a destination compute node such that the RTS descriptor indicates to the destination compute node that the message data will be adaptively routed to the destination node. Using one DMA FIFO at the source compute node, the RTS descriptors are maintained for rendezvous messages destined for the destination compute node to ensure proper message data ordering thereat. Using a reception counter at a DMA engine, the destination compute node tracks reception of the RTS and associated message data and sends a clear to send (CTS) message to the source node in a rendezvous protocol form of a remote get to accept the RTS message and message data and processing the remote get (CTS) by the source compute node DMA engine to provide the message data to be sent.
    • 一种在并行计算机系统中传送消息的方法,该并行计算机系统被构造为作为网络互连的多个计算节点,其中每个计算节点包括DMA引擎,但是仅包括有限数量的字节计数器,用于跟踪由 DMA引擎,其中可以在共享计数器或专用计数器操作模式中使用字节计数器。 该方法包括使用会合协议,源计算节点使用专用注入计数器确定性地发送具有单个RTS描述符的请求(RTS)消息以跟踪要与RTS消息相关联地发送的RTS消息和消息数据, 到目的地计算节点,使得RTS描述符向目标计算节点指示消息数据将自适应地路由到目的地节点。 在源计算节点使用一个DMA FIFO,将为发往目的地计算节点的会合消息保留RTS描述符,以确保正确的消息数据顺序。 在DMA引擎上使用接收计数器,目的地计算节点跟踪RTS和相关联的消息数据的接收,并以远程获取的会合协议形式向源节点发送明确发送(CTS)消息以接受RTS消息 和消息数据,并由源计算节点DMA引擎处理远程获取(CTS)以提供要发送的消息数据。
    • 8. 发明申请
    • Recording A Communication Pattern and Replaying Messages in a Parallel Computing System
    • 在并行计算系统中记录通信模式和回放消息
    • US20110010471A1
    • 2011-01-13
    • US12500715
    • 2009-07-10
    • Philip HeidelbergerSameer Kumar
    • Philip HeidelbergerSameer Kumar
    • G06F13/28
    • G06F13/28
    • A parallel computer system includes a plurality of compute nodes. Each of the compute nodes includes at least one processor, at least one memory, and a direct memory address engine coupled to the at least one processor and the at least one memory. The system also includes a network interconnecting the plurality of compute nodes. The network operates a global message-passing application for performing communications across the network. Local instances of the global message-passing application operate at each of the compute nodes to carry out local processing operations independent of processing operations carried out at another one of the compute nodes. The direct memory address engines are configured to interact with the local instances of the global message-passing application via injection FIFO metadata describing an injection FIFO in a corresponding one of the memories. The local instances of the global message passing application are configured to record, in the injection FIFO in the corresponding one of the memories, message descriptors associated with messages of an arbitrary communication pattern in an iteration of an executing application program. The local instances of the global message passing application are configured to replay the message descriptors during a subsequent iteration of the executing application program.
    • 并行计算机系统包括多个计算节点。 每个计算节点包括耦合到至少一个处理器和至少一个存储器的至少一个处理器,至少一个存储器和直接存储器地址引擎。 该系统还包括互连多个计算节点的网络。 该网络运行全球消息传递应用程序,用于跨网络执行通信。 全局消息传递应用的本地实例在每个计算节点处操作,以独立于在另一个计算节点处执行的处理操作来执行本地处理操作。 直接存储器地址引擎被配置为通过描述在对应的一个存储器中的注入FIFO的注入FIFO元数据与全局消息传递应用的本地实例进行交互。 全局消息传递应用程序的本地实例被配置为在执行的应用程序的迭代中在对应的一个存储器中的注入FIFO中记录与任意通信模式的消息相关联的消息描述符。 全局消息传递应用程序的本地实例被配置为在执行的应用程序的后续迭代期间重播消息描述符。