会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 22. 发明授权
    • Scalable processor
    • 可扩展处理器
    • US06978360B2
    • 2005-12-20
    • US09854243
    • 2001-05-11
    • Gianfranco BilardiKattamuri EkanadhamPratap Chandra Pattnaik
    • Gianfranco BilardiKattamuri EkanadhamPratap Chandra Pattnaik
    • G06F9/30G06F9/312G06F9/34G06F9/38
    • G06F9/30043G06F9/383G06F9/3834G06F9/3885
    • A method and apparatus for issuing and executing memory instructions from a computer system so as to (1) maximize the number of requests issued to a highly pipe-lined memory, the only limitation being data dependencies in the program and (2) avoid reading data from memory before a corresponding write to memory. The memory instructions are organized to read and write into memory, by using explicit move instructions, thereby avoiding any data storage limitations in the processor. The memory requests are organized to carry complete information, so that they can be processed independently when memory returns the requested data. The memory is divided into a number of regions, each of which is associated with a fence counter. The fence counter for a memory region is incremented each time a memory instruction that is targeted to the memory region is issued and decremented each time there is a write to the memory region. After a fence instruction is issued, no further memory instructions are issued if the counter for the memory region specified in the fence instruction is above a threshold. When a sufficient number of the outstanding issued instructions are executed, the counter will be decremented below the threshold and further memory instructions are then issued.
    • 一种用于从计算机系统发出和执行存储器指令的方法和装置,以便(1)最大限度地发送给高度管道内存的请求数量,唯一的限制是程序中的数据依赖性,以及(2)避免读取数据 从内存之前对相应的写入内存。 存储器指令通过使用显式移动指令被组织以读取和写入存储器,从而避免了处理器中的任何数据存储限制。 存储器请求被组织以携带完整的信息,使得当存储器返回所请求的数据时它们可以被独立地处理。 存储器被分成多个区域,每个区域与栅栏计数器相关联。 每当存储区域的存储器指令被发出并且每次对存储器区域进行写操作时,存储器区域的栅栏计数器递增。 发出栅栏指令后,如果栅栏指令中指定的存储器区域的计数器高于阈值,则不会再发出存储指令。 当执行足够数量的未完成的发出的指令时,计数器将递减到阈值以下,然后再发出存储器指令。
    • 25. 发明授权
    • Seamless interface for multi-threaded core accelerators
    • 多线程核心加速器的无缝界面
    • US08683175B2
    • 2014-03-25
    • US13048214
    • 2011-03-15
    • Kattamuri EkanadhamHung Q. LeJose E. MoreiraPratap C. Pattnaik
    • Kattamuri EkanadhamHung Q. LeJose E. MoreiraPratap C. Pattnaik
    • G06F9/30G06F12/10
    • G06F9/3877G06F9/30043G06F9/3012G06F9/30123G06F9/3851G06F12/1027
    • A method, system and computer program product are disclosed for interfacing between a multi-threaded processing core and an accelerator. In one embodiment, the method comprises copying from the processing core to the hardware accelerator memory address translations for each of multiple threads operating on the processing core, and simultaneously storing on the hardware accelerator one or more of the memory address translations for each of the threads. Whenever any one of the multiple threads operating on the processing core instructs the hardware accelerator to perform a specified operation, the hardware accelerator has stored thereon one or more of the memory address translations for the any one of the threads. This facilitates starting that specified operation without memory translation faults. In an embodiment, the copying includes, each time one of the memory address translations is updated on the processing core, copying the updated one of the memory address translations to the hardware accelerator.
    • 公开了用于在多线程处理核心和加速器之间进行接口的方法,系统和计算机程序产品。 在一个实施例中,该方法包括从处理核心复制到在处理核心上操作的多个线程中的每个线程的硬件加速器存储器地址转换,以及同时在硬件加速器上存储每个线程的一个或多个存储器地址转换 。 只要在处理核心上操作的多个线程中的任何一个指示硬件加速器执行指定的操作,则硬件加速器在其上存储有针对任何一个线程的一个或多个存储器地址转换。 这有助于启动指定的操作,而不会出现内存转换错误。 在一个实施例中,复制包括每次在处理核心上更新一个存储器地址转换时,将更新的一个存储器地址转换复制到硬件加速器。
    • 26. 发明授权
    • Scalable memory
    • 可扩展内存
    • US07107399B2
    • 2006-09-12
    • US09854213
    • 2001-05-11
    • Gianfranco BilardiKattamuri EkanadhamPratap Chandra Pattnaik
    • Gianfranco BilardiKattamuri EkanadhamPratap Chandra Pattnaik
    • G06F12/00
    • G11C7/1057G06F12/08G11C7/10G11C7/1006G11C7/1051G11C7/1078G11C7/1084
    • A memory structure and method for handling memory requests from a processor and for returning correspondence responses to the processor from various levels of the memory structure. The memory levels of the memory structure are interconnected by a forward and return path with the return path having twice the bandwidth of the forward path. An algorithm is used to determine how many responses are sent from each memory level on the return path to the processor. This algorithm is designed to guarantee a constant bound on the rate of responses sent to the processor. More specifically, if a write request is at the same level to which it is targeted, or if a request at a memory level is targeted to a higher memory level, then two responses are forwarded from a controller at the memory level on the return path to the processor. Otherwise, only one response is forwarded from the memory level on the return path.
    • 一种用于处理来自处理器的存储器请求并用于从存储器结构的各个级别返回到处理器的对应响应的存储器结构和方法。 存储器结构的存储器级别通过前向和返回路径互连,返回路径具有正向路径的两倍的带宽。 一种算法用于确定从返回路径上的每个存储器级别发送到处理器的响应数量。 该算法旨在保证对发送到处理器的响应速率的恒定限制。 更具体地说,如果写入请求处于与其所针对的相同级别,或者如果存储器级别的请求针对更高的存储器级别,则在返回路径上的存储器级的控制器处转发两个响应 到处理器。 否则,只有一个响应从返回路径上的内存级别转发。
    • 28. 发明授权
    • Self-parallelizing computer system and method
    • 自并行计算机系统及方法
    • US5347639A
    • 1994-09-13
    • US731224
    • 1991-07-15
    • Rudolph N. RechtschaffenKattamuri Ekanadham
    • Rudolph N. RechtschaffenKattamuri Ekanadham
    • G06F9/38G06F9/44G06F15/16G06F15/177G06F9/30
    • G06F8/45
    • A self-parallelizing computer system and method asynchronously processes execution sequences of instructions in two modes of execution on a set of processing elements which communicate with each other. Each processing element is capable of decoding instructions, generating memory operand addresses, executing instructions and referencing and updating its own set of general purpose registers. These processing elements act in concert during the first mode of execution not only to execute the instructions in an execution sequence but also to partition an execution sequence into separate instruction subsequences. The separate instruction subsequences are stored along with additional information which will allow the stored subsequences to be correctly executed in parallel. Subsequent re-execution of the same execution sequence is done much faster in the second mode of execution, since each of the processing elements decodes and executes only the instructions in one of the subsequences while the other processing elements are concurrently each doing the same with another one of the subsequences.
    • 自并行计算机系统和方法以两种通信方式异步处理两种执行模式的指令执行顺序。 每个处理元件能够解码指令,产生存储器操作数地址,执行指令,并引用和更新其自己的通用寄存器集合。 这些处理元件在第一执行模式下一致地执行,不仅在执行顺序中执行指令,而且还将执行序列分割成单独的指令子序列。 单独的指令子序列与附加信息一起存储,这将允许存储的子序列被并行地正确执行。 在第二种执行模式中,相同执行顺序的后续重新执行的速度要快得多,因为每个处理元件仅解码并执行其中一个子序列中的指令,而其他处理元件同时对其他处理元件执行相同操作 其中一个子序列。