会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 31. 发明申请
    • METHOD AND APPARATUS FOR EFFICIENT HELPER THREAD STATE INITIALIZATION USING INTER-THREAD REGISTER COPY
    • 使用内部线程寄存器复制的有效帮助螺纹状态初始化的方法和装置
    • US20110296431A1
    • 2011-12-01
    • US12787128
    • 2010-05-25
    • Michael K. GschwindJohn K. O'BrienValentina SalapuraZehra N. Sura
    • Michael K. GschwindJohn K. O'BrienValentina SalapuraZehra N. Sura
    • G06F9/46
    • G06F9/544
    • This disclosure describes a method and system that may enable fast, hardware-assisted, producer-consumer style communication of values between threads. The method, in one aspect, uses a dedicated hardware buffer as an intermediary storage for transferring values from registers in one thread to registers in another thread. The method may provide a generic, programmable solution that can transfer any subset of register values between threads in any given order, where the source and target registers may or may not be correlated. The method also may allow for determinate access times, since it completely bypasses the memory hierarchy. Also, the method is designed to be lightweight, focusing on communication, and keeping synchronization facilities orthogonal to the communication mechanism. It may be used by a helper thread that performs data prefetching for an application thread, for example, to initialize the upward-exposed reads in the address computation slice of the helper thread code.
    • 本公开描述了一种方法和系统,其可以实现线程之间的值的快速,硬件辅助,生产者 - 消费者风格的通信。 该方法在一个方面中使用专用硬件缓冲器作为用于将值从一个线程中的寄存器传送到另一线程中的寄存器的中间存储器。 该方法可以提供通用的可编程解决方案,其可以以任何给定的顺序在线程之间传送寄存器值的任何子集,其中源寄存器和目标寄存器可以或可以不相关。 该方法还可以允许确定的访问时间,因为它完全绕过存储器层次结构。 此外,该方法被设计为轻量级,专注于通信,并保持与通信机制正交的同步设备。 它可以由对应用程序线程执行数据预取的辅助线程使用,例如,初始化辅助线程代码的地址计算切片中的向上暴露的读取。
    • 32. 发明申请
    • LOW COMPLEXITY SPECULATIVE MULTITHREADING SYSTEM BASED ON UNMODIFIED MICROPROCESSOR CORE
    • 基于未修改的微处理器核心的低复杂度测量多路复用系统
    • US20080263280A1
    • 2008-10-23
    • US12147914
    • 2008-06-27
    • Alan G. GaraMichael K. GschwindValentina Salapura
    • Alan G. GaraMichael K. GschwindValentina Salapura
    • G06F12/08
    • G06F12/0811G06F9/3828G06F9/3842G06F9/3851G06F12/0815G06F2212/507
    • A system, method and computer program product for supporting thread level speculative execution in a computing environment having multiple processing units adapted for concurrent execution of threads in speculative and non-speculative modes. Each processing unit includes a cache memory hierarchy of caches operatively connected therewith. The apparatus includes an additional cache level local to each processing unit for use only in a thread level speculation mode, each additional cache for storing speculative results and status associated with its associated processor when handling speculative threads. The additional local cache level at each processing unit are interconnected so that speculative values and control data may be forwarded between parallel executing threads. A control implementation is provided that enables speculative coherence between speculative threads executing in the computing environment.
    • 一种用于在具有多个处理单元的计算环境中支持线程级推测性执行的系统,方法和计算机程序产品,该处理单元适于以推测和非推测模式并行执行线程。 每个处理单元包括与其可操作地连接的高速缓存的高速缓冲存储器层级。 该装置包括仅在线程级推测模式中使用的每个处理单元本地的附加高速缓存级别,每个附加高速缓存用于存储推测结果以及处理推测性线程时与其相关联的处理器相关联的状态。 在每个处理单元处的附加本地高速缓存级别互连,使得推测值和控制数据可以在并行执行线程之间转发。 提供了一种控制实现,其实现在计算环境中执行的推测线程之间的推测性一致性。
    • 35. 发明授权
    • Instruction merging optimization
    • 指令合并优化
    • US09513915B2
    • 2016-12-06
    • US13432537
    • 2012-03-28
    • Michael K. GschwindValentina Salapura
    • Michael K. GschwindValentina Salapura
    • G06F9/30G06F9/38
    • G06F9/30181G06F9/3017G06F9/3836G06F9/384
    • A computer system for optimizing instructions includes a processor including an instruction execution unit configured to execute instructions and an instruction optimization unit configured to optimize instructions and memory to store machine instructions to be executed by the instruction execution unit. The computer system is configured to perform a method including analyzing machine instructions from among a stream of instructions to be executed by the instruction execution unit, the machine instructions including a memory load instruction and a data processing instruction to perform a data processing function based on the memory load instruction, identifying the machine instructions as being eligible for optimization, merging the machine instructions into a single optimized internal instruction, and executing the single optimized internal instruction to perform a memory load function and a data processing function corresponding to the memory load instruction and the data processing instruction.
    • 用于优化指令的计算机系统包括:处理器,包括被配置为执行指令的指令执行单元和被配置为优化指令和存储器以存储由指令执行单元执行的机器指令的指令优化单元。 计算机系统被配置为执行一种方法,包括从由指令执行单元执行的指令流中分析机器指令,包括存储器加载指令的机器指令和数据处理指令,以执行基于 存储器加载指令,将机器指令识别为符合优化的要求,将机器指令合并到单个优化的内部指令中,以及执行单个优化内部指令以执行对应于存储器加载指令的存储器加载功能和数据处理功能,以及 数据处理指令。
    • 39. 发明授权
    • Method and apparatus for efficient helper thread state initialization using inter-thread register copy
    • 使用线程间寄存器复制的有效帮助线程状态初始化的方法和装置
    • US08453161B2
    • 2013-05-28
    • US12787128
    • 2010-05-25
    • Michael K. GschwindJohn K. O'BrienValentina SalapuraZehra N. Sura
    • Michael K. GschwindJohn K. O'BrienValentina SalapuraZehra N. Sura
    • G06F13/00G06F12/00G06F9/30
    • G06F9/544
    • This disclosure describes a method and system that may enable fast, hardware-assisted, producer-consumer style communication of values between threads. The method, in one aspect, uses a dedicated hardware buffer as an intermediary storage for transferring values from registers in one thread to registers in another thread. The method may provide a generic, programmable solution that can transfer any subset of register values between threads in any given order, where the source and target registers may or may not be correlated. The method also may allow for determinate access times, since it completely bypasses the memory hierarchy. Also, the method is designed to be lightweight, focusing on communication, and keeping synchronization facilities orthogonal to the communication mechanism. It may be used by a helper thread that performs data prefetching for an application thread, for example, to initialize the upward-exposed reads in the address computation slice of the helper thread code.
    • 本公开描述了一种方法和系统,其可以实现线程之间的值的快速,硬件辅助,生产者 - 消费者风格的通信。 该方法在一个方面中使用专用硬件缓冲器作为用于将值从一个线程中的寄存器传送到另一线程中的寄存器的中间存储器。 该方法可以提供通用的可编程解决方案,其可以以任何给定的顺序在线程之间传送寄存器值的任何子集,其中源寄存器和目标寄存器可以或可以不相关。 该方法还可以允许确定的访问时间,因为它完全绕过存储器层次结构。 此外,该方法被设计为轻量级,专注于通信,并保持与通信机制正交的同步设备。 它可以由对应用程序线程执行数据预取的辅助线程使用,例如,初始化辅助线程代码的地址计算切片中的向上暴露的读取。