会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 41. 发明授权
    • Low complexity speculative multithreading system based on unmodified microprocessor core
    • 基于未修改的微处理器核心的低复杂度推测性多线程系统
    • US07836260B2
    • 2010-11-16
    • US12147914
    • 2008-06-27
    • Alan G. GaraMichael K. GschwindValentina Salapura
    • Alan G. GaraMichael K. GschwindValentina Salapura
    • G06F12/00
    • G06F12/0811G06F9/3828G06F9/3842G06F9/3851G06F12/0815G06F2212/507
    • A system, method and computer program product for supporting thread level speculative execution in a computing environment having multiple processing units adapted for concurrent execution of threads in speculative and non-speculative modes. Each processing unit includes a cache memory hierarchy of caches operatively connected therewith. The apparatus includes an additional cache level local to each processing unit for use only in a thread level speculation mode, each additional cache for storing speculative results and status associated with its associated processor when handling speculative threads. The additional local cache level at each processing unit are interconnected so that speculative values and control data may be forwarded between parallel executing threads. A control implementation is provided that enables speculative coherence between speculative threads executing in the computing environment.
    • 一种用于在具有多个处理单元的计算环境中支持线程级推测性执行的系统,方法和计算机程序产品,该处理单元适于以推测和非推测模式并行执行线程。 每个处理单元包括与其可操作地连接的高速缓存的高速缓冲存储器层级。 该装置包括仅在线程级推测模式中使用的每个处理单元本地的附加高速缓存级别,每个附加高速缓存用于存储推测结果以及处理推测性线程时与其相关联的处理器相关联的状态。 在每个处理单元处的附加本地高速缓存级别互连,使得推测值和控制数据可以在并行执行线程之间转发。 提供了一种控制实现,其实现在计算环境中执行的推测线程之间的推测性一致性。
    • 42. 发明授权
    • Method and apparatus for efficient performance monitoring of a large number of simultaneous events
    • 用于高效率监测大量同时事件的方法和装置
    • US07461383B2
    • 2008-12-02
    • US11507307
    • 2006-08-21
    • Alan G. GaraMichael K. GschwindValentina Salapura
    • Alan G. GaraMichael K. GschwindValentina Salapura
    • G06F3/00G06F9/44G06F9/46G06F13/00
    • G06F11/348Y02D10/34
    • A system for monitoring a large number of simultaneous events implements a hybrid counter array device having a first counter portion comprising counter devices, each counter device for receiving signals representing occurrences of events from an event source and providing a first count value corresponding to a lower order bits of the hybrid counter array. A second counter portion comprises a memory array device having addressable memory locations in correspondence with the counter devices, each addressable memory location for storing a second count value representing higher order bits. A control device monitors each of the counter devices and initiates updating a value of a corresponding second count value stored at the corresponding addressable memory location. The system includes interrupt pre-indication for providing fast interrupt trigger to a processor device when a count value related to an event equals a threshold value. A data transfer sub-system additionally enables one or more of: read access or write access to both the count values in the first and second counter portions over a narrow bus, the read/write access for purposes of initializing and determining status of the count values for a monitored event type in response to a processor device request.
    • 一种用于监视大量同时事件的系统实现了具有包括计数器装置的第一计数器部分的混合计数器阵列装置,每个计数器装置用于接收表示从事件源发生的事件的信号,并提供对应于较低次序的第一计数值 混合计数器阵列的位。 第二计数器部分包括具有与计数器装置对应的可寻址存储器位置的存储器阵列器件,每个可寻址存储器位置用于存储表示较高阶位的第二计数值。 控制装置监视每个计数器装置并且启动更新存储在相应的可寻址存储器位置处的对应的第二计数值的值。 当与事件相关的计数值等于阈值时,该系统包括用于向处理器设备提供快速中断触发的中断预指示。 数据传输子系统另外启用以下一个或多个:通过窄总线对第一和第二计数器部分中的计数值进行读访问或写入访问,用于初始化和确定计数状态的读/写访问 响应于处理器设备请求的被监视事件类型的值。
    • 44. 发明申请
    • PERFORMING PREDECODE-TIME OPTIMIZED INSTRUCTIONS IN CONJUNCTION WITH PREDECODE TIME OPTIMIZED INSTRUCTION SEQUENCE CACHING
    • 与预定时间优化的指令序列缓存执行预定时间优化的指令
    • US20130262821A1
    • 2013-10-03
    • US13432357
    • 2012-03-28
    • Michael K. GschwindValentina Salapura
    • Michael K. GschwindValentina Salapura
    • G06F9/30G06F9/312
    • G06F9/382G06F9/3017G06F9/3808G06F9/384
    • A method for performing predecode-time optimized instructions in conjunction with predecode time optimized instruction sequence caching. The method includes receiving a first instruction of an instruction sequence and a second instruction of the instruction sequence and determining if the first instruction and the second instruction can be optimized. In response to the determining that the first instruction and second instruction can be optimized, the method includes, preforming a pre-decode optimization on the instruction sequence and generating a new second instruction, wherein the new second instruction is not dependent on a target operand of the first instruction and storing a pre-decoded first instruction and a pre-decoded new second instruction in an instruction cache. In response to determining that the first instruction and second instruction can not be optimized, the method includes, storing the pre-decoded first instruction and a pre-decoded second instruction in the instruction cache.
    • 一种执行预解码时间优化指令并结合预解码时间优化指令序列缓存的方法。 该方法包括接收指令序列的第一指令和指令序列的第二指令,并且确定是否可以优化第一指令和第二指令。 响应于确定可以优化第一指令和第二指令,该方法包括:对指令序列执行预解码优化并产生新的第二指令,其中新的第二指令不依赖于目标操作数 所述第一指令并将预解码的第一指令和预解码的新的第二指令存储在指令高速缓存中。 响应于确定第一指令和第二指令不能被优化,该方法包括:将预解码的第一指令和预解码的第二指令存储在指令高速缓存中。
    • 48. 发明授权
    • High performance unaligned cache access
    • 高性能未对齐缓存访问
    • US08127078B2
    • 2012-02-28
    • US12572416
    • 2009-10-02
    • Michael K. GschwindValentina Salapura
    • Michael K. GschwindValentina Salapura
    • G06F12/00
    • G06F12/0886
    • A cache memory device and method for operating the same. One embodiment of the cache memory device includes an address decoder decoding a memory address and selecting a target cache line. A first cache array is configured to output a first cache entry associated with the target cache line, and a second cache array coupled to an alignment unit is configured to output a second cache entry associated with the alignment cache line. The alignment unit coupled to the address decoder selects either the target cache line or a neighbor cache line proximate the target cache line as an alignment cache line output. Selection of either the target cache line or a neighbor cache line is based on an alignment bit in the memory address. A tag array cache is split into even and odd cache lines tags, and provides one or two tags for every cache access.
    • 一种高速缓冲存储器件及其操作方法。 高速缓冲存储器设备的一个实施例包括解码存储器地址并选择目标高速缓存行的地址译码器。 第一高速缓存阵列被配置为输出与目标高速缓存行相关联的第一高速缓存条目,并且耦合到对准单元的第二高速缓存阵列被配置为输出与对准高速缓存行相关联的第二高速缓存条目。 耦合到地址解码器的对准单元选择目标高速缓存线或邻近目标高速缓存行的相邻高速缓存行作为对准高速缓存行输出。 目标高速缓存行或相邻高速缓存行的选择基于存储器地址中的对齐位。 标签数组高速缓存分为偶数和奇数缓存行标签,并为每个高速缓存访​​问提供一个或两个标签。