会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明授权
    • Method and apparatus for reducing leakage power in a cache memory by using a timer control signal that removes power to associated cache lines
    • 用于通过使用去除相关联的高速缓存行的功率的定时器控制信号来减少高速缓冲存储器中的泄漏功率的方法和装置
    • US06983388B2
    • 2006-01-03
    • US09865847
    • 2001-05-25
    • Stefanos KaxirasPhilip W. DiodatoHubert Rae McLellan, Jr.Girija Narlikar
    • Stefanos KaxirasPhilip W. DiodatoHubert Rae McLellan, Jr.Girija Narlikar
    • G06F1/32
    • G11C5/143G11C11/417
    • A method and apparatus are disclosed for reducing leakage power in a cache memory. A cache decay technique is employed for both data and instruction caches that removes power from cache lines that have not been accessed for a predefined time interval, referred to as the decay interval. The cache-line granularity of the present invention permits a significant reduction in leakage power while at the same time preserving much of the performance of the cache. The decay interval is maintained using a timer that is reset each time the corresponding cache line is accessed. The decay interval may be fixed or variable. Once the decay interval timer exceeds a specified decay interval, power to the cache line is removed. Once power to the cache line is removed, the contents of the data and tag fields are allowed to decay and the valid bit associated with the cache line is reset. When a cache line is later accessed after being powered down by the present invention, a cache miss is incurred while the cache line is again powered up and the data is obtained from the next level of the memory hierarchy.
    • 公开了一种用于减少高速缓冲存储器中的泄漏功率的方法和装置。 缓存衰减技术被用于数据和指令高速缓存,这些高速缓存从预定义的时间间隔(称为衰减间隔)中去除未被访问的高速缓存行的功率。 本发明的高速缓存线粒度允许显着降低泄漏功率,同时保持高速缓存的大部分性能。 使用每次访问相应的高速缓存行时重置的定时器来维持衰减间隔。 衰减间隔可以是固定的或可变的。 一旦衰减间隔定时器超过指定的衰减间隔,就删除高速缓存行的电源。 一旦去除了高速缓存线的电源,就允许数据和标签字段的内容衰减,并且与高速缓存行相关联的有效位被重置。 当通过本发明关闭高速缓存行之后,高速缓存未命中,同时高速缓存行再次通电并且从存储器层级的下一级获得数据。
    • 4. 发明申请
    • SYSTEMS AND METHODS FOR COHERENCE IN CLUSTERED CACHE HIERARCHIES
    • 集群高速缓存中的一致性的系统和方法
    • US20160232107A1
    • 2016-08-11
    • US15015274
    • 2016-02-04
    • Alberto ROSStefanos KAXIRAS
    • Alberto ROSStefanos KAXIRAS
    • G06F12/10G06F12/08
    • G06F12/1045G06F12/0815G06F12/084G06F12/0897G06F2212/1021G06F2212/6042G06F2212/684
    • According to embodiments described herein, the hierarchical complexity for coherence protocols associated with clustered cache architectures can be encapsulated in a simple function, i.e., that of determining when a data block is shared entirely within a cluster (i.e., a sub-tree of the hierarchy) and is private from the outside. This allows embodiments to eliminate complex recursive coherence operations that span the hierarchy and instead employ simple coherence mechanisms such as self-invalidation and write-through but which are restricted to operate where a data block is shared. Thus embodiments recognize that, in the context of clustered cache hierarchies, data can be shared entirely within one cluster but can be private (unshared) to this cluster when viewed from the perspective of other clusters. This characteristic of the data can be determined and then used to locally simplify coherence protocols.
    • 根据本文描述的实施例,与集群高速缓存架构相关联的一致性协议的分层复杂度可以被封装在简单的功能中,即,确定数据块何时完全在集群内共享(即,层级的子树 )并且是外部私人的。 这允许实施例消除跨层次的复杂递归相干运算,而是采用诸如自无效和直写之类的简单相干机制,但是限于在共享数据块的情况下操作。 因此,实施例认识到,在集群缓存层次结构的上下文中,当从其他集群的角度观察时,数据可以在一个集群内完全共享,但是可以是私有(非共享)到该集群。 可以确定数据的这种特性,然后用于本地简化相干协议。
    • 6. 发明授权
    • Method and apparatus for releasing functional units in a multithreaded VLIW processor
    • 用于释放多线程VLIW处理器中的功能单元的方法和装置
    • US06665791B1
    • 2003-12-16
    • US09538669
    • 2000-03-30
    • Alan David BerenbaumNevin HeintzeTor E. JeremiassenStefanos Kaxiras
    • Alan David BerenbaumNevin HeintzeTor E. JeremiassenStefanos Kaxiras
    • G06F15163
    • G06F9/3851G06F9/3853
    • A method and apparatus are disclosed for releasing functional units in a multithreaded very large instruction word (VLIW) processor. The functional unit release mechanism can retrieve the capacity lost due to multiple cycle instructions. The functional unit release mechanism of the present invention permits idle functional units to be reallocated to other threads, thereby improving workload efficiency. Instruction packets are assigned to functional units, which can maintain their state, independent of the issue logic. Each functional unit has an associated state machine (SM) that keeps track of the number of cycles that the functional unit will be occupied by a multiple-cycle instruction. Functional units do not reassign themselves as long as the functional unit is busy. When the instruction is complete, the functional unit can participate in functional unit allocation, even if other functional units assigned to the same thread are still busy. The functional unit release approach of the present invention allows the functional units that are not associated with a multiple-cycle instruction to be allocated to other threads while the blocked thread is waiting, thereby improving throughput of the multithreaded VLIW processor. Since the state is associated with each functional unit separately from the instruction issue unit, the functional units can be assigned to threads independently of the state of any one thread and its constituent instructions.
    • 公开了用于释放多线程超大指令字(VLIW)处理器中的功能单元的方法和装置。 功能单元释放机构可以检索由于多个循环指令而导致的容量损失。 本发明的功能单元释放机构允许将空闲功能单元重新分配给其他线程,从而提高工作效率。 指令包被分配给功能单元,它们可以保持其状态,而与发行逻辑无关。 每个功能单元具有关联的状态机(SM),其跟踪功能单元将被多周期指令占用的周期数。 只要功能单元繁忙,功能单元就不会自动重新分配。 指令完成后,即使分配给同一线程的其他功能单元仍然忙,功能单元也可以参与功能单元分配。 本发明的功能单元释放方法允许在阻塞的线程等待时将不与多周期指令相关联的功能单元分配给其他线程,从而提高多线程VLIW处理器的吞吐量。 由于状态与指令发布单元分开地与每个功能单元相关联,所以功能单元可以独立于任何一个线程的状态及其组成指令分配给线程。
    • 8. 发明授权
    • System and method for simplifying cache coherence using multiple write policies
    • 使用多个写策略简化缓存一致性的系统和方法
    • US09274960B2
    • 2016-03-01
    • US13793521
    • 2013-03-11
    • Stefanos KaxirasAlberto Ros
    • Stefanos KaxirasAlberto Ros
    • G06F12/00G06F12/08
    • G06F12/0815G06F12/0811G06F12/0837G06F12/084Y02D10/13
    • System and methods for cache coherence in a multi-core processing environment having a local/shared cache hierarchy. The system includes multiple processor cores, a main memory, and a local cache memory associated with each core for storing cache lines accessible only by the associated core. Cache lines are classified as either private or shared. A shared cache memory is coupled to the local cache memories and main memory for storing cache lines. The cores follow a write-back to the local memory for private cache lines, and a write-through to the shared memory for shared cache lines. Shared cache lines in local cache memory enter a transient dirty state when written by the core. Shared cache lines transition from a transient dirty to a valid state with a self-initiated write-through to the shared memory. The write-through to shared memory can include only data that was modified in the transient dirty state.
    • 具有本地/共享高速缓存层级的多核处理环境中的高速缓存一致性的系统和方法。 该系统包括多个处理器核心,主存储器和与每个核心相关联的本地高速缓冲存储器,用于存储仅可由相关联的核心访问的高速缓存线。 缓存行被分类为私有或共享。 共享高速缓存存储器耦合到本地高速缓存存储器和用于存储高速缓存行的主存储器。 内核遵循本地存储器的专用高速缓存行的回写,以及用于共享高速缓存行的共享内存的写入。 本地高速缓存中的共享缓存行在内核写入时会进入暂时的脏状态。 共享缓存行从一个瞬态脏到一个有效的状态,通过自启动的写入到共享内存。 对共享内存的直写可以仅包括在瞬态脏状态下被修改的数据。
    • 9. 发明授权
    • Method and apparatus for reducing leakage power in a cache memory using adaptive time-based decay
    • 使用自适应基于时间的衰减来减少高速缓冲存储器中的泄漏功率的方法和装置
    • US07472302B2
    • 2008-12-30
    • US11245513
    • 2005-10-07
    • Zhigang HuStefanos KaxirasMargaret Martonosi
    • Zhigang HuStefanos KaxirasMargaret Martonosi
    • G06F1/32
    • G11C5/143G11C11/417
    • An adaptive cache decay technique is disclosed that removes power from cache lines that have not been accessed for a variable time interval, referred to as the cache line decay interval, assuming that these cache lines are unlikely to be accessed in the future. The decay interval may be increased or decreased for each cache line to increase cache performance or save power, respectively. A default decay interval is initially established for the cache and the default decay interval may then be adjusted for a given cache line based on the performance of the cache line following a cache decay. The cache decay performance is evaluated by determining if a cache line was decayed too quickly. If a cache line is decayed and the same cache contents are again required, then the cache line was decayed too quickly and the cache line decay interval is increased. If a cache line is decayed and the cache line is then accessed to obtain a different cache content, the cache line decay interval can be decreased. When a cache line is later accessed after being decayed, a cache miss is incurred and a test is performed to evaluate the cache decay performance by determining if the same cache contents are again accessed (e.g., whether the address associated with a subsequent access is the same address of the previously stored contents). The cache decay interval is then adjusted accordingly.
    • 公开了一种自适应高速缓存衰减技术,其假设在将来不太可能访问这些高速缓存线,从而将高速缓存线路的功率从尚未被访问的可变时间间隔(称为高速缓存行衰减间隔)中移除。 对于每个高速缓存线,衰减间隔可以增加或减小,以分别提高高速缓存性能或节省功率。 初始建立高速缓存的默认衰减间隔,然后可以根据高速缓存衰减之后的高速缓存行的性能,为给定的高速缓存行调整默认衰减间隔。 缓存衰减性能通过确定高速缓存行是否衰减太快来进行评估。 如果缓存线被衰减并且再次需要相同的高速缓存内容,则高速缓存线被衰减太快,并且高速缓存行衰减间隔增加。 如果缓存行被衰减并且然后访问高速缓存行以获得不同的高速缓存内容,则可以减少高速缓存行衰减间隔。 当高速缓存行在衰减之后被访问时,产生高速缓存未命中,并且执行测试以通过确定是否再次访问相同的缓存内容来评估高速缓存衰减性能(例如,与后续访问相关联的地址是否为 以前存储的内容的相同地址)。 然后相应地调整缓存衰减间隔。