会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 41. 发明申请
    • NOVEL MASSIVELY PARALLEL SUPERCOMPUTER
    • 新的大型并行超级计算机
    • US20090259713A1
    • 2009-10-15
    • US12492799
    • 2009-06-26
    • Matthias A. BlumrichDong ChenGeorge L. ChiuThomas M. CipollaPaul W. CoteusAlan G. GaraMark E. GiampapaPhilip HeidelbergerGerard V. KopcsayLawrence S. MokTodd E. Takken
    • Matthias A. BlumrichDong ChenGeorge L. ChiuThomas M. CipollaPaul W. CoteusAlan G. GaraMark E. GiampapaPhilip HeidelbergerGerard V. KopcsayLawrence S. MokTodd E. Takken
    • G06F15/76G06F15/16G06F11/28G06F12/08G06F9/02G06F15/177
    • H05K7/20836F24F11/77G06F9/52G06F9/526G06F15/17381G06F17/142G09G5/008H04L7/0338
    • A novel massively parallel supercomputer of hundreds of teraOPS-scale includes node architectures based upon System-On-a-Chip technology, i.e., each processing node comprises a single Application Specific Integrated Circuit (ASIC). Within each ASIC node is a plurality of processing elements each of which consists of a central processing unit (CPU) and plurality of floating point processors to enable optimal balance of computational performance, packaging density, low cost, and power and cooling requirements. The plurality of processors within a single node may be used individually or simultaneously to work on any combination of computation or communication as required by the particular algorithm being solved or executed at any point in time. The system-on-a-chip ASIC nodes are interconnected by multiple independent networks that optimally maximizes packet communications throughput and minimizes latency. In the preferred embodiment, the multiple networks include three high-speed networks for parallel algorithm message passing including a Torus, Global Tree, and a Global Asynchronous network that provides global barrier and notification functions. These multiple independent networks may be collaboratively or independently utilized according to the needs or phases of an algorithm for optimizing algorithm processing performance. For particular classes of parallel algorithms, or parts of parallel calculations, this architecture exhibits exceptional computational performance, and may be enabled to perform calculations for new classes of parallel algorithms. Additional networks are provided for external connectivity and used for Input/Output, System Management and Configuration, and Debug and Monitoring functions. Special node packaging techniques implementing midplane and other hardware devices facilitates partitioning of the supercomputer in multiple networks for optimizing supercomputing resources.
    • 数百个teraOPS级别的新型大规模并行超级计算机包括基于片上系统技术的节点架构,即每个处理节点包括单个专用集成电路(ASIC)。 在每个ASIC节点内是多个处理元件,每个处理元件由中央处理单元(CPU)和多个浮点处理器组成,以实现计算性能,封装密度,低成本以及功率和冷却​​要求的最佳平衡。 单个节点内的多个处理器可以单独使用或同时使用,以在任何时间点解决或执行的特定算法所要求的任何计算或通信组合上工作。 片上系统ASIC节点通过多个独立网络互连,从而最大限度地最大限度地提高了分组通信吞吐量并最大限度地减少了延迟。 在优选实施例中,多个网络包括用于并行算法消息传递的三个高速网络,包括提供全局障碍和通知功能的环形,全局树和全球异步网络。 这些多个独立网络可以根据用于优化算法处理性能的算法的需求或阶段来协同或独立地利用。 对于特定类别的并行算法或并行计算的部分,该架构具有出色的计算性能,并且可以启用对新类并行算法执行计算。 为外部连接提供附加网络,用于输入/输出,系统管理和配置以及调试和监控功能。 实现中平面和其他硬件设备的特殊节点打包技术有助于在多个网络中划分超级计算机,以优化超级计算资源。
    • 44. 发明申请
    • METHOD AND APPARATUS FOR FILTERING SNOOP REQUESTS USING MULTIPLE SNOOP CACHES
    • 使用多个SNOOP CACHES过滤SNOOP要求的方法和设备
    • US20080155201A1
    • 2008-06-26
    • US12042958
    • 2008-03-05
    • Matthias A. BlumrichAlan G. GaraValentina Salapura
    • Matthias A. BlumrichAlan G. GaraValentina Salapura
    • G06F12/08
    • G06F12/0831G06F12/0822G06F2212/507Y02D10/13
    • A method and apparatus for implementing a snoop filter unit associated with a single processor in a multiprocessor system. The snoop filter unit has a plurality of ports, each port receiving snoop requests from exactly one dedicated source. Associated with each port is a snoop cache filter that processes each snoop cache request and records addresses of the most recent snoop requests for exactly one source. The snoop cache filter uses vector encoding to record the occurrence of snoop requests for a sequence of consecutive cache lines. All addresses of snoop requests are added to the snoop cache unless a received snoop cache request matches an entry present in the associated snoop cache, in which case the snoop request is discarded. Otherwise, the associated snoop cache request is enqueued for forwarding to the single processor. Information from all snoop cache filters assigned to all ports in the snoop filter unit are removed in the case that data corresponding to any one of the memory addresses contained in snoop cache filter is loaded in the cache hierarchy of the processor the snoop cache filter is assigned to.
    • 一种用于在多处理器系统中实现与单个处理器相关联的窥探滤波器单元的方法和装置。 监听过滤器单元具有多个端口,每个端口接收来自正好一个专用源的窥探请求。 与每个端口相关联的是一个侦听缓存过滤器,用于处理每个侦听缓存请求,并记录最近一次侦听请求的地址。 监听高速缓存过滤器使用向量编码来记录连续高速缓存行序列的窥探请求的发生。 侦听请求的所有地址都将添加到侦听缓存中,除非接收到的侦听缓存请求与相关侦听缓存中存在的条目匹配,在这种情况下,侦听请求将被丢弃。 否则,相关联的侦听缓存请求被排入队列以转发到单个处理器。 在分配给窥探过滤器单元中的所有端口的所有侦听缓存过滤器中的信息将被删除,因为与侦听高速缓存过滤器中包含的任何一个存储器地址相对应的数据被加载到处理器的高速缓存层次结构中,该侦听缓存过滤器被分配 至。
    • 45. 发明授权
    • Method and apparatus for filtering snoop requests using a scoreboard
    • 使用记分板过滤窥探请求的方法和装置
    • US07383397B2
    • 2008-06-03
    • US11093160
    • 2005-03-29
    • Matthias A. BlumrichAlan G. GaraThomas R. PuzakValentina Salapura
    • Matthias A. BlumrichAlan G. GaraThomas R. PuzakValentina Salapura
    • G06F12/00G06F13/00
    • G06F12/0822G06F12/0831G06F2212/507Y02D10/13
    • An apparatus for implementing snooping cache coherence that locally reduces the number of snoop requests presented to each cache in a multiprocessor system. A snoop filter device associated with a single processor includes one or more “scoreboard” data structures that make snoop determinations, i.e., for each snoop request from another processor, to determine if a request is to be forwarded to the processor or, discarded. At least one scoreboard is active, and at least one scoreboard is determined to be historic at any point in time. A snoop determination of the queue indicates that an entry may be in the cache, but does not indicate its actual residence status. In addition, the snoop filter block implementing scoreboard data structures is operatively coupled with a cache wrap detection logic means whereby, upon detection of a cache wrap condition, the content of the active scoreboard is copied into a historic scoreboard and the content of at least one active scoreboard is reset.
    • 用于实现窥探高速缓存一致性的装置,其本地地减少呈现给多处理器系统中的每个缓存的窥探请求的数量。 与单个处理器相关联的窥探过滤器装置包括一个或多个“记分板”数据结构,其进行窥探确定,即,来自另一个处理器的每个窥探请求,以确定请求是否被转发到处理器或被丢弃。 至少一个记分牌是活跃的,并且至少一个记分牌被确定为在任何时间点的历史。 队列的窥探确定表示一个条目可能在缓存中,但不表示其实际居住状态。 此外,实现记分板数据结构的窥探过滤器块与高速缓存包检测逻辑装置可操作地耦合,由此在检测到缓存包装条件时,将活动记分板的内容复制到历史记分板中,并且至少一个 活动记分板重置。
    • 46. 发明授权
    • Method and apparatus for filtering snoop requests using stream registers
    • 使用流寄存器对窥探请求进行过滤的方法和装置
    • US08135917B2
    • 2012-03-13
    • US12137325
    • 2008-06-11
    • Matthias A. BlumrichAlan G. GaraValentina Salapura
    • Matthias A. BlumrichAlan G. GaraValentina Salapura
    • G06F12/00G06F13/00
    • G06F12/0831G06F12/0822G06F2212/507Y02D10/13
    • A method and apparatus for supporting cache coherency in a multiprocessor computing environment having multiple processing units, each processing unit having a local cache memory associated therewith. A snoop filter device is associated with each processing unit and includes at least one snoop filter primitive implementing filtering method based on usage of stream registers sets and associated stream register comparison logic. From the plurality of stream registers sets, at least one stream register set is active, and at least one stream register set is labeled historic at any point in time. In addition, the snoop filter block is operatively coupled with cache wrap detection logic whereby the content of the active stream register set is switched into a historic stream register set upon the cache wrap condition detection, and the content of at least one active stream register set is reset. Each filter primitive implements stream register comparison logic that determines whether a received snoop request is to be forwarded to the processor or discarded.
    • 一种用于在具有多个处理单元的多处理器计算环境中支持高速缓存一致性的方法和装置,每个处理单元具有与其相关联的本地高速缓冲存储器。 窥探过滤设备与每个处理单元相关联并且包括至少一个基于流寄存器集合和相关流寄存器比较逻辑的使用实现过滤方法的窥探过滤器原语。 从多个流寄存器组中,至少一个流寄存器组是有效的,并且至少一个流寄存器集合在任何时间点被标记为历史。 另外,监听滤波器块可操作地与高速缓存包检测逻辑耦合,从而将活动流寄存器集合的内容切换到在高速缓存环绕条件检测时设置的历史流寄存器,并且至少一个活动流寄存器集合的内容 被复位。 每个滤波器基元实现流寄存器比较逻辑,其确定接收的窥探请求是否被转发到处理器或丢弃。
    • 47. 发明授权
    • Method and apparatus for filtering snoop requests using a scoreboard
    • 使用记分板过滤窥探请求的方法和装置
    • US08015364B2
    • 2011-09-06
    • US12129289
    • 2008-05-29
    • Matthias A. BlumrichAlan G. GaraThomas R. PuzakValentina Salapura
    • Matthias A. BlumrichAlan G. GaraThomas R. PuzakValentina Salapura
    • G06F12/00G06F13/00
    • G06F12/0822G06F12/0831G06F2212/507Y02D10/13
    • An apparatus for implementing snooping cache coherence that locally reduces the number of snoop requests presented to each cache in a multiprocessor system. A snoop filter device associated with a single processor includes one or more “scoreboard” data structures that make snoop determinations, i.e., for each snoop request from another processor, to determine if a request is to be forwarded to the processor or, discarded. At least one scoreboard is active, and at least one scoreboard is determined to be historic at any point in time. A snoop determination of the queue indicates that an entry may be in the cache, but does not indicate its actual residence status. In addition, the snoop filter block implementing scoreboard data structures is operatively coupled with a cache wrap detection logic means whereby, upon detection of a cache wrap condition, the content of the active scoreboard is copied into a historic scoreboard and the content of at least one active scoreboard is reset.
    • 用于实现窥探高速缓存一致性的装置,其本地地减少呈现给多处理器系统中的每个缓存的窥探请求的数量。 与单个处理器相关联的窥探过滤器装置包括一个或多个“记分板”数据结构,其进行窥探确定,即,来自另一个处理器的每个窥探请求,以确定请求是否被转发到处理器或被丢弃。 至少一个记分牌是活跃的,并且至少一个记分牌被确定为在任何时间点的历史。 队列的窥探确定表示一个条目可能在缓存中,但不表示其实际居住状态。 此外,实现记分板数据结构的窥探过滤器块与高速缓存包检测逻辑装置可操作地耦合,由此在检测到缓存包装条件时,将活动记分板的内容复制到历史记分板中,并且至少一个 活动记分板重置。
    • 48. 发明申请
    • METHOD AND APPARATUS FOR FILTERING SNOOP REQUESTS USING A SCOREBOARD
    • 使用分光镜过滤SNOOP要求的方法和装置
    • US20080294850A1
    • 2008-11-27
    • US12129289
    • 2008-05-29
    • Matthias A. BlumrichAlan G. GaraThomas R. PuzakValentina Salapura
    • Matthias A. BlumrichAlan G. GaraThomas R. PuzakValentina Salapura
    • G06F12/08
    • G06F12/0822G06F12/0831G06F2212/507Y02D10/13
    • An apparatus for implementing snooping cache coherence that locally reduces the number of snoop requests presented to each cache in a multiprocessor system. A snoop filter device associated with a single processor includes one or more “scoreboard” data structures that make snoop determinations, i.e., for each snoop request from another processor, to determine if a request is to be forwarded to the processor or, discarded. At least one scoreboard is active, and at least one scoreboard is determined to be historic at any point in time. A snoop determination of the queue indicates that an entry may be in the cache, but does not indicate its actual residence status. In addition, the snoop filter block implementing scoreboard data structures is operatively coupled with a cache wrap detection logic means whereby, upon detection of a cache wrap condition, the content of the active scoreboard is copied into a historic scoreboard and the content of at least one active scoreboard is reset.
    • 用于实现窥探高速缓存一致性的装置,其本地地减少呈现给多处理器系统中的每个缓存的窥探请求的数量。 与单个处理器相关联的窥探过滤器装置包括一个或多个“记分板”数据结构,其进行窥探确定,即,来自另一个处理器的每个窥探请求,以确定请求是否被转发到处理器或被丢弃。 至少一个记分牌是活跃的,并且至少一个记分牌被确定为在任何时间点的历史。 队列的窥探确定表示一个条目可能在缓存中,但不表示其实际居住状态。 此外,实现记分板数据结构的窥探过滤器块与高速缓存包检测逻辑装置可操作地耦合,由此在检测到缓存包装条件时,将活动记分板的内容复制到历史记分板中,并且至少一个 活动记分板重置。
    • 49. 发明申请
    • METHOD AND APARATHUS FOR FILTERING SNOOP REQUESTS USING STREAM REGISTERS
    • 使用流记录器过滤SNOOP要求的方法和方法
    • US20080244194A1
    • 2008-10-02
    • US12137325
    • 2008-06-11
    • Matthias A. BlumrichAlan G. GaraValentina Salapura
    • Matthias A. BlumrichAlan G. GaraValentina Salapura
    • G06F12/08
    • G06F12/0831G06F12/0822G06F2212/507Y02D10/13
    • A method and apparatus for supporting cache coherency in a multiprocessor computing environment having multiple processing units, each processing unit having a local cache memory associated therewith. A snoop filter device is associated with each processing unit and includes at least one snoop filter primitive implementing filtering method based on usage of stream registers sets and associated stream register comparison logic. From the plurality of stream registers sets, at least one stream register set is active, and at least one stream register set is labeled historic at any point in time. In addition, the snoop filter block is operatively coupled with cache wrap detection logic whereby the content of the active stream register set is switched into a historic stream register set upon the cache wrap condition detection, and the content of at least one active stream register set is reset. Each filter primitive implements stream register comparison logic that determines whether a received snoop request is to be forwarded to the processor or discarded.
    • 一种用于在具有多个处理单元的多处理器计算环境中支持高速缓存一致性的方法和装置,每个处理单元具有与其相关联的本地高速缓冲存储器。 窥探过滤设备与每个处理单元相关联并且包括至少一个基于流寄存器集合和相关流寄存器比较逻辑的使用实现过滤方法的窥探过滤器原语。 从多个流寄存器组中,至少一个流寄存器组是有效的,并且至少一个流寄存器集合在任何时间点被标记为历史。 另外,监听滤波器块可操作地与高速缓存包检测逻辑耦合,从而将活动流寄存器集合的内容切换到在高速缓存环绕条件检测时设置的历史流寄存器,并且至少一个活动流寄存器集合的内容 被复位。 每个滤波器基元实现流寄存器比较逻辑,其确定接收的窥探请求是否被转发到处理器或丢弃。
    • 50. 发明授权
    • Method and apparatus for filtering snoop requests using stream registers
    • 使用流寄存器对窥探请求进行过滤的方法和装置
    • US07392351B2
    • 2008-06-24
    • US11093130
    • 2005-03-29
    • Matthias A. BlumrichAlan G. GaraValentina Salapura
    • Matthias A. BlumrichAlan G. GaraValentina Salapura
    • G06F13/28G06F12/00
    • G06F12/0831G06F12/0822G06F2212/507Y02D10/13
    • A method and apparatus for supporting cache coherency in a multiprocessor computing environment having multiple processing units, each processing unit having a local cache memory associated therewith. A snoop filter device is associated with each processing unit and includes at least one snoop filter primitive implementing filtering method based on usage of stream registers sets and associated stream register comparison logic. From the plurality of stream registers sets, at least one stream register set is active, and at least one stream register set is labeled historic at any point in time. In addition, the snoop filter block is operatively coupled with cache wrap detection logic whereby the content of the active stream register set is switched into a historic stream register set upon the cache wrap condition detection, and the content of at least one active stream register set is reset. Each filter primitive implements stream register comparison logic that determines whether a received snoop request is to be forwarded to the processor or discarded.
    • 一种用于在具有多个处理单元的多处理器计算环境中支持高速缓存一致性的方法和装置,每个处理单元具有与其相关联的本地高速缓冲存储器。 窥探过滤设备与每个处理单元相关联并且包括至少一个基于流寄存器集合和相关流寄存器比较逻辑的使用实现过滤方法的窥探过滤器原语。 从多个流寄存器组中,至少一个流寄存器组是有效的,并且至少一个流寄存器集合在任何时间点被标记为历史。 另外,监听滤波器块可操作地与高速缓存包检测逻辑耦合,从而将活动流寄存器集合的内容切换到在高速缓存环绕条件检测时设置的历史流寄存器,并且至少一个活动流寄存器集合的内容 被复位。 每个滤波器基元实现流寄存器比较逻辑,其确定接收的窥探请求是否被转发到处理器或丢弃。