会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 4. 发明授权
    • Mechanism for reducing latency of memory barrier operations on a
multiprocessor system
    • 减少多处理器系统上存储器屏障操作延迟的机制
    • US6088771A
    • 2000-07-11
    • US957501
    • 1997-10-24
    • Simon C. Steely, Jr.Madhumitra SharmaKourosh GharachorlooStephen R. Van Doren
    • Simon C. Steely, Jr.Madhumitra SharmaKourosh GharachorlooStephen R. Van Doren
    • G06F9/45G06F13/00
    • G06F9/3004G06F8/458G06F9/30087G06F9/3834
    • A technique reduces the latency of a memory barrier (MB) operation used to impose an inter-reference order between sets of memory reference operations issued by a processor to a multiprocessor system having a shared memory. The technique comprises issuing the MB operation immediately after issuing a first set of memory reference operations (i.e., the pre-MB operations) without waiting for responses to those pre-MB operations. Issuance of the MB operation to the system results in serialization of that operation and generation of a MB Acknowledgment (MB-Ack) command. The MB-Ack is loaded into a probe queue of the issuing processor and, according to the invention, functions to pull-in all previously ordered invalidate and probe commands in that queue. By ensuring that the probes and invalidates are ordered before the MB-Ack is received at the issuing processor, the inventive technique provides the appearance that all pre-MB references have completed.
    • 一种技术减少了用于在处理器向具有共享存储器的多处理器系统发出的存储器参考操作的集合之间施加参考间顺序的存储器屏障(MB)操作的等待时间。 该技术包括在发出第一组存储器参考操作(即,预MB操作)之前立即发出MB操作,而不等待对那些MB前操作的响应。 向系统发出MB操作会导致该操作的序列化和生成MB确认(MB-Ack)命令。 MB-Ack被加载到发布处理器的探测队列中,并且根据本发明,该功能用于在该队列中引入所有先前订购的无效和探测命令。 通过确保在发布处理器接收到MB-Ack之前对探测和无效进行排序,本发明技术提供了所有pre-MB引用完成的外观。
    • 7. 发明授权
    • Livelock prevention by delaying surrender of ownership upon intervening ownership request during load locked / store conditional atomic memory operation
    • 在加载锁定/存储条件原子存储器操作期间,通过延迟所有权所有权投降来实现预防行为
    • US06801986B2
    • 2004-10-05
    • US09933536
    • 2001-08-20
    • Simon C. Steely, Jr.Stephen R. Van DorenMadhumitra Sharma
    • Simon C. Steely, Jr.Stephen R. Van DorenMadhumitra Sharma
    • G06F1200
    • G06F9/52G06F9/3004G06F9/30047G06F9/30087G06F9/3834
    • A method, for executing a load locked and a store conditional instruction in a processor, achieves an atomic read-write operation to a memory block. First the load locked instruction is executed to read a memory block, and the processor in response to executing the load locked instruction issues a read modify system command to read the block and to take ownership of the block by the processor, and also sets a lock flag for the address of the memory block, and writes a value of the memory block into a cache of the processor as a cache copy of the memory block. The lock flag, upon receipt of an invalidate message by the processor for the cache copy of the memory block, is reset if any invalidate messages for the memory block are received by the processor. The processor waits for a selected time interval before the processor surrenders ownership of the memory block upon receipt of an ownership request message, if any is received by the processor after execution of the load locked instruction. The processor executes the store conditional instruction, and the processor in response to executing the store conditional instruction tests the lock flag, and if the lock flag is set, writing to the cache copy of the memory block. The processor ends, in the event that the lock flag is reset, the store conditional instruction and does not write to the cache copy of the memory block.
    • 一种用于在处理器中执行加载锁定和存储条件指令的方法,对存储器块实现原子读写操作。 首先执行加载锁定指令以读取存储器块,并且响应于执行加载锁定指令的处理器发出读取修改系统命令来读取块并由处理器获取块的所有权,并且还设置锁定 标记存储器块的地址,并将存储器块的值写入处理器的高速缓存作为存储器块的高速缓存副本。 如果处理器接收到存储块的任何无效消息,则锁定标志在由处理器接收到存储器块的高速缓存副本的无效消息时被重置。 处理器在接收到所有权请求消息之后处理器递交所述存储器块的所有权,等待处理器选定的时间间隔(如果在执行加载锁定指令之后由处理器接收到)。 处理器执行存储条件指令,并且处理器响应于执行存储条件指令测试锁定标志,并且如果设置了锁定标志,则写入存储器块的高速缓存副本。 处理器在锁定标志被复位的情况下结束,存储条件指令,并且不写入存储器块的高速缓存副本。
    • 10. 发明授权
    • High-performance non-blocking switch with multiple channel ordering constraints
    • 具有多通道排序限制的高性能非阻塞开关
    • US06249520B1
    • 2001-06-19
    • US08957664
    • 1997-10-24
    • Simon C. Steely, Jr.Stephen R. VanDorenMadhumitra SharmaCraig D. KeeferDavid W. Davis
    • Simon C. Steely, Jr.Stephen R. VanDorenMadhumitra SharmaCraig D. KeeferDavid W. Davis
    • H04L1250
    • G06F13/4022G06F12/0826G06F15/17393
    • An architecture and coherency protocol for use in a large SMP computer system includes a hierarchical switch structure which allows for a number of multi-processor nodes to be coupled to the switch to operate at an optimum performance. Within each multi-processor node, a simultaneous buffering system is provided that allows all of the processors of the multi-processor node to operate at peak performance. A memory is shared among the nodes, with a portion of the memory resident at each of the multi-processor nodes. Each of the multi-processor nodes includes a number of elements for maintaining memory coherency, including a victim cache, a directory and a transaction tracking table. The victim cache allows for selective updates of victim data destined for memory stored at a remote multi-processing node, thereby improving the overall performance of memory. Memory performance is additionally improved by including, at each memory, a delayed write buffer which is used in conjunction with the directory to identify victims that are to be written to memory. An arb bus coupled to the output of the directory of each node provides a central ordering point for all messages that are transferred through the SMP. The messages comprise a number of transactions, and each transaction is assigned to a number of different virtual channels, depending upon the processing stage of the message. The use of virtual channels thus helps to maintain data coherency by providing a straightforward method for maintaining system order. Using the virtual channels and the directory structure, cache coherency problems that would previously result in deadlock may be avoided.
    • 用于大SMP计算机系统的架构和一致性协议包括分层交换结构,其允许多个多处理器节点耦合到交换机以以最佳性能进行操作。 在每个多处理器节点内,提供同时缓冲系统,其允许多处理器节点的所有处理器以最高性能运行。 存储器在节点之间共享,存储器的一部分驻留在每个多处理器节点处。 每个多处理器节点包括用于维持存储器一致性的多个元件,包括受害缓存,目录和事务跟踪表。 受害者缓存允许选择性地更新目的地存储在远程多处理节点处的存储器的受害者数据,从而提高存储器的整体性能。 通过在每个存储器处包括延迟的写入缓冲器来进一步改善存储器性能,该缓冲器与目录一起使用以识别要写入存储器的受害者。 耦合到每个节点的目录的输出的arb总线为通过SMP传输的所有消息提供了中心排序点。 消息包括多个事务,并且根据消息的处理阶段,将每个事务分配给多个不同的虚拟通道。 因此,通过提供用于维护系统顺序的简单方法,使用虚拟通道有助于维持数据一致性。 使用虚拟通道和目录结构,可以避免先前导致死锁的高速缓存一致性问题。