专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明授权

US5805487A Method and system for fast determination of sticky and guard bits 失效
标题翻译：用于快速测定粘性和保护位的方法和系统
公开(公告)号：US5805487A
公开(公告)日：1998-09-08
申请号：US677843
申请日：1996-07-12
申请人： Timothy Alan Elliott , Christopher Hans Olson , Michael Putrino
发明人： Timothy Alan Elliott , Christopher Hans Olson , Michael Putrino
IPC分类号： G06F7/38 , G06F7/00 , G06F7/483 , G06F7/57 , G06F7/76 , G06F7/48
CPC分类号： G06F7/483 , G06F7/49952 , G06F7/49957
摘要： A method and system for fast calculation of the sticky bit and a function of the guard bit is disclosed. A first aspect of the method and system provides a fast calculation of the sticky bit. A second aspect provides a fast calculation of a function of the guard bit. Both aspects comprise means for providing an intermediate result of a floating point mathematical operation involving at least a first and a second operand and means for providing a mask indicating a position of a leading one in a mantissa of the intermediate result. In the first aspect, means for aligning a first bit of the mask to an (n+2)nd bit of the intermediate result, where n is the number of bits in a mantissa of the first or second operand, are coupled to the intermediate result providing means. In the second aspect, means for aligning a first bit of the mask to an (n+1)st bit of the intermediate result are coupled to the intermediate result providing means. In both aspects, means for providing an output are coupled to the aligning means and intermediate result providing means. The output of the first aspect comprises the sticky bit. The output of the second aspect comprises a function of the guard bit. Thus, the method and system allow the sticky bit and a function of the guard bit to be calculated substantially simultaneously with normalization. Because the method and system allow fast determination of the sticky bit and a function of the guard bit, the overall speed of the calculation is increased and system performance is improved.
摘要翻译：公开了一种用于快速计算粘滞位和保护位功能的方法和系统。该方法和系统的第一方面提供了粘性位的快速计算。第二方面提供了对保护位的功能的快速计算。两个方面包括用于提供涉及至少第一和第二操作数的浮点数学运算的中间结果的装置，以及用于提供指示中间结果的尾数中的前导位置的掩码的装置。在第一方面，用于将掩模的第一位与中间结果的第（n + 2）位对齐的装置，其中n是第一或第二操作数的尾数中的位数，结果提供手段。在第二方面，用于将掩模的第一位与中间结果的第（n + 1）位进行对准的装置耦合到中间结果提供装置。在两个方面，用于提供输出的装置耦合到对准装置和中间结果提供装置。第一方面的输出包括粘点。第二方面的输出包括保护位的功能。因此，该方法和系统允许基本上与归一化同时计算粘滞位和保护位的功能。由于方法和系统允许快速确定粘滞位和保护位的功能，所以计算的总速度提高，系统性能得到提高。

2. 发明授权

US06324638B1 Processor having vector processing capability and method for executing a vector instruction in a processor 有权
标题翻译：具有向量处理能力的处理器和用于在处理器中执行向量指令的方法
公开(公告)号：US06324638B1
公开(公告)日：2001-11-27
申请号：US09282268
申请日：1999-03-31
申请人： Thomas Elmer , Michael Putrino
发明人： Thomas Elmer , Michael Putrino
IPC分类号： G06F1517
CPC分类号： G06F7/5324 , G06F7/5332 , G06F9/30014 , G06F9/30036 , G06F2207/382 , G06F2207/3828
摘要： A processor capable of executing vector instructions includes at least an instruction sequencing unit and a vector processing unit that receives vector instructions to be executed from the instruction sequencing unit. The vector processing unit includes a plurality of multiply structures, each containing only a single multiply array, that each correspond to at least one element of a vector input operand. Utilizing the single multiply array, each of the plurality of multiply structures is capable of performing a multiplication operation on one element of a vector input operand and is also capable of performing a multiplication operation on multiple elements of a vector input operand concurrently. In an embodiment in which the maximum length of an element of a vector input operand is N bits, each of the plurality of multiply arrays can handle both N by N bit integer multiplication and M by M bit integer multiplication, where N is a non-unitary integer multiple of M. At least one of the multiply structures also preferably includes an accumulating adder that receives as a first input a result produced by that multiply structure and receives as a second input a result produced by another multiply structure. From these inputs, the accumulating adder produces as an output an accumulated sum of the results in response to execution of the same instruction that caused the multiply structures to produce the intermediate results.
摘要翻译：能够执行向量指令的处理器至少包括指令排序单元和向量处理单元，其从指令排序单元接收要执行的向量指令。矢量处理单元包括多个乘法结构，每个乘法结构仅包含单个乘法阵列，每个乘法阵列对应于向量输入操作数的至少一个元素。利用单个乘法阵列，多个乘法结构中的每一个能够对向量输入操作数的一个元素执行乘法运算，并且还能够同时对矢量输入操作数的多个元素执行乘法运算。在矢量输入操作数的元素的最大长度为N位的实施例中，多个乘法阵列中的每一个可以处理N乘N位整数乘法和M乘M位整数乘法，其中N是非乘法，多重结构中的至少一个还优选地包括累积加法器，其接收由该乘法结构产生的结果作为第一输入，并且作为第二输入接收由另一乘法结构产生的结果。从这些输入中，积累加法器响应于导致乘法结构产生中间结果的相同指令的执行而产生结果的累加和。

3. 发明授权

US5805916A Method and apparatus for dynamic allocation of registers for intermediate floating-point results 失效
标题翻译：用于中间浮点数结果的寄存器的动态分配方法和装置
公开(公告)号：US5805916A
公开(公告)日：1998-09-08
申请号：US758017
申请日：1996-11-27
申请人： Soummya Mallick , Michael Putrino , Romesh Mangho Jessani
发明人： Soummya Mallick , Michael Putrino , Romesh Mangho Jessani
IPC分类号： G06F9/302 , G06F9/38
CPC分类号： G06F9/30014 , G06F9/30105 , G06F9/30112 , G06F9/3836 , G06F9/384 , G06F9/3855 , G06F9/3857 , G06F9/3875
摘要： The present invention relates to a multiple stage execution unit for executing instructions in a microprocessor having a plurality of rename registers for storing execution results, an instruction cache for storing instructions, each instruction being associated with a rename register, a sequencer unit for providing an instruction to the execution unit, and a data cache for providing data to the execution unit. In one version, the execution unit includes a first stage which generates an intermediate result from the data according to an instruction; a means for providing a first portion of the intermediate result to an intermediate register; a means for providing a second portion of the intermediate result to a rename register associated with the instruction; a means for passing the first portion from the intermediate register to a second stage of the execution unit; a means for passing the second portion from the rename register to the second stage of the execution unit; wherein the second stage of the execution unit operates on the first and second portions according to the instruction.
摘要翻译：本发明涉及一种多级执行单元，用于在微处理器中执行指令，该微处理器具有用于存储执行结果的多个重命名寄存器，用于存储指令的指令高速缓存，每个指令与重命名寄存器相关联，定序器单元用于提供指令以及用于向执行单元提供数据的数据高速缓存。在一个版本中，执行单元包括根据指令从数据生成中间结果的第一阶段; 用于将中间结果的第一部分提供给中间寄存器的装置; 用于将中间结果的第二部分提供给与指令相关联的重命名寄存器的装置; 用于将第一部分从中间寄存器传递到执行单元的第二级的装置; 用于将第二部分从重命名寄存器传递到执行单元的第二级的装置; 其中执行单元的第二级根据该指令在第一和第二部分上操作。

4. 发明授权

US5872948A Processor and method for out-of-order execution of instructions based upon an instruction parameter 失效
标题翻译：基于指令参数的指令无序执行的处理器和方法
公开(公告)号：US5872948A
公开(公告)日：1999-02-16
申请号：US616613
申请日：1996-03-15
申请人： Soummya Mallick , Rajesh Bikhubhai Patel , Romesh Mangho Jessani , Michael Putrino
发明人： Soummya Mallick , Rajesh Bikhubhai Patel , Romesh Mangho Jessani , Michael Putrino
IPC分类号： G06F9/38 , G06F9/28
CPC分类号： G06F9/3836 , G06F9/384 , G06F9/3857
摘要： A processor and method for out-of-order execution of instructions are disclosed which fetch a first and a second instruction, wherein the first instruction precedes the second instruction in a program order. A determination is made whether execution of the second instruction is subject to execution of the first instruction. In response to a determination that execution of the second instruction is subject to execution of the first instruction, the second instruction is selectively executed prior to the first instruction in response to a parameter of at least one of the first and second instructions. In one embodiment, the parameter is an execution latency parameter of the first and second instructions.
摘要翻译：公开了用于执行指令的处理器和方法，其提取第一和第二指令，其中第一指令以程序顺序在第二指令之前。确定第二指令的执行是否受到第一指令的执行。响应于第二指令的执行被执行第一指令的确定，响应于第一和第二指令中的至少一个指令的参数在第一指令之前选择性地执行第二指令。在一个实施例中，该参数是第一和第二指令的执行等待时间参数。

5. 发明授权

US5765191A Method for implementing a four-way least recently used (LRU) mechanism in high-performance 失效
标题翻译：在高性能数据处理系统中实现四路最近最少使用（LRU）机制的方法
公开(公告)号：US5765191A
公开(公告)日：1998-06-09
申请号：US641060
申请日：1996-04-29
申请人： Albert John Loper , Soummya Mallick , Rajesh Bhikhubhai Patel , Michael Putrino
发明人： Albert John Loper , Soummya Mallick , Rajesh Bhikhubhai Patel , Michael Putrino
IPC分类号： G06F12/08 , G06F12/12
CPC分类号： G06F12/123
摘要： A method for implementing a four-way least recently used cache line replacement scheme in a four-way cache memory is disclosed. The cache memory includes multiple cache lines, and each cache line includes four congruence sets. In accordance with the present disclosure, a 5-bit Least Recently Used (LRU) field is associated with each of the cache lines within the cache memory. For a particular cache line, a set number of a least recently used set among the four congruence sets is stored in any two bits of the LRU field associated with that cache line. Next, a set number of the second least recently used set among the four congruence sets is stored in another two bits of the same LRU field associated with the same cache line. Finally, a last bit of the 5-bit LRU field is set to a specific state in response to a determination of which one of the remaining two sets is the second most recently used set.
摘要翻译：公开了一种用于在四路高速缓冲存储器中实现四路最少使用的高速缓存行替换方案的方法。高速缓冲存储器包括多个高速缓存行，并且每个高速缓存行包括四个一致集合。根据本公开，5位最近使用（LRU）字段与高速缓冲存储器内的每个高速缓存行相关联。对于特定的高速缓存行，四个同余集中的最近最少使用的集合的集合数存储在与该高速缓存行相关联的LRU字段的任何两个位中。接下来，将四个同余集合中的第二最近使用的集合的集合数存储在与相同高速缓存行相关联的相同LRU字段的另外两个比特中。最后，响应于确定剩余两组中的哪一组是最近使用的第二组，将5位LRU字段的最后一位设置为特定状态。

6. 发明授权

US4914617A High performance parallel binary byte adder 失效
标题翻译：高性能并行二进制字节加法器
公开(公告)号：US4914617A
公开(公告)日：1990-04-03
申请号：US66580
申请日：1987-06-26
申请人： Michael Putrino , Stamatis Vassiliadis , Eric M. Schwartz
发明人： Michael Putrino , Stamatis Vassiliadis , Eric M. Schwartz
IPC分类号： G06F7/505 , G06F7/494 , G06F7/50 , G06F7/508
CPC分类号： G06F7/505 , G06F2207/382
摘要： A parallel binary byte adder performs addition and subtraction on the individual bytes of an A-operand and a B-operand as well as on the entire A and B operand. An A-operand is input to a special adder circuit. A B-operand is modified in a set up logic circuit, in accordance with the specific operation to be performed, before being input to the special adder circuit. A set/mask logic generates set, mask and carry signals which are further input to the special adder circuit. The special adder circuit includes an auxiliary functions circuit and a pseudo carry circuit for generating a set of variables which are processed by a sum circuit to produce three partial results. The first partial result relates to bits 0-5 of the particular byte being processed, the second relates to bit 6, and the third relates to bit 7. A concatenation of the three partial results produces a final sum or difference of the particular byte or bytes involved.

7. 发明授权

US6098168A System for completing instruction out-of-order which performs target address comparisons prior to dispatch 失效
标题翻译：用于完成在发送前执行目标地址比较的无序指令的系统
公开(公告)号：US6098168A
公开(公告)日：2000-08-01
申请号：US46867
申请日：1998-03-24
申请人： Lee Evan Eisen , Michael Putrino
发明人： Lee Evan Eisen , Michael Putrino
IPC分类号： G06F9/38
CPC分类号： G06F9/3842 , G06F9/3836 , G06F9/384 , G06F9/3855 , G06F9/3857
摘要： A mechanism structured to check for instruction collisions at the Dispatch Unit rather than the Completion Unit. In processors which issue multiple commands simultaneously, a flag bit is sent to the Completion Unit and attached to the instruction in the queue that follows the other in program order if they both have the same targeted address. When the instructions from position 1 and position 2 of the instruction queue are ready to issue, the Completion Unit checks position 2 for a flag bit. If there is a bit, then the instruction in position 1 is discarded and the instruction in position 2 is written to the target address. If there is no flag bit with the instruction in position 2, the instruction in position 1 is written to the target register. This method eliminates the need to compare all the targeted addresses that are associated with the rename registers. It requires two comparisons instead of a minimum of 15 comparisons.
摘要翻译：一种结构化的检查在调度单位而不是完成单位的指令冲突的机制。在同时发出多个命令的处理器中，如果标志位都具有相同的目标地址，则将标志位发送到完成单元并附加到队列中的跟随另一命令的指令。当指令队列的位置1和位置2的指令准备发出时，完成单元检查位置2是否有一个标志位。如果有位，则丢弃位置1的指令，将位置2中的指令写入目标地址。如果位置2中的指令没有标志位，则将位置1的指令写入目标寄存器。该方法不需要比较与重命名寄存器相关的所有目标地址。它需要两次比较，而不是至少15次比较。

8. 发明授权

US5732005A Single-precision, floating-point register array for floating-point units performing double-precision operations by emulation 失效
标题翻译：用于通过仿真执行双精度操作的浮点单元的单精度浮点寄存器阵列
公开(公告)号：US5732005A
公开(公告)日：1998-03-24
申请号：US386980
申请日：1995-02-10
申请人： James Allan Kahle , Tai Dinh Ngo , Aubrey Deene Ogden , Michael Putrino , Johm Victor Sell
发明人： James Allan Kahle , Tai Dinh Ngo , Aubrey Deene Ogden , Michael Putrino , Johm Victor Sell
IPC分类号： G06F7/57 , G06F7/00 , G06F7/38
CPC分类号： G06F7/483 , G06F2207/382
摘要： A single-precision floating-point register array for a floating-point execution unit that performs double-precision operations by emulation is provided. The register array comprises a plurality of single-precision floating-point registers and a storage device that stores one or more status bits in association with each of the plurality of registers; the status bits associated with each register indicate either that the associated data register contains single-precision or integer data, or that the data for the associated register is contained in an emulated register in memory that is mapped to the associated register. When a register is a source for an operation, the status bits associated with the register are checked and the required operand data for that register is read from the register or from an emulated register mapped to that register, as a function of the state of the status bits.
摘要翻译：提供了一种用于通过仿真执行双精度操作的浮点执行单元的单精度浮点寄存器阵列。寄存器阵列包括多个单精度浮点寄存器和与多个寄存器中的每一个相关联地存储一个或多个状态位的存储器件; 与每个寄存器相关联的状态位指示相关联的数据寄存器包含单精度或整数数据，或者相关联寄存器的数据包含在映射到相关寄存器的存储器中的仿真寄存器中。当寄存器是操作的源时，检查与寄存器相关联的状态位，并且从寄存器或映射到该寄存器的仿真寄存器读取该寄存器所需的操作数数据，作为该寄存器状态的函数状态位。

9. 发明授权

US4924422A Method and apparatus for modified carry-save determination of arithmetic/logic zero results 失效
标题翻译：用于修改进位保存确定算术/逻辑零结果的方法和装置
公开(公告)号：US4924422A
公开(公告)日：1990-05-08
申请号：US157500
申请日：1988-02-17
申请人： Stamatis Vassiliadis , Michael Putrino , Ann E. Huffman , Brice J. Feal , Gerald G. Pechanek
发明人： Stamatis Vassiliadis , Michael Putrino , Ann E. Huffman , Brice J. Feal , Gerald G. Pechanek
IPC分类号： G06F7/02 , G06F7/50 , G06F7/507 , G06F7/508 , G06F7/57 , G06F9/32
CPC分类号： G06F7/57 , G06F7/026 , G06F7/505 , G06F9/30094 , G06F7/49905
摘要： The invention determines when two operands are equivalent directly from the operand without the use of an adder. In one embodiment, conditions for the sum being equal to zero are determined from half sum to carry and transmit operators derived from the input operands. These operands are used in some known types of adders and, thus may be provided from a parallel adder to the condition prediction circuitry. In another embodiment, the equations for a carry-save-adder are modified to provide a circuit specifically designed for the determination of the condition when the sum of the operands is equal to zero. This sum is equal to zero circuit greatly reduces the gate delay and gate count thus allowing the central processing unit to determine the condition prior to the actual sum of two operands. This allows the CPU to react to the condition more quickly, thus increasing overall system speed.

10. 发明授权

US4914579A Apparatus for branch prediction for computer instructions 失效
标题翻译：用于计算机指令的分支预测装置
公开(公告)号：US4914579A
公开(公告)日：1990-04-03
申请号：US157474
申请日：1988-02-17
申请人： Michael Putrino , Stamatis Vassiliadis , Ann E. Huffman , Agnes Y. Ngai
发明人： Michael Putrino , Stamatis Vassiliadis , Ann E. Huffman , Agnes Y. Ngai
IPC分类号： G06F7/00 , G06F9/32 , G06F9/38
CPC分类号： G06F9/3844
摘要： An apparatus for branch prediction for computer instructions predicts the outcome of an executing branch instruction in response to instruction operands Q, R, and B. The apparatus includes combinatorial logic for predicting a first branch condition, ((Q+R)-B)>0, or a second branch condition ((Q+R)-B).ltoreq.0.

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式