会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Apparatus and method for maintaining status flags and condition codes
using a renaming technique in an out of order floating point execution
unit
    • 用于在有序的浮点执行单元中使用重命名技术来维护状态标志和条件代码的装置和方法
    • US5826070A
    • 1998-10-20
    • US708006
    • 1996-08-30
    • Christopher Hans OlsonJeffrey Scott BrooksMartin Stanley Schmookler
    • Christopher Hans OlsonJeffrey Scott BrooksMartin Stanley Schmookler
    • G06F9/32G06F9/38G06F9/302
    • G06F9/3855G06F9/30094G06F9/3836G06F9/384G06F9/3842G06F9/3853G06F9/3857
    • An apparatus and method reduces the number of rename registers for a floating point status and control register (FPSCR) in a superscalar microprocessor executing out of order/speculative instructions. A floating point queue (FPQ) receives speculative instructions and issues out-of-order instructions to FPQ execution units, each instruction containing a group identifier tag (GID) and a target identifier tag (TID). The GID tag indicates a set of instructions bounded by interruptible or branch instructions. The TID indicates a targeted architected facility and the program order of the instruction. The FPSCR contains status and control bits for each instruction and is updated when an instruction is executed and committed. A FPSCR renaming mechanism assigns an FPSCR rename to selected FPSCR bits during instruction dispatch from an instruction fetch unit (IFU) to the FPQ when an arithmetic instruction is dispatched that has a GID which has not been committed by instruction dispatch unit (IDU) and does not already have an FPSCR rename assigned, as determined by the FPQ. The FPSCR rename mechanism utilizes the TID upon the presence of selected bits in the FPSCR. The bits in the FPSCR rename are updated as a new arithmetic instruction enters a write-back stage in the FPU. The resulting FPSCR updates of all instructions in a given GID are merged into one FPSCR rename register. A FPSCR rename register exists for each GID rather than a FPSCR rename register for each FPR rename register as in the prior art.
    • 一种装置和方法减少了执行无序/推测性指令的超标量微处理器中浮点状态和控制寄存器(FPSCR)的重命名寄存器的数量。 浮点队列(FPQ)接收推测指令并向FPQ执行单元发出无序指令,每个指令包含组标识符标签(GID)和目标标识符标签(TID)。 GID标签指示一组由可中断或分支指令限定的指令。 TID表示目标架构设施和指令的程序顺序。 FPSCR包含每条指令的状态和控制位,并在指令执行并提交时更新。 当调度具有尚未由指令分派单元(IDU)提交的GID的算术指令时,FPSCR重命名机制在从指令获取单元(IFU)到FPQ的指令分派期间将FPSCR重命名分配给所选择的FPSCR位,并且 尚未由FPQ确定的FPSCR重命名分配。 FPSCR重命名机制在FPSCR中存在选定位时利用TID。 FPSCR重命名中的位随着新的算术指令进入FPU中的回写阶段而被更新。 给定GID中的所有指令的结果FPSCR更新被合并到一个FPSCR重命名寄存器中。 对于每个GID而言,对于每个FPR重命名寄存器,存在针对每个GID的FPSCR重命名寄存器,如现有技术中那样。
    • 2. 发明授权
    • Fast alignment unit for multiply-add floating point unit
    • 用于多重加法浮点单元的快速对准单元
    • US5790444A
    • 1998-08-04
    • US727331
    • 1996-10-08
    • Christopher Hans OlsonMartin Stanley Schmookler
    • Christopher Hans OlsonMartin Stanley Schmookler
    • G06F5/01G06F7/544G06F7/00G06F7/38
    • G06F7/5443G06F5/012G06F7/483
    • A floating point arithmetic unit performs a multiply-add function B+(A*C) in which an alignment shifter is responsive to an input signal representative of the B mantissa. The shifter includes a sequential stack of multiplexers, typically three (3), for shifting the B mantissa to align it with the A*C product, and a complementer contained between two of the multiplexers to invert the signals when B is a negative number. A shift amount generator responsive to the A, B and C exponents produces control signals for the multiplexers. The shift amount generator includes a multiple input adder utilizing carry save adder and carry lookahead adder techniques to minimize delay, and separate decoders for each multiplexer or group of multiplexers. The generator also includes a Leading Zeros Anticipator (LZA) circuit for the most significant bits to limit shift amount signals that are within the shifting range of the shifter, which reduces the delay attributed to the carry lookahead adder. The multiplexers are arranged in a sequence such that the control signals for the first multiplexers are dependent only on the least significant bits and thus can be generated earliest, and therefore the delay of these multiplexers and the delay of the complementer is in parallel with the delay for producing the control signals to the last multiplexers.
    • 浮点算术单元执行其中对准移位器响应于代表B尾数的输入信号的加法函数B +(A * C)。 移位器包括一组多路复用器,通常为三(3),用于移位B尾数以将其与A * C乘积对齐,以及包含在两个多路复用器之间的补码器,以在B为负数时反转信号。 响应于A,B和C指数的移位量发生器产生用于多路复用器的控制信号。 移位量产生器包括利用进位存储加法器和进位前置加法器技术来最小化延迟的多输入加法器,以及用于每个多路复用器或多路复用器组的单独解码器。 该发生器还包括一个用于最高有效位的前导零点预期器(LZA)电路,用于限制在移位器的移位范围内的移位量信号,这减少了归因于进位前瞻加法器的延迟。 多路复用器按照这样的顺序排列,使得用于第一多路复用器的控制信号仅依赖于最低有效位,并且因此可以最早生成,因此这些多路复用器的延迟和补码器的延迟与延迟并行 用于产生到最后一个多路复用器的控制信号。
    • 5. 发明授权
    • Method and system for forwarding instructions in a processor with
increased forwarding probability
    • 用于在具有增加的转发概率的处理器中转发指令的方法和系统
    • US5878242A
    • 1999-03-02
    • US845093
    • 1997-04-21
    • Christopher Hans OlsonJeffrey Scott Brooks
    • Christopher Hans OlsonJeffrey Scott Brooks
    • G06F9/38G06F9/30
    • G06F9/3826
    • A system and method for forwarding a first instruction into a second instruction in a processor is disclosed. The processor comprises an execution unit and providing a plurality of instructions. The first instruction depends upon execution of the second instruction but does not otherwise require execution by the execution unit. The method first searches for the second instruction. The method then forwards the first instruction via the second instruction by appending a tag to the second instruction, the tag identifying the first instruction.One aspect of the method and system forwards a store instruction into a floating point instruction in a processor. The store instruction has a source address and the floating point instruction has a target address. The processor provides a plurality of instructions. The method searches for the floating point instruction that is provided before the store instruction. The method then determines if the source address is equal to the target address. The method forwards the store instruction through the floating point instruction if the source address is equal to the target address by appending a tag to the floating point instruction, the tag identifying the store instruction.
    • 公开了一种用于将第一指令转发到处理器中的第二指令的系统和方法。 处理器包括执行单元并提供多个指令。 第一条指令取决于第二条指令的执行,但是否则不需要执行单元的执行。 该方法首先搜索第二条指令。 然后,该方法通过将标签附加到第二指令,通过第二指令转发第一指令,标签识别第一指令。 该方法和系统的一个方面将存储指令转发到处理器中的浮点指令。 存储指令具有源地址,浮点指令具有目标地址。 处理器提供多个指令。 该方法搜索在存储指令之前提供的浮点指令。 该方法然后确定源地址是否等于目标地址。 如果源地址等于目标地址,则该方法通过将浮点指令附加到浮点指令(标识存储指令)的标记来转发存储指令。
    • 7. 发明授权
    • Method and system for performing a high speed floating point add
operation
    • 执行高速浮点加法运算的方法和系统
    • US5790445A
    • 1998-08-04
    • US641307
    • 1996-04-30
    • Lee Evan EisenTimothy Alan ElliottRobert Thaddeus GollaChristopher Hans Olson
    • Lee Evan EisenTimothy Alan ElliottRobert Thaddeus GollaChristopher Hans Olson
    • G06F5/01G06F7/485G06F7/50G06F7/38
    • G06F7/485G06F5/012
    • A system and method for calculating a floating point add/subtract of a plurality of floating point operands is disclosed. The system comprises at least one pair of data paths. Each pair of data paths comprises a first data path and a second data path. The first data path includes a first aligner, a first adder coupled to the first aligner, and a first normalizer coupled to the first adder. The first normalizer is capable of shifting a mantissa by a substantially smaller number of digits than the first aligner. The second data path comprises control logic, a second aligner coupled to the control logic, a second adder coupled to the second aligner, and a second normalizer coupled to the second adder. The control logic provides a control signal that is responsive to a first predetermined number of digits of each exponent of a pair of exponents. The pair of exponents are the exponents for a pair of inputs to the second data path. The second aligner is responsive to the control signal provided by the control logic. In addition, the second normalizer is capable of shifting a mantissa by a substantially larger number of digits than the second aligner.
    • 公开了一种用于计算多个浮点操作数的浮点加法/减法的系统和方法。 该系统包括至少一对数据路径。 每对数据路径包括第一数据路径和第二数据路径。 第一数据路径包括第一对准器,耦合到第一对准器的第一加法器和耦合到第一加法器的第一归一化器。 第一标准器能够将尾数移位比第一对准器小得多的位数。 第二数据路径包括控制逻辑,耦合到控制逻辑的第二对准器,耦合到第二对准器的第二加法器以及耦合到第二加法器的第二归一化器。 控制逻辑提供响应于一对指数的每个指数的第一预定数量位数的控制信号。 一对指数是对于第二数据路径的一对输入的指数。 第二对准器响应于由控制逻辑提供的控制信号。 此外,第二归一化器能够将尾数移动比第二对准器大得多的位数。
    • 10. 发明授权
    • Floating point split multiply/add system which has infinite precision
    • 具有无限精度的浮点分割乘法/加法系统
    • US5880983A
    • 1999-03-09
    • US620733
    • 1996-03-25
    • Timothy Alan ElliottRobert Thaddeus GollaChristopher Hans OlsonTerence Matthew Potter
    • Timothy Alan ElliottRobert Thaddeus GollaChristopher Hans OlsonTerence Matthew Potter
    • G06F7/544G06F7/38
    • G06F7/5443G06F7/483G06F7/49942
    • A method and system for an infinite precision split multiply and add operation which has increased speed. The method and system for providing a split multiply and add of a plurality of operands include a multiplier and an adder means. The multiplier multiplies a first portion of the plurality of operands, thereby providing a product. The adder, which combines the remaining operands and the product, comprise at least one pair of data paths. Each pair of data paths comprises a first data path and a second data path. The first data path comprises a first aligner, a first adder, and a first normalizer capable of shifting a mantissa by a substantially fewer number digits than the aligner. The second data path comprises a second aligner, a second adder, and a second normalizer capable of shifting a mantissa by a substantially larger number of digits than the aligner. Accordingly, the present invention includes split multiply and add data paths which, individually, are faster than a fused multiply and add. In addition, the split multiply and add data paths can preserve the appearance of infinite precision. Consequently, overall system performance is increased.
    • 一种用于无限精密分割乘法和加法运算的方法和系统,其具有增加的速度。 用于提供多个操作数的分割乘法和相加的方法和系统包括乘法器和加法器装置。 乘法器乘以多个操作数的第一部分,从而提供乘积。 组合剩余操作数和乘积的加法器包括至少一对数据路径。 每对数据路径包括第一数据路径和第二数据路径。 第一数据路径包括第一对准器,第一加法器和第一归一化器,其能够将尾数与对准器相比更少的数字位移。 第二数据路径包括第二对准器,第二加法器和第二归一化器,其能够将尾数移位比对准器大得多的位数。 因此,本发明包括分离的乘法和加法数据路径,其分别比融合乘法和加法更快。 此外,拆分乘法和添加数据路径可以保持无限精度的外观。 因此,整体系统性能提高。