会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 42. 发明申请
    • SYSTEM AND METHOD OF BYPASSING UNROUNDED RESULTS IN A MULTIPLY-ADD PIPELINE UNIT
    • 在多用途管道单元中排除未结果的系统和方法
    • US20120233234A1
    • 2012-09-13
    • US13043101
    • 2011-03-08
    • Jeffrey S. BrooksChristopher H. Olson
    • Jeffrey S. BrooksChristopher H. Olson
    • G06F7/00
    • G06F7/49947G06F7/483G06F7/5318G06F7/5338G06F7/5443G06F2207/3884
    • A processing unit, system, and method for performing a multiply operation in a multiply-add pipeline. To reduce the pipeline latency, the unrounded result of a multiply-add operation is bypassed to the inputs of the multiply-add pipeline for use in a subsequent operation. If it is determined that rounding is required for the prior operation, then the rounding will occur during the subsequent operation. During the subsequent operation, a Booth encoder not utilized by the multiply operation will output a rounding correction factor as a selection input to a Booth multiplexer not utilized by the multiply operation. When the Booth multiplexer receives the rounding correction factor, the Booth multiplexer will output a rounding correction value to a carry save adder (CSA) tree, and the CSA tree will generate the correct sum from the rounding correction value and the other partial products.
    • 一种用于在多重加法管线中执行乘法运算的处理单元,系统和方法。 为了减少流水线延迟,乘法运算的未包围结果被旁路到乘法加法管道的输入端,用于后续操作。 如果确定先前操作需要舍入,则在随后的操作期间将进行舍入。 在随后的操作期间,未被乘法运算使用的布斯编码器将输出舍入校正因子作为选择输入到未被乘法运算使用的布斯多路复用器。 当布斯多路复用器接收舍入校正因子时,布尔多路复用器将输出舍入校正值到进位保存加法器(CSA)树,并且CSA树将从舍入校正值和其他部分乘积生成正确的和。
    • 43. 发明授权
    • Handling multi-cycle integer operations for a multi-threaded processor
    • 处理多线程处理器的多循环整数运算
    • US08195919B1
    • 2012-06-05
    • US11927177
    • 2007-10-29
    • Christopher H. OlsonRobert T. GollaManish ShahJeffrey S. Brooks
    • Christopher H. OlsonRobert T. GollaManish ShahJeffrey S. Brooks
    • G06F13/00
    • G06F12/0842G06F9/3001G06F9/30043G06F9/3824G06F12/0855
    • Determining an effective address of a memory with a three-operand add operation in single execution cycle of a multithreaded processor that can access both segmented memory and non-segmented memory. During that cycle, the processor determines whether a memory segment base is zero. If the segment base is zero, the processor can access a memory location at the effective address without adding the segment base. If the segment base is not zero, such as when executing legacy code, the processor consumes another cycle to add the segment base to the effective address. Similarly, the processor consumes another cycle if the effective address or the linear address is misaligned. An integer execution unit that performs the three-operand add using a carry-save adder coupled to a carry look-ahead adder. If the segment base is not zero, the effective address is fed back through the integer execution unit to add the segment base.
    • 在可以访问分段存储器和非分段存储器的多线程处理器的单个执行周期中确定具有三操作数添加操作的存储器的有效地址。 在该周期期间,处理器确定存储器段基数是否为零。 如果分段基数为零,则处理器可以在有效地址的情况下访问存储器位置,而不添加分段基。 如果段基数不为零,例如执行遗留代码时,处理器消耗另一个周期,将段基数添加到有效地址。 类似地,如果有效地址或线性地址不对齐,则处理器消耗另一个周期。 整数执行单元,其使用耦合到进位先行加法器的进位保存加法器来执行三运算加法。 如果段基数不为零,则通过整数执行单元反馈有效地址以添加段基。
    • 47. 发明授权
    • Apparatus and method for reducing execution latency of floating point operations having special case operands
    • 具有特殊情况操作数的浮点运算减少执行延迟的装置和方法
    • US07437538B1
    • 2008-10-14
    • US10881763
    • 2004-06-30
    • Jeffrey S. BrooksChristopher H. Olson
    • Jeffrey S. BrooksChristopher H. Olson
    • G06F9/40
    • G06F9/3885G06F9/30014G06F9/30123G06F9/3013G06F9/3828G06F9/3836G06F9/3851G06F9/3857G06F9/3877G06F9/3891
    • An apparatus and method for floating-point special case handling. In one embodiment, a processor may include a first execution unit configured to execute a longer-latency floating-point instruction, and a second execution unit configured to execute a shorter-latency floating-point instruction. In response to the longer-latency floating-point instruction being issued to the first execution unit, the second execution unit may be further configured to detect whether a result of the longer-latency floating-point instruction is determinable from one or more operands of the longer-latency floating-point instruction independently of the first execution unit executing the longer-latency floating-point instruction. In response to detecting that the result is determinable, the second execution unit may be further configured to flush the longer-latency floating-point instruction from the first execution unit and to determine the result.
    • 一种用于浮点特殊情况处理的装置和方法。 在一个实施例中,处理器可以包括被配置为执行较长延迟浮点指令的第一执行单元,以及被配置为执行较短延迟浮点指令的第二执行单元。 响应于向第一执行单元发出的较长延迟的浮点指令,第二执行单元还可以被配置为检测长延迟浮点指令的结果是否可以从所述第一执行单元的一个或多个操作数确定 更长延迟的浮点指令独立于执行较长延迟浮点指令的第一执行单元。 响应于检测到结果是可确定的,第二执行单元可以被进一步配置为从第一执行单元刷新长延迟浮点指令并确定结果。