会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明申请
    • System and method for performing floating point store folding
    • 执行浮点存储折叠的系统和方法
    • US20060179100A1
    • 2006-08-10
    • US11054686
    • 2005-02-09
    • Juergen HaessMichael KroenerDung NguyenLawrence PowellEric SchwarzSon Dao-TrongRaymond Yeung
    • Juergen HaessMichael KroenerDung NguyenLawrence PowellEric SchwarzSon Dao-TrongRaymond Yeung
    • G06F7/38
    • G06F9/3826G06F9/30014G06F9/3824G06F9/3838G06F9/3885
    • A system for performing floating point arithmetic operations including a plurality of stages making up a pipeline, the stages including a first stage and a last stage. The system also includes a register file adapted for receiving a store instruction for input to the pipeline, where the data associated with the store instruction is dependent on a previous operation still in the pipeline. The system further includes a store register adapted for outputting the data associated with the store instruction to memory and a control unit having instructions. The instructions are directed to inputting the store instruction into the pipeline and to providing a path for forwarding the data associated with the store instruction from the last stage in the pipeline to the store register for use by the store instruction if the previous operation immediately precedes the store operation in the pipeline and if there is a data type match between the store instruction and the previous operation. In addition, the instructions are directed to inputting the store instruction into the pipeline and to providing a path for forwarding the data associated with the store instruction from the first stage in the pipeline to the store register for use by the store instruction if the previous operation precedes the store operation by one or more stage in the pipeline and if there is a data type match between the store instruction and the previous operation.
    • 一种用于执行浮点算术运算的系统,包括构成流水线的多个级,所述级包括第一级和最后级。 该系统还包括适于接收用于输入到流水线的存储指令的寄存器文件,其中与存储指令相关联的数据依赖于仍在流水线中的先前操作。 该系统还包括适于将与存储指令相关联的数据输出到存储器的存储寄存器和具有指令的控制单元。 这些指令旨在将存储指令输入到流水线中,并且提供一个路径,用于将与流水线中的最后一级相关联的存储指令的数据转发到存储寄存器以供存储指令使用,如果先前的操作紧接在 存储操作在流水线中,并且存储指令与先前操作之间存在数据类型匹配。 此外,该指令旨在将存储指令输入到流水线中,并且提供用于将与流水线中的第一级相关联的存储指令的数据转发到存储寄存器以供存储指令使用的路径,如果先前的操作 在存储操作之前在流水线中的一个或多个阶段,以及存储指令和先前操作之间是否存在数据类型匹配。
    • 4. 发明申请
    • System and method for processing limited out-of-order execution of floating point loads
    • 用于处理浮点负载有限次序执行的系统和方法
    • US20060179286A1
    • 2006-08-10
    • US11054201
    • 2005-02-09
    • Juergen HaessMichael KroenerDung NguyenEric SchwarzSon Dao-TrongRaymond Yeung
    • Juergen HaessMichael KroenerDung NguyenEric SchwarzSon Dao-TrongRaymond Yeung
    • G06F9/44
    • G06F9/3867G06F9/3838
    • A system for performing limited out-of order execution of floating point loads. The system includes a plurality of stages making up a pipeline, the stages including an early stage. The system also includes a mechanism for inputting an arithmetic instruction into the pipeline, the arithmetic instruction including a result address. The mechanism also determines if the arithmetic instruction causes a write after write (WAW) condition to occur before writing a result of the arithmetic instruction to the result address. The determining includes comparing the result address to a load address associated with a load instruction subsequent to the arithmetic instruction in the pipeline. The load data associated with the load instruction was written to the load address in the early stage of the pipeline. A WAW condition occurs if the result address is equal to the load address. Writing a result of the arithmetic instruction is suppressed in response to the WAW condition occurring.
    • 用于执行浮点负载有限次序执行的系统。 该系统包括构成管道的多个阶段,这些阶段包括早期阶段。 该系统还包括用于将算术指令输入流水线的机构,算术指令包括结果地址。 该机制还确定在将算术指令的结果写入结果地址之前,算术指令是否在写入(WAW)条件之后发生写入。 确定包括将结果地址与在流水线中的算术指令之后的加载指令相关联的加载地址进行比较。 与加载指令相关联的加载数据在管道的早期阶段被写入加载地址。 如果结果地址等于加载地址,则会发生WAW条件。 响应于发生的WAW状态,写入算术指令的结果被抑制。
    • 5. 发明申请
    • Zero detect in partial sums while adding
    • 零点检测部分和,而添加
    • US20060184603A1
    • 2006-08-17
    • US11056036
    • 2005-02-11
    • Son TrongMark ErleBruce FleischerJuergen HaessMichael KellyKlaus KroenerMartin SchmooklerEric Schwarz
    • Son TrongMark ErleBruce FleischerJuergen HaessMichael KellyKlaus KroenerMartin SchmooklerEric Schwarz
    • G06F7/52
    • G06F7/53G06F7/74
    • The present invention relates to a method and circuit for performing multiply-operations in an arithmetic unit of a computer processor. In a multiplier thereof, zero detection of the resulting product bit string (22) is needed for a proper setting of condition code and overflow status information. Zero detection according to prior art decreases the calculation speed in the multiplier. In order to provide a method and respective electronic circuit, wherein the zero detection is earlier completed, it is proposed to use a leading zero anticipation (LZA) hardware—i.e., an LZA circuit (40), which exists usually anyway in floating point processor adders for calculating the number of leading zeros for operand normalization purposes—for performing a zero detection of the product by aid of the partial results (16, 17) emerging at the output of the Wallace tree of the multiplier. MSB-most and LSB-most margin bits (24, 26) of the partial (16, 17) results which cannot be processed by the LZA circuit (40), are read directly from the final product bit string (22).
    • 本发明涉及一种用于在计算机处理器的运算单元中执行乘法运算的方法和电路。 在其乘数中,为了适当地设置条件代码和溢出状态信息,需要对所得到的产品位串(22)进行零检测。 根据现有技术的零检测降低了乘法器中的计算速度。 为了提供一种方法和各自的电子电路,其中零检测较早完成,提出使用前导零预期(LZA)硬件即LZA电路(40),其通常在浮点处理器 用于计算操作数归一化目的的前导零数量的加法器,用于通过在乘法器的华莱士树的输出处出现的部分结果(16,17)执行零检测。 从最终产品位串(22)直接读取不能由LZA电路(40)处理的部分(16,17)结果的MSB最大和LSB最大的边缘位(24,26)。
    • 6. 发明授权
    • Floating point unit with fused multiply add and method for calculating a result with a floating point unit
    • 具有融合乘法的浮点单元和用浮点单元计算结果的方法
    • US07461117B2
    • 2008-12-02
    • US11055812
    • 2005-02-11
    • Son Dao TrongJuergen HaessChristian JacobiKlaus Michael KroenerSilvia Melitta MuellerJochen Preiss
    • Son Dao TrongJuergen HaessChristian JacobiKlaus Michael KroenerSilvia Melitta MuellerJochen Preiss
    • G06F7/483
    • G06F7/483G06F7/5443
    • The invention proposes a Floating Point Unit (1) with fused multiply add, with one addend operand (eb, fb) and two multiplicand operands (ea, fa; ec, fc), with a shift amount logic (2) which based on the exponents of the operands (ea, eb and ec) computes an alignment shift amount, with an alignment logic (3) which uses the alignment shift amount to align the fraction (fb) of the addend operand, with a multiply logic (4) which multiplies the fractions of the multiplicand operands (fa, fc), with a adder logic (5) which adds the outputs of the alignment logic (3) and the multiply logic (4), with a normalization logic (6) which normalizes the output of the adder logic (5), which is characterized in that a leading zero logic (7) is provided which computes the number of leading zeros of the fraction of the addend operand (fb), and that a compare logic (8) is provided which based on the number of leading zeros and the alignment shift amount computes select signals that indicate whether the most significant bits of the alignment logic (3) output have all the same value in order to: a) control the carry logic of the adder logic (5) and/or b) control a stage of the normalization logic (6).
    • 本发明提出了一种具有融合乘法运算的浮点单元(1),具有一个加数运算数(eb,fb)和两个被乘数运算符(ea,fa; ec,fc),其中移位量逻辑(2)基于 操作数(ea,eb和ec)的指数利用对准逻辑(3)计算对准偏移量,该对准逻辑(3)使用对准移位量来对齐加数操作数的分数(fb)与乘法逻辑(4) 将乘法器操作数(fa,fc)的分数与加法器逻辑(5)相乘,该逻辑(5)将对准逻辑(3)和乘法逻辑(4)的输出与归一化逻辑(6)进行归一化,归一化逻辑(6) 加法器逻辑(5)的特征在于提供一个前导零逻辑(7),其计算加法运算数(fb)的分数的前导零的数量,并且提供比较逻辑(8) 其基于前导零的数量和对准偏移量计算指示mo的选择信号 对准逻辑(3)输出的高有效位具有全部相同的值,以便:a)控制加法器逻辑(5)的进位逻辑和/或b)控制归一化逻辑(6)的阶段。
    • 9. 发明授权
    • System and method for performing floating point store folding
    • 执行浮点存储折叠的系统和方法
    • US07188233B2
    • 2007-03-06
    • US11054686
    • 2005-02-09
    • Juergen HaessMichael KroenerDung Quoc NguyenLawrence J. Powell, Jr.Eric M. SchwarzSon Dao-TrongRaymond C. Yeung
    • Juergen HaessMichael KroenerDung Quoc NguyenLawrence J. Powell, Jr.Eric M. SchwarzSon Dao-TrongRaymond C. Yeung
    • G06F9/312
    • G06F9/3826G06F9/30014G06F9/3824G06F9/3838G06F9/3885
    • A system for performing floating point arithmetic operations including a plurality of stages making up a pipeline, the stages including a first stage and a last stage. The system also includes a register file adapted for receiving a store instruction for input to the pipeline, where the data associated with the store instruction is dependent on a previous operation still in the pipeline. The system further includes a store register adapted for outputting the data associated with the store instruction to memory and a control unit having instructions. The instructions are directed to inputting the store instruction into the pipeline and to providing a path for forwarding the data associated with the store instruction from the last stage in the pipeline to the store register for use by the store instruction if the previous operation immediately precedes the store operation in the pipeline and if there is a data type match between the store instruction and the previous operation. In addition, the instructions are directed to inputting the store instruction into the pipeline and to providing a path for forwarding the data associated with the store instruction from the first stage in the pipeline to the store register for use by the store instruction if the previous operation precedes the store operation by one or more stage in the pipeline and if there is a data type match between the store instruction and the previous operation.
    • 一种用于执行浮点算术运算的系统,包括构成流水线的多个级,所述级包括第一级和最后级。 该系统还包括适于接收用于输入到流水线的存储指令的寄存器文件,其中与存储指令相关联的数据依赖于仍在流水线中的先前操作。 该系统还包括适于将与存储指令相关联的数据输出到存储器的存储寄存器和具有指令的控制单元。 这些指令旨在将存储指令输入到流水线中,并且提供一个路径,用于将与流水线中的最后一级相关联的存储指令的数据转发到存储寄存器以供存储指令使用,如果先前的操作紧接在 存储操作在流水线中,并且存储指令与先前操作之间存在数据类型匹配。 此外,该指令旨在将存储指令输入到流水线中,并且提供用于将与流水线中的第一级相关联的存储指令的数据转发到存储寄存器以供存储指令使用的路径,如果先前的操作 在存储操作之前在流水线中的一个或多个阶段,以及存储指令和先前操作之间是否存在数据类型匹配。
    • 10. 发明授权
    • High-sticky calculation in pipelined fused multiply/add circuitry
    • 流水线融合乘法/加法电路中的高粘度计算
    • US07392273B2
    • 2008-06-24
    • US10732039
    • 2003-12-10
    • Guenter GerwigJuergen HaessKlaus Michael Kroener
    • Guenter GerwigJuergen HaessKlaus Michael Kroener
    • G06F7/485G06F7/787
    • G06F7/483G06F5/012G06F7/5443
    • Arithmetic processing circuits in a circuit in a floating point processor having a fused multiply/ADD circuitry. In order to avoid waiting cycles in the normalizer of the floating point arithmetic, control logic calculates in an extremely early state of the overall Multiply/Add processing. Parts of the intermediate add result are significant and have to be selected in the pre-normalizer multiplexer to be fed to the normalizer by counting the leading zero bits (LAB) of the addend in a dedicated circuit right at the beginning of the pipe. LAB is added to the shift amount (SA) that is calculated to align the addend and is then compared with the width of the incrementer. If the sum of (SA+LAB) is larger than the width of the incrementer, which is a constant value, then no significant bits are in the high-part of the intermediate result, and the pre-normalizer multiplexer selects the data from a second predetermined position, otherwise from a first predetermined position.
    • 具有融合乘法/ ADD电路的浮点处理器中的电路中的算术处理电路。 为了避免浮点运算的归一化器中的等待周期,控制逻辑在整体乘法/加法处理的极早期状态下进行计算。 中间加法结果的部分是重要的,必须在预归一化器多路复用器中选择,以通过在管道开头右侧的专用电路中的加数的前导零比特(LAB)进行计数来馈送到归一化器。 将LAB加到被计算以对齐加数的移位量(SA),然后与增量器的宽度进行比较。 如果(SA + LAB)的和大于作为常数值的增量器的宽度,则中间结果的高部分中没有有效位,并且预标准化器多路复用器选择来自 第二预定位置,否则从第一预定位置。