会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 22. 发明授权
    • Decimal multiplication for superscaler processors
    • 超标量处理器的十进制乘法
    • US07412476B2
    • 2008-08-12
    • US11460296
    • 2006-07-27
    • Fadi Y. BusabaSteven R. CarloughChristopher A. KrygowskiJohn G. Rell, Jr.
    • Fadi Y. BusabaSteven R. CarloughChristopher A. KrygowskiJohn G. Rell, Jr.
    • G06F7/523
    • G06F9/3001G06F7/496
    • A method for decimal multiplication in a superscaler processor comprising: obtaining a first operand and a second operand; establishing a multiplier and an effective multiplicand from the first operand and the second operand; and generating and accumulating a partial product term every two cycles. The partial product terms are created from the effective multiplicand and multiples of the multiplier, where the effective multiplicand is stored in a first register file, the multiples being ones times the effective multiplier, two times the effective multiplier, four times the effective multiplier and eight times the effective multiplier and the partial product terms are added to an accumulation of previous partial product terms shifted one digit right such that a digit shifted off is preserved as a result digit.
    • 一种用于在超标量处理器中进行十进制相乘的方法,包括:获得第一操作数和第二操作数; 从第一个操作数和第二个操作数建立乘数和有效的被乘数; 并且每两个周期产生和累积部分乘积项。 部分乘积项是从乘法器的有效乘数和乘数创建的,其中有效被乘数存储在第一个寄存器文件中,倍数是有效乘数的倍数,有效乘数的两倍,有效乘数的四倍和八倍 乘以有效乘数和部分乘积项添加到前一个部分乘积项的累积中,该乘积项被移位一位数字,使得数字移位被保留为结果位。
    • 23. 发明授权
    • Storage pre-alignment and EBCDIC, ASCII and unicode basic latin conversions for packed decimal data
    • 用于打包十进制数据的存储预对齐和EBCDIC,ASCII和unicode基本拉丁转换
    • US07167968B2
    • 2007-01-23
    • US10834637
    • 2004-04-29
    • Fadi Y. BusabaSteven R. CarloughMark A. CheckChristopher A. KrygowskiJohn G. Rell, Jr.Frank Tanzi
    • Fadi Y. BusabaSteven R. CarloughMark A. CheckChristopher A. KrygowskiJohn G. Rell, Jr.Frank Tanzi
    • G06F12/06
    • G06F9/30025G06F9/30032G06F9/30036G06F9/30043G06F9/3816G06F9/3824G06F12/0886
    • A method of pre-aligning data for storage during instruction execution improves performance by eliminating the cycles otherwise required for data alignment. The method can convert data between ASCII and Packed Decimal format, and between Unicode Basic Latin and Packed Decimal format. Conversion to Packed Decimal format is needed for decimal hardware in a microprocessor designed to generate decimal results. Converting from Packed Decimal to ASCII and Unicode Basic Latin is necessary to report Decimal Arithmetic results in a required format for the application program. To further improve performance, all available write ports in the fixed point unit (FXU) are utilized to reduce the number of cycles necessary to store results. To prevent data fetching of the unused destination data from slowing down instruction execution, the destination locations are tested for storage access exceptions, but the data for these operands are not actually fetched. A single read request from the FXU to the operand buffers effectively reads the entire destination address (up to 8 double-words of data) in a single cycle.
    • 在指令执行期间预先对准用于存储的数据的方法通过消除数据对准所需的周期来提高性能。 该方法可以在ASCII和Packed Decimal格式之间以及Unicode Basic Latin和Packed Decimal格式之间转换数据。 转换为打包十进制硬件需要十进制格式,用于生成十进制结果的微处理器。 从包装十进制转换为ASCII和Unicode基本拉丁文需要以应用程序所需的格式报告十进制算术结果。 为了进一步提高性能,利用固定点单元(FXU)中的所有可用写入端口来减少存储结果所需的周期数。 为了防止数据获取未使用的目标数据缓慢的指令执行,目标位置被测试存储访问异常,但是这些操作数的数据实际上并没有被提取。 从FXU到操作数缓冲区的单个读取请求在单个周期中有效读取整个目标地址(最多8个双字的数据)。
    • 24. 发明授权
    • Decimal multiplication for superscaler processors
    • 超标量处理器的十进制乘法
    • US07167889B2
    • 2007-01-23
    • US10436392
    • 2003-05-12
    • Fadi Y. BusabaSteven R. CarloughChristopher A. KrygowskiJohn G. Rell, Jr.
    • Fadi Y. BusabaSteven R. CarloughChristopher A. KrygowskiJohn G. Rell, Jr.
    • G06F7/523
    • G06F9/3001G06F7/496
    • A method for decimal multiplication in a superscaler processor comprising: obtaining a first operand and a second operand; establishing a multiplier and an effective multiplicand from the first operand and the second operand; and generating and accumulating a partial product term every two cycles. The partial product terms are created from the effective multiplicand and multiples of the multiplier, where the effective multiplicand is stored in a first register file, the multiples being ones times the effective multiplier, two times the effective multiplier, four times the effective multiplier and eight times the effective multiplier and the partial product terms are added to an accumulation of previous partial product terms shifted one digit right such that a digit shifted off is preserved as a result digit.
    • 一种用于在超标量处理器中进行十进制相乘的方法,包括:获得第一操作数和第二操作数; 从第一个操作数和第二个操作数建立乘数和有效的被乘数; 并且每两个周期产生和累积部分乘积项。 部分乘积项是从乘法器的有效乘数和乘数创建的,其中有效被乘数存储在第一个寄存器文件中,倍数是有效乘数的倍数,有效乘数的两倍,有效乘数的四倍和八倍 乘以有效乘数和部分乘积项添加到前一个部分乘积项的累积中,该乘积项被移位一位数字,使得数字移位被保留为结果位。
    • 26. 发明授权
    • Modular binary multiplier for signed and unsigned operands of variable widths
    • 具有可变宽度的有符号和无符号操作数的模块二进制乘法器
    • US07266580B2
    • 2007-09-04
    • US10435976
    • 2003-05-12
    • Fadi Y. BusabaSteven R. CarloughDavid S. HuttonChristopher A. KrygowskiJohn G. Rell, Jr.Sheryll H. Veneracion
    • Fadi Y. BusabaSteven R. CarloughDavid S. HuttonChristopher A. KrygowskiJohn G. Rell, Jr.Sheryll H. Veneracion
    • G06F7/52
    • G06F7/5324G06F7/5332G06F9/30014G06F2207/3816
    • A method and apparatuses for performing binary multiplication on signed and unsigned operands of various lengths is discussed herein. It is a concept that may be split into two parts, the first of which is the multiplication hardware itself, a compact, less than-full sized multiplier employing Booth or other type of recoding methods upon the multiplier to reduce the number of partial products per scan, and implemented in such a manner so that a multiplication operation with large operands may be broken into subgroups of operations that will fit into this mid-sized multiplier whose results, here called modular products, may be knitted back together to form a correct, final product. The second part of the concept is the supporting hardware used to separate the operands into subgroups and input the data and control signals to the multiplier, and the algorithms and apparatuses used to align and combine the modular products properly to obtain the final product. These algorithms used to obtain a result as specified by the operation may be as varied as the supporting hardware with which the multiplier may be used, making this multiplier a very flexible and powerful design.
    • 本文中讨论了用于对具有各种长度的有符号和无符号操作数进行二进制乘法的方法和装置。 这是一个概念,可以分为两部分,第一部分是乘法硬件本身,紧凑型,小于满量程的乘法器,在乘数上使用Booth或其他类型的重新编码方法,以减少每个部分产品的数量 扫描和实现,使得具有大操作数的乘法运算可以被分解成适合于该中型乘法器的操作子组,其结果(这里称为模块化产品)可以针织在一起以形成正确的, 完成品。 该概念的第二部分是用于将操作数分成子组并将数据和控制信号输入到乘法器的支持硬件,以及用于对准和组合模块化产品以获得最终产品的算法和装置。 用于获得由操作指定的结果的这些算法可以与可以使用乘法器的支持硬件一样变化,使得该乘法器是非常灵活和强大的设计。
    • 29. 发明申请
    • METHOD, SYSTEM, AND COMPUTER PROGRAM PRODUCT FOR SELECTIVELY ACCELERATING EARLY INSTRUCTION PROCESSING
    • 方法,系统和计算机程序产品,用于选择性加速早期指导处理
    • US20090217005A1
    • 2009-08-27
    • US12037861
    • 2008-02-26
    • Khary J. AlexanderFadi Y. BusabaBruce C. GiameiDavid S. HuttonChung-Lung K. Shum
    • Khary J. AlexanderFadi Y. BusabaBruce C. GiameiDavid S. HuttonChung-Lung K. Shum
    • G06F9/30
    • G06F9/3826G06F9/3836
    • A method for selectively accelerating early instruction processing including receiving an instruction data that is normally processed in an execution stage of a processor pipeline, wherein a configuration of the instruction data allows a processing of the instruction data to be accelerated from the execution stage to an address generation stage that occurs earlier in the processor pipeline than the execution stage, determining whether the instruction data can be dispatched to the address generation stage to be processed without being delayed due to an unavailability of a processing resource needed for the processing of the instruction data in the address generation stage, dispatching the instruction data to be processed in the address generation stage if it can be dispatched without being delayed due to the unavailability of the processing resource, and dispatching the instruction data to be processed in the execution stage if it can not be dispatched without being delayed due to the unavailability of the processing resource, wherein the processing of the instruction data is selectively accelerated using an address generation interlock scheme. A corresponding system and computer program product.
    • 一种用于选择性地加速早期指令处理的方法,包括接收在处理器流水线的执行阶段中正常处理的指令数据,其中指令数据的配置允许指令数据的处理从执行阶段加速到地址 在处理器流水线中比执行阶段更早发生的生成阶段,确定指令数据是否可以被分派到要处理的地址生成阶段,而不会由于处理指令数据所需的处理资源的不可用而被延迟 地址生成阶段,如果能够由于处理资源的不可用而被分派而不被延迟,则在地址生成阶段调度要处理的指令数据,并且如果不能在执行阶段调度要处理的指令数据 由于你而不被推迟 处理资源的可用性,其中使用地址生成互锁方案选择性地加速指令数据的处理。 相应的系统和计算机程序产品。