会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Hybrid queue for storing instructions from fetch queue directly in out-of-order queue or temporarily in in-order queue until space is available
    • 用于存储来自提取队列的指令的混合队列,直接在乱序队列中或临时按顺序队列直到空间可用
    • US09164772B2
    • 2015-10-20
    • US13357652
    • 2012-01-25
    • Kenneth Alan DockserYusuf Cagatay Tekmen
    • Kenneth Alan DockserYusuf Cagatay Tekmen
    • G06F9/30G06F9/38
    • G06F9/3836G06F9/3814G06F9/3855G06F9/3877
    • A queuing apparatus having a hierarchy of queues, in one of a number of aspects, is configured to control backpressure between processors in a multiprocessor system. A fetch queue is coupled to an instruction cache and configured to store first instructions for a first processor and second instructions for a second processor in an order fetched from the instruction cache. An in-order queue is coupled to the fetch queue and configured to store the second instructions accepted from the fetch queue in response to a write indication. An out-of-order queue is coupled to the fetch queue and to the in-order queue and configured to store the second instructions accepted from the fetch queue in response to an indication that space is available in the out-of-order queue, wherein the second instructions may be accessed out-of-order with respect to other second instructions executing on different execution pipelines.
    • 在多个方面之一中具有队列层级的排队装置被配置为控制多处理器系统中的处理器之间的背压。 获取队列被耦合到指令高速缓存并且被配置为按照从指令高速缓存取出的顺序存储第一处理器的第一指令和第二处理器的第二指令。 顺序队列被耦合到获取队列并且被配置为响应于写指示来存储从获取队列接受的第二指令。 无序队列被耦合到所述获取队列和所述按顺序队列,并且被配置为响应于所述无序队列中的空间可用的指示来存储从所述获取队列接受的所述第二指令, 其中所述第二指令可以相对于在不同执行管线上执行的其它第二指令无序地被访问。
    • 2. 发明授权
    • Mode-based multiply-add recoding for denormal operands
    • 基于模式的乘法加法重新编码用于反常操作数
    • US08447800B2
    • 2013-05-21
    • US13026335
    • 2011-02-14
    • Kenneth Alan DockserPathik Sunil Lall
    • Kenneth Alan DockserPathik Sunil Lall
    • G06F7/38
    • G06F7/49915G06F7/4876G06F7/5443G06F2207/3884
    • In a denormal support mode, the normalization circuit of a floating-point adder is used to normalize or denormalized the output of a floating-point multiplier. Each floating-point multiply instruction is speculatively converted to a multiply-add instruction, with the addend forced to zero. This preserves the value of the product, while normalizing or denormalizing the product using the floating-point adder's normalization circuit. When the operands to the multiply operation are available, they are inspected. If the operands will not generate an unnormal intermediate product or a denormal final product, the add operation is suppressed, such as by operand-forwarding. Additionally, each non-fused floating-point multiply-add instruction is replaced with a multiply-add instruction having a zero addend, and a floating-point add instruction having the addend of the original multiply-add instruction is inserted into the instruction stream. Upon inspection of the operands, if an unnormal intermediate result or a denormal final result will not occur, the addend may be restored to the multiply-add instruction and the add instruction converted to a NOP.
    • 在非正常支持模式中,浮点加法器的归一化电路用于对浮点乘法器的输出进行归一化或非归一化。 每个浮点乘法指令被推测转换为乘法加法指令,加数被强制为零。 这将保留产品的价值,同时使用浮点加法器的归一化电路对产品进行规范化或非规范化。 当乘法运算的操作数可用时,它们将被检查。 如果操作数不会产生非正常的中间产品或非正常的最终产品,则通过操作数转发来抑制添加操作。 此外,每个非融合浮点乘法指令被替换为具有零加法的加法指令,并且具有原始加法指令的加数的浮点加法指令被插入到指令流中。 在检查操作数时,如果不会发生非正常的中间结果或非正常的最终结果,则可以将加数恢复为乘法指令,并将加法指令转换为NOP。
    • 3. 发明授权
    • Floating-point processor with selectable subprecision
    • 具有可选择精度的浮点处理器
    • US07725519B2
    • 2010-05-25
    • US11244492
    • 2005-10-05
    • Kenneth Alan Dockser
    • Kenneth Alan Dockser
    • G06F7/483
    • G06F7/483G06F9/30014G06F9/30141G06F9/30189G06F2207/3816
    • A floating-point processor with selectable subprecision includes a register configured to store a plurality of bits in a floating-point format, a controller, and a floating-point mathematical operator. The controller is configured to select a subprecision for a floating-point operation, in response to user input. The controller is configured to determine a subset of the bits, in accordance with the selected subprecision. The floating-point operator is configured to perform the floating-point operation using only the subset of the bits. Excess bits that are not used in the floating-point operation may be forced into a low-leakage state. The output value resulting from the floating-point operation is either truncated or rounded to the selected subprecision.
    • 具有可选择精度的浮点处理器包括被配置为以浮点格式存储多个位的寄存器,控制器和浮点数学运算符。 控制器被配置为响应于用户输入来选择用于浮点运算的子精度。 控制器被配置为根据所选择的精度来确定比特的子集。 浮点运算符被配置为仅使用位的子集执行浮点运算。 在浮点运算中未使用的多个位可能被迫进入低泄漏状态。 由浮点运算产生的输出值要么被截断,要么舍入到选定的子精度。
    • 5. 发明授权
    • Software selectable adjustment of SIMD parallelism
    • 软件可选择调整SIMD并行性
    • US08799627B2
    • 2014-08-05
    • US13350949
    • 2012-01-16
    • Kenneth Alan Dockser
    • Kenneth Alan Dockser
    • G06F9/00G06F1/00
    • G06F9/30076G06F9/30036G06F9/30083
    • Selective power control of one or more processing elements matches a degree of parallelism to requirements of a task performed in a highly parallel programmable data processor. For example, when program operations require less than the full width of the data path, a software instruction of the program sets a mode of operation requiring a subset of the parallel processing capacity. At least one parallel processing element, that is not needed, can be shut down to conserve power. At a later time, when the added capacity is needed, execution of another software instruction sets the mode of operation to that of the wider data path, typically the full width, and the mode change reactivates the previously shut-down processing element.
    • 一个或多个处理元件的选择性功率控制与高度并行的可编程数据处理器中执行的任务的要求相匹配。 例如,当程序操作需要小于数据路径的全宽时,该程序的软件指令设置需要并行处理能力子集的操作模式。 可以关闭至少一个不需要的并行处理元件以节省功率。 稍后,当需要添加容量时,另一软件指令的执行将操作模式设置为较宽数据路径的操作模式,通常为全宽,并且模式更改重新激活先前关闭的处理元素。
    • 8. 发明申请
    • Processor with a Coprocessor having Early Access to Not-Yet Issued Instructions
    • 具有协处理器的处理器可以及早访问未发布的指令
    • US20120204005A1
    • 2012-08-09
    • US13363541
    • 2012-02-01
    • Kenneth Alan DockserYusuf Cagatay Tekmen
    • Kenneth Alan DockserYusuf Cagatay Tekmen
    • G06F9/312
    • G06F9/3814G06F9/382G06F9/3836G06F9/3859G06F9/3877G06F9/3885
    • Apparatus and methods provide early access of instructions. A fetch queue is coupled to an instruction cache and configured to store a mix of processor instructions for a first processor and coprocessor instructions for a second processor. A coprocessor instruction selector is coupled to the fetch queue and configured to copy coprocessor instructions from the fetch queue. A queue is coupled to the coprocessor instruction selector and from which coprocessor instructions are accessed for execution before the coprocessor instruction is issued to the first processor. Execution of the copied coprocessor instruction is started in the coprocessor before the coprocessor instruction is issued to a processor. The execution of the copied coprocessor instruction is completed based on information received from the processor after the coprocessor instruction has been issued to the processor.
    • 装置和方法提供了指令的早期访问。 获取队列被耦合到指令高速缓存并且被配置为存储用于第一处理器的处理器指令和用于第二处理器的协处理器指令的混合。 协处理器指令选择器被耦合到获取队列并被配置为从提取队列中复制协处理器指令。 在协处理器指令被发送到第一处理器之前,队列被耦合到协处理器指令选择器并且从哪个协处理器指令被访问以执行。 在将协处理器指令发送到处理器之前,在协处理器中启动复制的协处理器指令的执行。 复制的协处理器指令的执行是在协处理器指令发出到处理器之后基于从处理器接收到的信息完成的。
    • 10. 发明申请
    • Method and Apparatus for Executing Processor Instructions Based on a Dynamically Alterable Delay
    • 基于动态可变延迟执行处理器指令的方法和装置
    • US20080046692A1
    • 2008-02-21
    • US11464839
    • 2006-08-16
    • Gerald Paul MichalakKenneth Alan Dockser
    • Gerald Paul MichalakKenneth Alan Dockser
    • G06F9/30
    • G06F9/30145G06F9/3806G06F9/3836G06F9/384G06F9/3842G06F9/3869G06F9/3877
    • Instruction execution delay is alterable after the system design has been finalized, thus enabling the system to dynamically account for various conditions that impact instruction execution. In some embodiments, the dynamic delay is determined by an application to be executed by the processing system. In other embodiments, the dynamic delay is determined by analyzing the history of previously executed instructions. In yet other embodiments, the dynamic delay is determined by assessing the processing resources available to a given application. Regardless, the delay may be dynamically altered on a per-instruction, multiple instruction, or application basis. Processor instruction execution may be controlled by determining a first delay value for a first set of one or more instructions and a second delay value for a second set of one or more instructions. Execution of the sets of instructions is delayed based on the corresponding delay value.
    • 指令执行延迟在系统设计完成后可以改变,从而使系统能够动态地考虑影响指令执行的各种条件。 在一些实施例中,动态延迟由处理系统要执行的应用程序确定。 在其他实施例中,通过分析先前执行的指令的历史来确定动态延迟。 在其他实施例中,通过评估给定应用可用的处理资源来确定动态延迟。 无论如何,延迟可以在每个指令,多个指令或应用程序的基础上动态地改变。 处理器指令执行可以通过确定一个或多个指令的第一组的第一延迟值和第二组一个或多个指令的第二延迟值来控制。 基于相应的延迟值来延迟指令集的执行。