会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • PROGRAMMABLE STREAMING PROCESSOR WITH MIXED PRECISION INSTRUCTION EXECUTION
    • 具有混合精度指令执行的可编程流水处理器
    • WO2009132013A1
    • 2009-10-29
    • PCT/US2009/041268
    • 2009-04-21
    • QUALCOMM INCORPORATEDDU, YunYU, ChunJIAO, GuofangMOLLOY, Stephen
    • DU, YunYU, ChunJIAO, GuofangMOLLOY, Stephen
    • G06T15/00G06F1/32
    • G06T15/005G06F8/47
    • The disclosure relates to a programmable streaming processor that is capable of executing mixed-precision (e.g., full-precision, half-precision) instructions using different execution units. The various execution units are each capable of using graphics data to execute instructions at a particular precision level. An exemplary programmable shader processor includes a controller and multiple execution units. The controller is configured to receive an instruction for execution and to receive an indication of a data precision for execution of the instruction. The controller is also configured to receive a separate conversion instruction that, when executed, converts graphics data associated with the instruction to the indicated data precision. When operable, the controller selects one of the execution units based on the indicated data precision. The controller then causes the selected execution unit to execute the instruction with the indicated data precision using the graphics data associated with the instruction.
    • 本公开涉及一种能够使用不同的执行单元执行混合精度(例如,全精度,半精度)指令的可编程流处理器。 各种执行单元都能够使用图形数据来执行特定精度级别的指令。 示例性可编程着色器处理器包括控制器和多个执行单元。 控制器被配置为接收用于执行的指令并且接收用于执行指令的数据精度的指示。 控制器还被配置为接收单独的转换指令,该指令在执行时将与指令相关联的图形数据转换为所指示的数据精度。 当可操作时,控制器基于指示的数据精度选择一个执行单元。 然后,控制器使所选择的执行单元使用与指令相关联的图形数据,以指示的数据精度执行指令。
    • 3. 发明申请
    • GRAPHICS PROCESSING UNIT WITH UNIFIED VERTEX CACHE AND SHADER REGISTER FILE
    • 具有统一VERTEX CACHE和SHADER寄存器文件的图形处理单元
    • WO2008039950A1
    • 2008-04-03
    • PCT/US2007/079784
    • 2007-09-27
    • QUALCOMM IncorporatedJIAO, GuofangYU, ChunDU, Yun
    • JIAO, GuofangYU, ChunDU, Yun
    • G06T15/00
    • G06T15/005
    • Techniques are described for processing computerized images with a graphics processing unit (GPU) using a unified vertex cache and shader register file. The techniques include creating a shared shader coupled to the GPU pipeline and a unified vertex cache and shader register file coupled to the shared shader to substantially eliminate data movement within the GPU pipeline. The GPU pipeline sends image geometry information based on an image geometry for an image to the shared shader. The shared shader performs vertex shading to generate vertex coordinates and attributes of vertices in the image. The shared shader then stores the vertex attributes in the unified vertex cache and shader register file, and sends only the vertex coordinates of the vertices back to the GPU pipeline. The GPU pipeline processes the image based on the vertex coordinates, and the shared shader processes the image based on the vertex attributes.
    • 描述了使用统一的顶点高速缓存和着色器寄存器文件处理具有图形处理单元(GPU)的计算机化图像的技术。 这些技术包括创建耦合到GPU流水线的共享着色器和耦合到共享着色器的统一顶点高速缓存和着色器寄存器文件,以基本上消除GPU流水线内的数据移动。 GPU管道将基于图像的图像几何的图像几何信息发送到共享着色器。 共享着色器执行顶点着色以生成图像中顶点坐标和顶点属性。 共享着色器然后将顶点属性存储在统一的顶点缓存和着色器寄存器文件中,并且仅将顶点的顶点坐标发送回GPU管道。 GPU流水线基于顶点坐标处理图像,共享着色器基于顶点属性处理图像。
    • 4. 发明申请
    • UNIFIED VIRTUAL ADDRESSED REGISTER FILE
    • 统一的虚拟寻址寄存器文件
    • WO2007149979A2
    • 2007-12-27
    • PCT/US2007/071775
    • 2007-06-21
    • QUALCOMM INCORPORATEDDU, YunYU, ChunHSU, De,D.JIAO, Golf
    • DU, YunYU, ChunHSU, De,D.JIAO, Golf
    • G06F9/3851G06F9/3012G06F9/30123G06F9/30138G06F9/384G06T15/005
    • A multi-threaded processor is provided, such as a shader processor, having an internal unified memory space that is shared by a plurality of threads and is dynamically assigned to threads as needed. A mapping table that maps virtual registers to available internal addresses in the unified memory space so that thread registers can be stored in contiguous or non-contiguous memory addresses. Dynamic sizing of the virtual registers allows flexible allocation of the unified memory space depending on the type and size of data in a thread register. Yet another feature provides an efficient method for storing graphics data in the unified memory space to improve fetch and store operations from the memory space. In particular, pixel data for four pixels in a thread are stored across four memory devices having independent input/output ports that permit the four pixels to be read in a single clock cycle for processing.
    • 提供了多线程处理器,例如着色器处理器,具有由多个线程共享的内部统一存储器空间,并且根据需要动态分配给线程。 映射表将虚拟寄存器映射到统一存储空间中的可用内部地址,以便线程寄存器可以存储在连续或不连续的存储器地址中。 虚拟寄存器的动态大小允许根据线程寄存器中数据的类型和大小灵活分配统一存储空间。 另一个特征提供了用于将统计存储器空间中的图形数据存储以改善从存储器空间获取和存储操作的有效方法。 特别地,线程中的四个像素的像素数据被存储在具有独立输入/输出端口的四个存储器件中,这些存储器件允许以单个时钟周期读取四个像素进行处理。
    • 5. 发明申请
    • GRAPHICS PROCESSOR WITH ARITHMETIC AND ELEMENTARY FUNCTION UNITS
    • 具有算术和元素功能单元的图形处理器
    • WO2007140338A2
    • 2007-12-06
    • PCT/US2007/069803
    • 2007-05-25
    • QUALCOMM IncorporatedBOURD, Alexei, V.DU, YunYU, ChunJIAO, Guofang
    • BOURD, Alexei, V.DU, YunYU, ChunJIAO, Guofang
    • G06F9/38
    • G06T1/20G06F9/30167G06F9/383G06F9/3851G06F9/3885
    • A graphics processor capable of efficiently performing arithmetic operations and computing elementary functions is described. The graphics processor has at least one arithmetic logic unit (ALU) that can perform arithmetic operations and at least one elementary function unit that can compute elementary functions. The ALU(s) and elementary function unit(s) may be arranged such that they can operate in parallel to improve throughput. The graphics processor may also include fewer elementary function units than ALUs, e.g., four ALUs and a single elementary function unit. The four ALUs may perform an arithmetic operation on (1) four components of an attribute for one pixel or (2) one component of an attribute for four pixels. The single elementary function unit may operate on one component of one pixel at a time. The use of a single elementary function unit may reduce cost while still providing good performance.
    • 描述能够有效执行算术运算和计算基本功能的图形处理器。 图形处理器具有至少一个可执行算术运算的算术逻辑单元(ALU)和至少一个可以计算基本功能的基本功能单元。 ALU和基本功能单元可以被布置成使得它们可以并行操作以提高吞吐量。 图形处理器还可以包括比ALU更少的基本功能单元,例如四个ALU和单个基本功能单元。 四个ALU可以对(1)四个像素的属性的四个分量或(2)四个像素的属性的一个分量执行算术运算。 单个基本功能单元可以一次操作一个像素的一个分量。 使用单个基本功能单元可以降低成本,同时仍然提供良好的性能。
    • 8. 发明申请
    • MEMORY MANAGEMENT USING DYNAMICALLY ALLOCATED DIRTY MASK SPACE
    • 使用动态分配的真皮掩蔽空间进行记忆管理
    • WO2014085002A1
    • 2014-06-05
    • PCT/US2013/067111
    • 2013-10-28
    • QUALCOMM INCORPORATED
    • LIANG, JianYU, ChunXU, Fei
    • G06F12/08
    • G06F12/0891G06F12/0804G06F12/0886G06F12/0895G06F2212/604
    • Systems and methods related to a memory system including a cache memory are disclosed. The cache memory system includes a cache memory including a plurality of cache memory lines and a dirty buffer including a plurality of dirty masks. A cache controller is configured to allocate one of the dirty masks to each of the cache memory lines when a write to the respective cache memory line is not a full write to that cache memory line. Each of the dirty masks indicates dirty states of data units in one of the cache memory lines. The cache controller stores an identification (ID) information that associates the dirty masks with the cache memory lines to which the dirty masks are allocated.
    • 公开了与包括高速缓冲存储器的存储器系统有关的系统和方法。 高速缓冲存储器系统包括包括多个高速缓存存储器线的高速缓存存储器和包括多个脏掩模的脏缓冲器。 高速缓存控制器被配置为当对相应高速缓存存储器线的写入不是对该高速缓存存储器线的完全写入时,将一个脏掩模分配给每个高速缓存存储器线。 每个脏屏蔽指示一个缓存存储器线中的数据单元的脏状态。 高速缓存控制器存储将脏屏蔽与分配有脏屏蔽的高速缓冲存储器线相关联的标识(ID)信息。