会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 51. 发明申请
    • PARALLEL DATA PROCESSING SYSTEMS AND METHODS USING COOPERATIVE THREAD ARRAYS
    • 并行数据处理系统和使用合作螺纹阵列的方法
    • US20110087860A1
    • 2011-04-14
    • US12972361
    • 2010-12-17
    • John R. NickollsStephen D. Lew
    • John R. NickollsStephen D. Lew
    • G06F15/16
    • G06F9/544G06F9/3851G06F9/3887G06F9/522
    • Parallel data processing systems and methods use cooperative thread arrays (CTAs), i.e., groups of multiple threads that concurrently execute the same program on an input data set to produce an output data set. Each thread in a CTA has a unique identifier (thread ID) that can be assigned at thread launch time. The thread ID controls various aspects of the thread's processing behavior such as the portion of the input data set to be processed by each thread, the portion of an output data set to be produced by each thread, and/or sharing of intermediate results among threads. Mechanisms for loading and launching CTAs in a representative processing core and for synchronizing threads within a CTA are also described.
    • 并行数据处理系统和方法使用协同线程数组(CIA),即在输入数据集上同时执行相同程序的多线程组,以产生输出数据集。 CTA中的每个线程都有一个唯一的标识符(线程ID),可以在线程启动时分配。 线程ID控制线程的处理行为的各个方面,例如由每个线程处理的输入数据集的部分,由每个线程产生的输出数据集的部分和/或线程之间的中间结果的共享 。 还描述了在代表性处理核心中加载和启动CTA并在CTA内同步线程的机制。
    • 52. 发明授权
    • Distributing processing tasks within a processor
    • 在处理器中分配处理任务
    • US07865894B1
    • 2011-01-04
    • US11311997
    • 2005-12-19
    • Bryon S. NordquistJohn R. Nickolls
    • Bryon S. NordquistJohn R. Nickolls
    • G06F9/46
    • G06F9/5044
    • Embodiments of the present invention facilitate distributing processing tasks within a processor. In one embodiment, processing clusters keep track of resource requirements. If sufficient resources are available within a particular processing cluster, the available processing cluster asserts a ready signal to a dispatch unit. The dispatch unit is configured to pass a processing task (such as a cooperative thread array or CTA) to an available processing cluster that asserted a ready signal. In another embodiment, a processing task is passed around a ring of processing clusters until a processing cluster with sufficient resources available accepts the processing task.
    • 本发明的实施例便于在处理器内分发处理任务。 在一个实施例中,处理集群跟踪资源需求。 如果在特定处理集群内有足够的资源可用,则可用的处理集群将向就绪信号发出准备好的信号。 调度单元被配置为将处理任务(诸如协作线程数组或CTA)传递到断言就绪信号的可用处理簇。 在另一个实施例中,处理任务围绕处理集群环传递,直到具有足够资源的处理集群接受处理任务为止。
    • 60. 发明申请
    • Architecture and Instructions for Accessing Multi-Dimensional Formatted Surface Memory
    • 用于访问多维格式化表面存储器的体系结构和说明
    • US20110074802A1
    • 2011-03-31
    • US12890171
    • 2010-09-24
    • John R. NickollsBrian FahsLars NylandJohn Erik LindholmRichard Craig Johnson
    • John R. NickollsBrian FahsLars NylandJohn Erik LindholmRichard Craig Johnson
    • G06F12/00
    • G06T1/60
    • One embodiment of the present invention sets forth a technique for a program to access multi-dimensional formatted graphics surface memory. Multi-dimensional memory objects called “surfaces” stored in a user-specified data or pixel format and arranged in a graphics optimized layout are accessed by programs using surface instructions. A set of memory access instructions e.g., load, store, reduce, and atomic, referred to as surface instructions, may be used to access the surfaces. Coordinate bounds checking is performed with configurable clamping. Caching behavior may also be specified by the surface instructions. Data format conversion and packing to a specified storage format is supported for store, reduction, and atomic surface instructions. Data format conversion and unpacking from a specified storage format is supported for loads and atomic surface instructions.
    • 本发明的一个实施例提出了一种用于访问多维格式化图形表面存储器的程序的技术。 称为“表面”的多维存储器对象以用户指定的数据或像素格式存储并以图形优化的布局布置,由使用表面指令的程序访问。 可以使用一组存储器访问指令,例如加载,存储,减少和原子,称为表面指令,以访问表面。 通过可配置的夹紧进行坐标界限检查。 缓存行为也可以由表面指令指定。 支持存储,缩小和原子表面指令的数据格式转换和打包到指定的存储格式。 负载和原子表面指令支持从指定的存储格式进行数据格式转换和解包。