会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明授权
    • Cooperative thread array reduction and scan operations
    • 合作线程数组减少和扫描操作
    • US08539204B2
    • 2013-09-17
    • US12890227
    • 2010-09-24
    • Brian FahsMing Y. SiuBrett W. CoonJohn R. NickollsLars Nyland
    • Brian FahsMing Y. SiuBrett W. CoonJohn R. NickollsLars Nyland
    • G06F9/30G06F9/40G06F15/00
    • G06F9/522G06F8/458G06F9/3004G06F9/30087G06F9/30145G06F9/3851
    • One embodiment of the present invention sets forth a technique for performing aggregation operations across multiple threads that execute independently. Aggregation is specified as part of a barrier synchronization or barrier arrival instruction, where in addition to performing the barrier synchronization or arrival, the instruction aggregates (using reduction or scan operations) values supplied by each thread. When a thread executes the barrier aggregation instruction the thread contributes to a scan or reduction result, and waits to execute any more instructions until after all of the threads have executed the barrier aggregation instruction. A reduction result is communicated to each thread after all of the threads have executed the barrier aggregation instruction and a scan result is communicated to each thread as the barrier aggregation instruction is executed by the thread.
    • 本发明的一个实施例提出了一种用于跨独立执行的多个线程执行聚合操作的技术。 聚合被指定为屏障同步或屏障到达指令的一部分,其中除了执行屏障同步或到达之外,指令聚合(使用缩减或扫描操作)由每个线程提供的值。 当线程执行屏障聚合指令时,线程有助于扫描或缩小结果,并等待执行任何更多指令,直到所有线程都执行了阻挡聚合指令为止。 在所有线程执行了屏障聚合指令之后,向每个线程传送减少结果,并且当线程执行屏障聚合指令时,将扫描结果传送给每个线程。
    • 7. 发明申请
    • Opcode-Specified Predicatable Warp Post-Synchronization
    • 操作码指定的可预测变形后同步
    • US20110078690A1
    • 2011-03-31
    • US12892887
    • 2010-09-28
    • Brian FahsMing Y. SiuRobert Steven Glanville
    • Brian FahsMing Y. SiuRobert Steven Glanville
    • G06F9/46
    • G06F9/46G06F9/30072G06F9/30087G06F9/30185G06F9/3851G06F9/3887
    • One embodiment of the present invention sets forth a technique for performing a method for synchronizing divergent executing threads. The method includes receiving a plurality of instructions that includes at least one set-synchronization instruction and at least one instruction that includes a synchronization command, and determining an active mask that indicates which threads in a plurality of threads are active and which threads in the plurality of threads are disabled. For each instruction included in the plurality of instructions, the instruction is transmitted to each of the active threads included in the plurality of threads. If the instruction is a set-synchronization instruction, then a synchronization token, the active mask and the synchronization point is each pushed onto a stack. Or, if the instruction is a predicated instruction that includes a synchronization command, then each active thread that executes the predicated instruction is monitored to determine when the active mask has been updated to indicate that each active thread, after executing the predicated instruction, has been disabled.
    • 本发明的一个实施例提出了一种用于执行用于同步发散执行线程的方法的技术。 该方法包括接收包括至少一个集合同步指令和包括同步命令的至少一个指令的多个指令,以及确定指示多个线程中的哪些线程是活动的活动掩码,以及多个线程中的哪些线程 的线程被禁用。 对于包括在多个指令中的每个指令,指令被发送到包括在多个线程中的每个活动线程。 如果指令是设置同步指令,则将同步令牌,活动掩码和同步点分别压入堆栈。 或者,如果指令是包括同步命令的预测指令,则监视执行预测指令的每个活动线程,以确定何时更新活动掩码以指示在执行预定指令之后每个活动线程已被 残疾人士
    • 9. 发明申请
    • Architecture and Instructions for Accessing Multi-Dimensional Formatted Surface Memory
    • 用于访问多维格式化表面存储器的体系结构和说明
    • US20110074802A1
    • 2011-03-31
    • US12890171
    • 2010-09-24
    • John R. NickollsBrian FahsLars NylandJohn Erik LindholmRichard Craig Johnson
    • John R. NickollsBrian FahsLars NylandJohn Erik LindholmRichard Craig Johnson
    • G06F12/00
    • G06T1/60
    • One embodiment of the present invention sets forth a technique for a program to access multi-dimensional formatted graphics surface memory. Multi-dimensional memory objects called “surfaces” stored in a user-specified data or pixel format and arranged in a graphics optimized layout are accessed by programs using surface instructions. A set of memory access instructions e.g., load, store, reduce, and atomic, referred to as surface instructions, may be used to access the surfaces. Coordinate bounds checking is performed with configurable clamping. Caching behavior may also be specified by the surface instructions. Data format conversion and packing to a specified storage format is supported for store, reduction, and atomic surface instructions. Data format conversion and unpacking from a specified storage format is supported for loads and atomic surface instructions.
    • 本发明的一个实施例提出了一种用于访问多维格式化图形表面存储器的程序的技术。 称为“表面”的多维存储器对象以用户指定的数据或像素格式存储并以图形优化的布局布置,由使用表面指令的程序访问。 可以使用一组存储器访问指令,例如加载,存储,减少和原子,称为表面指令,以访问表面。 通过可配置的夹紧进行坐标界限检查。 缓存行为也可以由表面指令指定。 支持存储,缩小和原子表面指令的数据格式转换和打包到指定的存储格式。 负载和原子表面指令支持从指定的存储格式进行数据格式转换和解包。
    • 10. 发明授权
    • Opcode-specified predicatable warp post-synchronization
    • 操作码指定的可预测扭曲后同步
    • US08850436B2
    • 2014-09-30
    • US12892887
    • 2010-09-28
    • Brian FahsMing Y. SiuRobert Steven Glanville
    • Brian FahsMing Y. SiuRobert Steven Glanville
    • G06F9/46G06F9/38G06F9/30
    • G06F9/46G06F9/30072G06F9/30087G06F9/30185G06F9/3851G06F9/3887
    • One embodiment of the present invention sets forth a technique for performing a method for synchronizing divergent executing threads. The method includes receiving a plurality of instructions that includes at least one set-synchronization instruction and at least one instruction that includes a synchronization command, and determining an active mask that indicates which threads in a plurality of threads are active and which threads in the plurality of threads are disabled. For each instruction included in the plurality of instructions, the instruction is transmitted to each of the active threads included in the plurality of threads. If the instruction is a set-synchronization instruction, then a synchronization token, the active mask and the synchronization point is each pushed onto a stack. Or, if the instruction is a predicated instruction that includes a synchronization command, then each active thread that executes the predicated instruction is monitored to determine when the active mask has been updated to indicate that each active thread, after executing the predicated instruction, has been disabled.
    • 本发明的一个实施例提出了一种用于执行用于同步发散执行线程的方法的技术。 该方法包括接收包括至少一个集合同步指令和包括同步命令的至少一个指令的多个指令,以及确定指示多个线程中的哪些线程是活动的活动掩码,以及多个线程中的哪些线程 的线程被禁用。 对于包括在多个指令中的每个指令,指令被发送到包括在多个线程中的每个活动线程。 如果指令是设置同步指令,则将同步令牌,活动掩码和同步点分别压入堆栈。 或者,如果指令是包括同步命令的预测指令,则监视执行预测指令的每个活动线程,以确定何时更新活动掩码以指示在执行预定指令之后每个活动线程已被 残疾人士