会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 15. 发明授权
    • Method for transforming a multithreaded program for general execution
    • 用于转换用于一般执行的多线程程序的方法
    • US09367306B2
    • 2016-06-14
    • US13076258
    • 2011-03-30
    • Jaydeep MaratheVinod Grover
    • Jaydeep MaratheVinod Grover
    • G06F9/46G06F9/44G06F9/52
    • G06F8/72G06F9/522
    • A technique is disclosed for executing a program designed for multi-threaded operation on a general purpose processor. Original source code for the program is transformed from a multi-threaded structure into a computationally equivalent single-threaded structure. A transform operation modifies the original source code to insert code constructs for serial thread execution. The transform operation also replaces synchronization barrier constructs in the original source code with synchronization barrier code that is configured to facilitate serialization. The transformed source code may then be conventionally compiled and advantageously executed on the general purpose processor.
    • 公开了一种用于在通用处理器上执行针对多线程操作设计的程序的技术。 程序的原始源代码从多线程结构转换为计算等效的单线程结构。 转换操作修改原始源代码以插入用于串行线程执行的代码结构。 变换操作还用原始源代码中的同步屏障代码替代配置为便于序列化的同步屏障代码。 然后可以在通用处理器上常规地编译和有利地执行变换的源代码。
    • 16. 发明授权
    • Partitioning CUDA code for execution by a general purpose processor
    • 将CUDA代码分区以供通用处理器执行
    • US08776030B2
    • 2014-07-08
    • US12415075
    • 2009-03-31
    • Vinod GroverBastiaan Joannes Matheus AartsMichael Murphy
    • Vinod GroverBastiaan Joannes Matheus AartsMichael Murphy
    • G06F9/44
    • G06F8/456
    • One embodiment of the present invention sets forth a technique for translating application programs written using a parallel programming model for execution on multi-core graphics processing unit (GPU) for execution by general purpose central processing unit (CPU). Portions of the application program that rely on specific features of the multi-core GPU are converted by a translator for execution by a general purpose CPU. The application program is partitioned into regions of synchronization independent instructions. The instructions are classified as convergent or divergent and divergent memory references that are shared between regions are replicated. Thread loops are inserted to ensure correct sharing of memory between various threads during execution by the general purpose CPU.
    • 本发明的一个实施例提出了一种用于翻译使用并行编程模型编写的应用程序的技术,用于在多核图形处理单元(GPU)上执行以由通用中央处理单元(CPU)执行。 依赖于多核GPU的特定功能的应用程序的部分由翻译器转换,以供通用CPU执行。 应用程序被划分为独立于同步的指令的区域。 指令被分类为在区域之间共享的收敛或发散和不同的存储器引用。 插入线程循环以确保在通用CPU执行期间在不同线程之间正确共享内存。
    • 18. 发明授权
    • Method, apparatus, system and computer program product for initializing
a data structure at its first active use
    • 用于在首次主动使用时初始化数据结构的方法,装置,系统和计算机程序产品
    • US6148302A
    • 2000-11-14
    • US31229
    • 1998-02-26
    • Boris BeylinVinod Grover
    • Boris BeylinVinod Grover
    • G06F9/45G06F9/44G06F9/445G06F7/00G06F12/00
    • G06F9/4428G06F9/445Y10S707/99942Y10S707/99943Y10S707/99944Y10S707/99945
    • Apparatus, methods, systems and computer program products are disclosed that provide an efficient mechanism for invoking a programmed operation at the first active use of the OOP object or data structure. The programmed operation can be used to initialize an object-oriented programming (OOP) object or data structure. The first active use of the data structure or OOP object is detected because the initial access mechanism is constrained to cause a misaligned memory access fault (trap) by attempting a non-byte access-mode memory access to an odd byte address. As the fault is processed, the access mechanism is converted so that the initial and subsequent non-byte access-mode memory accesses will succeed. In addition, the OOP object or data structure is initialized. Then the initial access attempt is repeated on the just initialized OOP object or data structure using the converted access mechanism. The use of the invention improves the performance of computers by reducing the overhead involved with particular computational operations.
    • 公开了装置,方法,系统和计算机程序产品,其提供了在第一主动使用OOP对象或数据结构时调用编程操作的有效机构。 编程操作可用于初始化面向对象编程(OOP)对象或数据结构。 检测到数据结构或OOP对象的第一个主动使用,因为初始访问机制被限制为通过尝试非字节访问模式存储器访问奇数字节地址而导致未对齐的存储器访问故障(陷阱)。 当故障被处理时,转换访问机制,使得初始和随后的非字节访问模式存储器访问将成功。 此外,OOP对象或数据结构已初始化。 然后,使用转换的访问机制,在刚刚初始化的OOP对象或数据结构上重复初始访问尝试。 本发明的使用通过减少特定计算操作涉及的开销来改善计算机的性能。
    • 20. 发明授权
    • Method for compiling a parallel thread execution program for general execution
    • 用于编译并行线程执行程序以进行一般执行的方法
    • US09361079B2
    • 2016-06-07
    • US13361408
    • 2012-01-30
    • Vinod GroverAndrew KerrSean Lee
    • Vinod GroverAndrew KerrSean Lee
    • G06F9/44G06F9/45
    • G06F8/53
    • A technique is disclosed for executing a compiled parallel application on a general purpose processor. The compiled parallel application comprises parallel thread execution code, which includes single-instruction multiple-data (SIMD) constructs, as well as references to intrinsic functions conventionally available in a graphics processing unit. The parallel thread execution code is transformed into an intermediate representation, which includes vector instruction constructs. The SIMD constructs are mapped to vector instructions available within the intermediate representation. Intrinsic functions are mapped to corresponding emulated runtime implementations. The technique advantageously enables parallel applications compiled for execution on a graphics processing unit to be executed on a general purpose central processing unit configured to support vector instructions.
    • 公开了一种用于在通用处理器上执行编译并行应用的技术。 编译并行应用程序包括并行线程执行代码,其包括单指令多数据(SIMD)结构,以及对图形处理单元中常规可用的内在函数的引用。 并行线程执行代码被转换为包含向量指令结构的中间表示。 SIMD构造被映射到中间表示中可用的向量指令。 内在函数映射到相应的仿真运行时实现。 该技术有利地使编译为在图形处理单元上执行的并行应用能够在被配置为支持向量指令的通用中央处理单元上执行。