会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明申请
    • EFFICIENT IMPLEMENTATION OF RSA USING GPU/CPU ARCHITECTURE
    • 使用GPU / CPU架构的RSA的有效实现
    • US20130297919A1
    • 2013-11-07
    • US13997239
    • 2011-11-30
    • Xiaozhu KangBiju GeorgeKen Lueh
    • Xiaozhu KangBiju GeorgeKen Lueh
    • G06F9/38
    • G06F9/38G06F8/452G06F9/30G06F21/00
    • Various embodiments are directed to a heterogeneous processor architecture comprised of a CPU and a GPU on the same processor die. The heterogeneous processor architecture may optimize source code in a GPU compiler using vector strip mining to reduce instructions of arbitrary vector lengths into GPU supported vector lengths and loop peeling. It may be first determined that the source code is eligible for optimization if more than one machine code instruction of compiled source code under-utilizes GPU instruction bandwidth limitations. The initial vector strip mining results may be discarded and the first iteration of the inner loop body may be peeled out of the loop. The type of operands in the source code may be lowered and the peeled out inner loop body of source code may be vector strip mined again to obtain optimized source code.
    • 各种实施例涉及由同一处理器管芯上的CPU和GPU组成的异构处理器架构。 异构处理器架构可以使用向量带挖掘来优化GPU编译器中的源代码,以将任意矢量长度的指令减少到GPU支持的矢量长度和循环剥离。 如果编译源代码的多个机器码指令利用了GPU指令带宽限制,则可以首先确定源代码是否符合优化条件。 可以丢弃初始矢量条带挖掘结果,并且内环体的第一次迭代可能被剥离出环路。 可以降低源代码中的操作数类型,并且可以再次剥离源代码的剥离内圈体,以获得优化的源代码。
    • 5. 发明授权
    • Efficient implementation of RSA using GPU/CPU architecture
    • 使用GPU / CPU架构高效实现RSA
    • US09262166B2
    • 2016-02-16
    • US13997239
    • 2011-11-30
    • Xiaozhu KangBiju GeorgeKen Lueh
    • Xiaozhu KangBiju GeorgeKen Lueh
    • G06F9/45G06F9/38G06F9/30G06F21/00
    • G06F9/38G06F8/452G06F9/30G06F21/00
    • Various embodiments are directed to a heterogeneous processor architecture comprised of a CPU and a GPU on the same processor die. The heterogeneous processor architecture may optimize source code in a GPU compiler using vector strip mining to reduce instructions of arbitrary vector lengths into GPU supported vector lengths and loop peeling. It may be first determined that the source code is eligible for optimization if more than one machine code instruction of compiled source code under-utilizes GPU instruction bandwidth limitations. The initial vector strip mining results may be discarded and the first iteration of the inner loop body may be peeled out of the loop. The type of operands in the source code may be lowered and the peeled out inner loop body of source code may be vector strip mined again to obtain optimized source code.
    • 各种实施例涉及由同一处理器管芯上的CPU和GPU组成的异构处理器架构。 异构处理器架构可以使用向量带挖掘来优化GPU编译器中的源代码,以将任意矢量长度的指令减少到GPU支持的矢量长度和循环剥离。 如果编译源代码的多个机器码指令利用了GPU指令带宽限制,则可以首先确定源代码是否符合优化条件。 可以丢弃初始矢量条带挖掘结果,并且内环体的第一次迭代可能被剥离出环路。 可以降低源代码中的操作数类型,并且可以再次剥离源代码的剥离内圈体,以获得优化的源代码。