会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 61. 发明授权
    • Manifold array processor
    • 歧管阵列处理器
    • US06338129B1
    • 2002-01-08
    • US09323609
    • 1999-06-01
    • Gerald G. PechanekCharles W. Kurak, Jr.
    • Gerald G. PechanekCharles W. Kurak, Jr.
    • G06F1516
    • G06F15/17381G06F9/30076G06F15/17337G06F15/8023
    • An array processor includes processing elements arranged in clusters which are, in turn, combined in a rectangular array. Each cluster is formed of processing elements which preferably communicate with the processing elements of at least two other clusters. Additionally each inter-cluster communication path is mutually exclusive, that is, each path carries either north and west, south and east, north and east, or south and west communications. Due to the mutual exclusivity of the data paths, communications between the processing elements of each cluster may be combined in a single inter-cluster path. That is, communications from a cluster which communicates to the north and east with another cluster may be combined in one path, thus eliminating half the wiring required for the path. Additionally, the length of the longest communication path is not directly determined by the overall dimension of the array, as it is in conventional torus arrays. Rather, the longest communications path is limited only by the inter-cluster spacing. In one implementation, transpose elements of an N×N torus are combined in clusters and communicate with one another through intra-cluster communications paths. Since transpose elements have direct connections to one another, transpose operation latency is eliminated in this approach. Additionally, each PE may have a single transmit port and a single receive port. As a result, the individual PEs are decoupled from the topology of the array.
    • 阵列处理器包括按簇排列的处理元件,它们依次以矩形阵列组合。 每个簇由优选地与至少两个其他簇的处理元件通信的处理元件形成。 另外每个集群间的通信路径是相互排斥的,也就是说,每条路径都有北西,南,东,北,东,或南,西通信。 由于数据路径的相互独占性,每个集群的处理元件之间的通信可以组合在单个集群间路径中。 也就是说,来自与北部和东部与另一个群集通信的群集的通信可以组合在一个路径中,从而消除路径所需的一半布线。 此外,最长通信路径的长度不是直接由阵列的整体尺寸决定,就像在传统的环面阵列中一样。 相反,最长的通信路径仅受群间间隔限制。 在一个实现中,将NxN环面的转置元素组合在一起并通过集群内通信路径相互通信。 由于转置元素具有彼此的直接连接,因此在此方法中消除了转置操作延迟。 另外,每个PE可以具有单个发送端口和单个接收端口。 因此,各个PE与阵列的拓扑结构分离。
    • 62. 发明授权
    • Merged array controller and processing element
    • 合并阵列控制器和处理元件
    • US06219776B1
    • 2001-04-17
    • US09169072
    • 1998-10-09
    • Gerald G. PechanekJuan G. Revilla
    • Gerald G. PechanekJuan G. Revilla
    • G06F1300
    • G06F9/3885G06F9/3012G06F9/30145G06F9/30189G06F9/3887G06F15/17343G06F15/8007
    • A highly parallel data processing system includes an array of n processing elements (PEs) and a controller sequence processor (SP) wherein at least one PE is combined with the controller SP to create a Dynamic Merged Processor (DP) which supports two modes of operation. In its first mode of operation, the DP acts as one of the PEs in the array and participates in the execution of single-instruction-multiple-data (SIMD) instructions. In the second mode of operation, the DP acts as the controlling element for the array of PEs and executes non-array instructions. To support these two modes of operation, the DP includes a plurality of execution units and two general-purpose register files. The execution units are “shared” in that they can execute instructions in either mode of operation. With very long instruction word (VLIW) capability, both modes of operation can be in effect on a cycle by cycle basis for every VLIW executed. This structure allows the controlling element in a highly parallel SIMD processor to be reused as one of the processing elements in the array to reduce the overall number of transistors and wires in the SIMD processor while maintaining its capabilities and performance.
    • 高度并行的数据处理系统包括n个处理元件(PE)和控制器序列处理器(SP)的阵列,其中至少一个PE与控制器SP组合以创建支持两种操作模式的动态合并处理器(DP) 。 在其第一种操作模式中,DP充当阵列中的PE之一,并参与执行单指令多数据(SIMD)指令。 在第二种操作模式中,DP充当PE阵列的控制元件,并执行非阵列指令。 为了支持这两种操作模式,DP包括多个执行单元和两个通用寄存器文件。 执行单元是“共享的”,因为它们可以在任一操作模式下执行指令。 具有非常长的指令字(VLIW)能力,两种操作模式可以在执行的每个VLIW的基础上逐周期生效。 这种结构允许高度并行的SIMD处理器中的控制元件被重新用作阵列中的处理元件之一,以在保持其能力和性能的同时减少SIMD处理器中的晶体管和导线的总数。
    • 63. 发明授权
    • Methods and apparatus for dynamic very long instruction word sub-instruction selection for execution time parallelism in an indirect very long instruction word processor
    • 用于动态超长指令字子指令选择的方法和装置,用于间接非常长的指令字处理器中的执行时间并行性
    • US06173389B2
    • 2001-01-09
    • US09205588
    • 1998-12-04
    • Gerald G. PechanekJuan Guillermo RevillaEdwin F. Barry
    • Gerald G. PechanekJuan Guillermo RevillaEdwin F. Barry
    • G06F1580
    • G06F9/3842G06F9/3017G06F9/3822G06F9/3853
    • A pipelined data processing unit includes an instruction sequencer and n functional units capable of executing n operations in parallel. The instruction sequencer includes a random access memory for storing very-long-instruction-words (VLIWs) used in operations involving the execution of two or more functional units in parallel. Each VLIW comprises a plurality of short-instruction-words (SIWs) where each SIW corresponds to a unique type of instruction associated with a unique functional unit. VLIWs are composed in the VLIW memory by loading and concatenating SIWs in each address, or entry. VLIWs are executed via the execute-VLIW (XV) instruction. The iVLIWs can be compressed at a VLIW memory address by use of a mask field contained within the XV1 instruction which specifics which functional units are enabled, or disabled, during the execution of the VLIW. The mask can be changed each time the XV1 instruction is executed, effectively modifying the VLIW every time it is executed. The VLIW memory (VIM) can be further partitioned into separate memories each associated with a function decode-and-execute unit. With a second execute VLIW instruction XV2, each functional unit's VIM can be independently addressed thereby removing duplicate SIWs within the functional unit's VIM. This provides a further optimization of the VLIW storage thereby allowing the use of smaller VLIW memories in cost sensitive applications.
    • 流水线数据处理单元包括指令定序器和能够并行执行n个操作的n个功能单元。 指令定序器包括用于存储在涉及并行执行两个或多个功能单元的操作中使用的非常长指令字(VLIW)的随机存取存储器。 每个VLIW包括多个短指令字(SIW),其中每个SIW对应于与唯一功能单元相关联的唯一类型的指令。 VLIW通过在每个地址或条目中加载和连接SIW来组成VLIW存储器。 VLIW通过执行VLIW(XV)指令执行。 通过使用包含在XV1指令中的掩码字段,可以在VLIW存储器地址处压缩iVLIW,该掩码字段指定在执行VLIW期间启用或禁用哪些功能单元。 每次执行XV1指令时,可以更改掩码,每次执行时都可以有效地修改VLIW。 VLIW存储器(VIM)可以被进一步划分成各自与功能解码和执行单元相关联的存储器。 通过第二执行VLIW指令XV2,可以独立地对每个功能单元的VIM进行寻址,从而去除功能单元的VIM内的重复SIW。 这提供了VLIW存储器的进一步优化,从而允许在成本敏感的应用中使用较小的VLIW存储器。
    • 64. 发明授权
    • Manifold array processor
    • US6023753A
    • 2000-02-08
    • US885310
    • 1997-06-30
    • Gerald G. PechanekCharles W. Kurak, Jr.
    • Gerald G. PechanekCharles W. Kurak, Jr.
    • G06F15/173G06F15/80G06F15/00
    • G06F15/17381G06F15/17337G06F15/8023G06F9/30076
    • An array processor includes processing elements arranged in clusters which are, in turn, combined in a rectangular array. Each cluster is formed of processing elements which preferably communicate with the processing elements of at least two other clusters. Additionally each inter-cluster communication path is mutually exclusive, that is, each path carries either north and west, south and east, north and east, or south and west communications. Due to the mutual exclusivity of the data paths, communications between the processing elements of each cluster may be combined in a single inter-cluster path. That is, communications from a cluster which communicates to the north and east with another cluster may be combined in one path, thus eliminating half the wiring required for the path. Additionally, the length of the longest communication path is not directly determined by the overall dimension of the array, as it is in conventional torus arrays. Rather, the longest communications path is limited only by the inter-cluster spacing. In one implementation, transpose elements of an N.times.N torus are combined in clusters and communicate with one another through intra-cluster communications paths. Since transpose elements have direct connections to one another, transpose operation latency is eliminated in this approach. Additionally, each PE may have a single transmit port and a single receive port. As a result, the individual PEs are decoupled from the topology of the array.
    • 70. 发明授权
    • Merged control/process element processor for executing VLIW simplex instructions with SISD control/SIMD process mode bit
    • 用于使用SISD控制/ SIMD过程模式位执行VLIW单工指令的合并控制/处理元件处理器
    • US06874078B2
    • 2005-03-29
    • US10620144
    • 2003-07-15
    • Gerald G. PechanekJuan G. Revilla
    • Gerald G. PechanekJuan G. Revilla
    • G06F15/16G06F9/30G06F9/318G06F15/173G06F15/80G06F9/40
    • G06F9/3885G06F9/3012G06F9/30145G06F9/30189G06F9/3887G06F15/17343G06F15/8007
    • A highly parallel data processing system includes an array of n processing elements (PEs) and a controller sequence processor (SP) wherein at least one PE is combined with the controller SP to create a Dynamic Merged Processor (DP) which supports two modes of operation. In its first mode of operation, the DP acts as one of the PEs in the array and participates in the execution of single-instruction-multiple-data (SIMD) instructions. In the second mode of operation, the DP acts as the controlling element for the array of PEs and executes non-array instructions. To support these two modes of operation, the DP includes a plurality of execution units and two general-purpose register files. The execution units are “shared” in that they can execute instructions in either mode of operation. With very long instruction word (VLIW) capability, both modes of operation can be in effect on a cycle by cycle basis for every VLIW executed. This structure allows the controlling element in a highly parallel SIMD processor to be reused as one of the processing elements in the array to reduce the overall number of transistors and wires in the SIMD processor while maintaining its capabilities and performance.
    • 高度并行的数据处理系统包括n个处理元件(PE)和控制器序列处理器(SP)的阵列,其中至少一个PE与控制器SP组合以创建支持两种操作模式的动态合并处理器(DP) 。 在其第一种操作模式中,DP充当阵列中的PE之一,并参与执行单指令多数据(SIMD)指令。 在第二种操作模式中,DP充当PE阵列的控制元件,并执行非阵列指令。 为了支持这两种操作模式,DP包括多个执行单元和两个通用寄存器文件。 执行单元是“共享的”,因为它们可以在任一操作模式下执行指令。 具有非常长的指令字(VLIW)能力,两种操作模式可以在执行的每个VLIW的基础上逐周期生效。 这种结构允许高度并行的SIMD处理器中的控制元件被重新用作阵列中的处理元件之一,以在保持其能力和性能的同时减少SIMD处理器中的晶体管和导线的总数。