会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 21. 发明授权
    • Specifying different type generalized event and action pair in a processor
    • 在处理器中指定不同类型的广义事件和动作对
    • US06735690B1
    • 2004-05-11
    • US09598566
    • 2000-06-21
    • Edwin F. BarryPatrick R. MarchandGerald G. PechanekCharles W. Kurak, Jr.
    • Edwin F. BarryPatrick R. MarchandGerald G. PechanekCharles W. Kurak, Jr.
    • G06F1500
    • G06F9/30054G06F9/30101G06F9/30112G06F9/325
    • A processor with a generalized eventpoint architecture, which is scalable for use in a very long instruction word (VLIW) array processor, such as the manifold array (ManArray) processor is described. In one aspect, generalized processor event (p-event) detection facilities are provided by use of compares to check if an instruction address, a data memory address, an instruction, a data value, arithmetic-condition flags, or other processor change of state eventpoint has occurred. In another aspect, generalized processor action (p-action) facilities are provided to cause a change in the program flow by loading the program counter with a new instruction address, generate an interrupt, signal a semaphore, log or count the p-event, time stamp the event, initiate a background operation, or to cause other p-actions to occur. The generalized facilities are defined in the eventpoint architecture as consisting of a control register and three eventpoint parameters, namely at least one register to compare against, a register containing a second compare register, a vector address, or parameter to be passed, and a count or mask register. Based upon this generalized eventpoint architecture, new capabilities are enabled. For example, auto-looping with capabilities to branch out of a nested auto-loop upon detection of a specified condition, background DMA facilities, the ability to link a chain of p-events together for debug purposes, and others are all important capabilities which are readily obtained.
    • 描述了具有广泛事件点架构的处理器,其可扩展以用于非常长的指令字(VLIW)阵列处理器,例如歧管阵列(ManArray)处理器。 在一个方面,通过使用比较来提供广义处理器事件(p事件)检测设施,以检查指令地址,数据存储器地址,指令,数据值,算术条件标志或其他处理器状态变化 事件点已发生。 在另一方面,提供通用处理器动作(p-action)功能以通过用新的指令地址加载程序计数器来产生程序流程的改变,生成中断,信号信号,记录或计数p事件, 事件时间戳,启动后台操作,或导致其他动作发生。 广义设施在事件点架构中被定义为由控制寄存器和三个事件点参数组成,即至少要有一个要比较的寄存器,一个包含第二个比较寄存器的寄存器,一个向量地址或要传递的参数,以及一个计数 或屏蔽寄存器。 基于这种广义的事件点架构,启用了新的功能。 例如,在检测到指定的条件时,自动循环具有分支出嵌套自动循环的功能,后台DMA设施,将p个事件链链接在一起用于调试目的的能力等等都是重要的功能 容易获得。
    • 25. 发明申请
    • Manifold Array Processor
    • 歧管阵列处理器
    • US20130019082A1
    • 2013-01-17
    • US13616942
    • 2012-09-14
    • Gerald G. PechanekCharles W. Kurak, JR.
    • Gerald G. PechanekCharles W. Kurak, JR.
    • G06F15/80
    • G06F15/17381G06F9/30076G06F15/17337G06F15/8023
    • An array processor includes processing elements arranged in to form a rectangular array. Inter-cluster communication paths are mutually exclusive. Due to the mutual exclusivity of the data paths, communications between the processing elements of each cluster may be combined in a single inter-cluster path, thus eliminating half the wiring required for the path. The length of the longest communication path is not directly determined by the overall dimension of the array, as in conventional torus arrays. Rather, the longest communications path is limited by the inter-cluster spacing. Transpose elements of an N×N torus may be combined in clusters and communicate with one another through intra-cluster communications paths. Transpose operation latency is eliminated in this approach. Each PE may have a single transmit port and a single receive port. Thus, the individual PEs are decoupled from the array topology.
    • 阵列处理器包括布置成形成矩形阵列的处理元件。 群集间通信路径是互斥的。 由于数据路径的相互独占性,每个集群的处理元件之间的通信可以组合在单个集群间路径中,从而消除路径所需的一半接线。 最长通信路径的长度不直接取决于阵列的整体尺寸,如在常规环形阵列中。 相反,最长的通信路径受群间间隔的限制。 N×N环面的移位元素可以组合在一起,并通过群内通信路径相互通信。 这种方法消除了转置操作延迟。 每个PE可以具有单个发送端口和单个接收端口。 因此,各个PE与阵列拓扑分离。
    • 29. 发明授权
    • Methods and apparatus for loading a very long instruction word memory
    • 用于加载非常长的指令字存储器的方法和装置
    • US06704857B2
    • 2004-03-09
    • US09747056
    • 2000-12-22
    • Edwin Frank BarryGerald G. Pechanek
    • Edwin Frank BarryGerald G. Pechanek
    • G06F1500
    • G06F9/382G06F9/30149G06F9/3017G06F9/3802G06F9/3853G06F13/28G06F2213/2808
    • The ManArray processor is a scalable indirect VLIW array processor that defines two preferred architectures for indirect VLIW memories. One approach treats the VIM as one composite block of memory using one common address interface to access any VLIW stored in the VIM. The second approach treats the VIM as made up of multiple smaller VIMs each individually associated with the functional units and each individually addressable for loading and reading during XV execution. The VIM memories, contained in each processing element (PE), are accessible by the same type of LV and XV Short Instruction Words (SIWs) as in a single processor instantiation of the indirect VLIW architecture. In the ManArray architecture, the control processor, also called a sequence processor (SP), fetches the instructions from the SIW memory and dispatches them to itself and the PEs. By using the LV instruction, VLIWs can be loaded into VIMs in the SP and the PEs. Since the LV instruction is supplied by the SP through the instruction stream, when VLIWs are being loaded into any VIM no other processing takes place. In addition, as defined in the ManArray architecture, when the SP is processing SIWs, such as control and other sequential code, the PE array is not executing any instructions. Techniques are provided herein to independently load the VIMs concurrent with SIW or iVLIW execution on the SP or on the PEs thereby allowing the load latency to be hidden by the computation.
    • ManArray处理器是可扩展的间接VLIW阵列处理器,它定义了间接VLIW存储器的两种优选架构。 一种方法将VIM视为一个复合的存储器块,使用一个公共地址接口访问存储在VIM中的任何VLIW。 第二种方法将VIM视为由功能单元单独关联的多个较小的VIM组成,并且每个VIM单独可寻址以在XV执行期间进行加载和读取。 包含在每个处理元件(PE)中的VIM存储器可以通过与间接VLIW架构的单处理器实例化中相同类型的LV和XV短指令字(SIW)来访问。 在ManArray架构中,控制处理器(也称为序列处理器(SP))从SIW存储器中获取指令,并将它们分派给自身和PE。 通过使用LV指令,VLIW可以加载到SP和PE中的VIM中。 由于LV指令由SP通过指令流提供,当VLIW被加载到任何VIM中时,不会发生其他处理。 另外,如ManArray架构所定义的,当SP正在处理SIW(例如控制和其他顺序代码)时,PE阵列不执行任何指令。 本文提供了技术来独立地在SP或PE上独立地加载与SIW或iVLIW执行的VIM,从而允许通过计算隐藏负载等待时间。
    • 30. 发明授权
    • Array processor communication architecture with broadcast processor
instructions
    • 具有广播处理器指令的阵列处理器通信架构
    • US5659785A
    • 1997-08-19
    • US386384
    • 1995-02-10
    • Gerald G. PechanekLarry D. LarsenClair John GlossnerStamatis Vassiliaadis
    • Gerald G. PechanekLarry D. LarsenClair John GlossnerStamatis Vassiliaadis
    • G06F15/16G06F15/173G06F15/177G06F15/80
    • G06F15/17381
    • A plurality of processor elements (PEs) are connected in a duster by a common instruction bus to a sequencing control unit with its associated instruction memory. Each PE has data buses connected to at least its four nearest PE neighbors, referred to as its North, South, East and West PE neighbors. Each PE also has a general purpose register file containing several operand registers. A common instruction is fetched from the instruction memory by the sequencing control unit and broadcast over the instruction bus to each PE in the cluster. The instruction includes an upcode value that controls the arithmetic or logical operation performed by an execution unit in the PE on one or more operands in the register file. A switch is included in each PE to interconnect it with a first PE neighbor as the destination to which the result from the execution unit is sent. The broadcast instruction includes a destination field that controls the switch in the PE, to dynamically select the destination neighbor PE to which the result is sent. Further, the broadcast instruction includes a target field that controls the switch in the PE, to dynamically select the operand register in the register file of the PE, to which another result received from another neighbor PE in the cluster is stored. In this manner, the instruction broadcast to all the PEs in the cluster, dynamically controls the communication of operands and results between the PEs in the cluster, in a single instruction, multiple data processor array.
    • 多个处理器元件(PE)通过公共指令总线连接到除尘器中,其具有与其相关联的指令存储器的排序控制单元。 每个PE都有数据总线连接到至少其四个最近的PE邻居,称为其北,南,东和西PE邻居。 每个PE还有一个包含多个操作数寄存器的通用寄存器文件。 通过排序控制单元从指令存储器取出通用指令,并通过指令总线广播到集群中的每个PE。 该指令包括一个上限值,其控制由PE中的执行单元在寄存器文件中的一个或多个操作数上执行的算术或逻辑操作。 每个PE中包括一个交换机,将其与第一个PE邻居进行互连,作为发送执行单元的结果的目的地。 广播指令包括控制PE中的交换机的目的地字段,动态地选择发送结果的目的地邻居PE。 此外,广播指令包括控制PE中的交换机的目标字段,动态地选择PE的寄存器文件中的操作数寄存器,存储从群集中的另一个邻居PE接收到的另一结果。 以这种方式,向集群中的所有PE广播的指令在单个指令,多数据处理器阵列中动态地控制集群中的PE之间的操作数和结果的通信。