专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明授权

US09595074B2 Multistage collector for outputs in multiprocessor systems 有权
标题翻译：多处理器系统中的输出多级收集器
公开(公告)号：US09595074B2
公开(公告)日：2017-03-14
申请号：US13611325
申请日：2012-09-12
申请人： James Alexander McCombe , Steven John Clohset , Jason Rupert Redgrave , Luke Tilman Peterson
发明人： James Alexander McCombe , Steven John Clohset , Jason Rupert Redgrave , Luke Tilman Peterson
IPC分类号： G06F15/80 , G06T1/20 , G06T15/06
CPC分类号： G06T1/60 , G06T1/20 , G06T15/06 , G06T2210/52
摘要： Aspects include a multistage collector to receive outputs from plural processing elements. Processing elements may comprise (each or collectively) a plurality of clusters, with one or more ALUs that may perform SIMD operations on a data vector and produce outputs according to the instruction stream being used to configure the ALU(s). The multistage collector includes substituent components each with at least one input queue, a memory, a packing unit, and an output queue; these components can be sized to process groups of input elements of a given size, and can have multiple input queues and a single output queue. Some components couple to receive outputs from the ALUs and others receive outputs from other components. Ultimately, the multistage collector can output groupings of input elements. Each grouping of elements (e.g., at input queues, or stored in the memories of component) can be formed based on matching of index elements.
摘要翻译：方面包括用于接收来自多个处理元件的输出的多级收集器。处理元件可以包括（每个或集体）多个聚类，其中一个或多个ALU可以对数据向量执行SIMD操作，并根据用于配置ALU的指令流产生输出。多级收集器包括各自具有至少一个输入队列，存储器，打包单元和输出队列的取代组件; 这些组件的大小可以处理给定大小的输入元素组，并且可以具有多个输入队列和单个输出队列。一些组件耦合以接收来自ALU和其他组件的输出，从其他组件接收输出。最终，多级收集器可以输出输入元素的分组。可以基于索引元素的匹配来形成每个元素组（例如，在输入队列处或存储在组件的存储器中）。

2. 发明授权

US09478062B2 Memory allocation in distributed memories for multiprocessing 有权
标题翻译：用于多处理的分布式存储器中的内存分配
公开(公告)号：US09478062B2
公开(公告)日：2016-10-25
申请号：US13368616
申请日：2012-02-08
申请人： Stephen John Clohset , James Alexander McCombe , Luke Tilman Peterson
发明人： Stephen John Clohset , James Alexander McCombe , Luke Tilman Peterson
IPC分类号： G06T15/06 , G06T15/00
CPC分类号： G06F9/4881 , G06F9/5016 , G06T15/005 , G06T15/06 , G06T2200/28 , G06T2210/52
摘要： In some aspects, finer grained parallelism is achieved by segmenting programmatic workloads into smaller discretized portions, where a first element can be indicative both of a configuration or program to be executed, and a first data set to be used in such execution, while a second element can be indicative of a second data element or group. The discretized portions can cause program execute on distributed processors. Approaches to selecting processors, and allocating local memory associated with those processors are disclosed. In one example, discretized portions that share a program have an anti-affinity to cause dispersion, for initial execution assignment. Flags, such as programmer and compiler generated flags can be used in determining such allocations. Workloads can be grouped according to compatibility of memory usage requirements.
摘要翻译：在一些方面，通过将编程工作负载分割成更小的离散部分来实现更精细的粒度并行性，其中第一元素可指示要执行的配置或程序以及要在其中执行的第一数据集，元素可以指示第二数据元素或组。离散化部分可以使分布式处理器上的程序执行。公开了选择处理器以及分配与这些处理器相关联的本地存储器的方法。在一个示例中，共享程序的离散化部分具有反关联性以引起分散，用于初始执行分配。诸如编程器和编译器生成的标志之类的标志可用于确定这样的分配。可以根据内存使用要求的兼容性对工作负载进行分组。

3. 发明授权

US08736610B2 Systems and methods for rendering with ray tracing 有权
标题翻译：用光线跟踪渲染的系统和方法
公开(公告)号：US08736610B2
公开(公告)日：2014-05-27
申请号：US13567076
申请日：2012-08-05
申请人： James Alexander McCombe , Luke Tilman Peterson , Ryan R. Salsbury
发明人： James Alexander McCombe , Luke Tilman Peterson , Ryan R. Salsbury
IPC分类号： G06T15/10 , G06T15/20
CPC分类号： G06T15/06 , G06T2210/52
摘要： For ray tracing scenes composed of primitives, systems and methods-accelerate ray/primitive intersection identification by testing rays against elements of geometry acceleration data (GAD) in a parallelized intersection testing resource. Groups of rays can be described as shared attribute information and individual ray data for ray data transfer. A host hosts shading and/or management processes can control the testing resource and adapting the ray tracing. The GAD elements can be arranged in a graph, and rays collected into collections based on whether a ray intersects a given element. When a collection is deemed ready for further testing, it is tested for intersection with GAD elements connected, in the graph, to the given element. The graph can be hierarchical such that rays of a given collection are tested against children of the GAD element associated with the given collection.
摘要翻译：对于由原始图像，系统和方法组成的光线跟踪场景，通过对并行交叉测试资源中的几何加速度数据（GAD）的元素进行测试，加速射线/原始交点识别。射线组可以描述为共享属性信息和用于射线数据传输的单独射线数据。主机主机着色和/或管理过程可以控制测试资源并适应光线跟踪。 GAD元素可以排列在图形中，并且基于光线是否与给定元素相交，将光线收集到集合中。当集合被认为准备进行进一步测试时，它将与图中连接的GAD元素的交点进行测试。该图可以是分级的，使得给定集合的射线针对与给定集合相关联的GAD元素的子对象进行测试。

4. 发明申请

US20130069960A1 MULTISTAGE COLLECTOR FOR OUTPUTS IN MULTIPROCESSOR SYSTEMS 有权
标题翻译：多处理器系统中的输出多级收集器
公开(公告)号：US20130069960A1
公开(公告)日：2013-03-21
申请号：US13611325
申请日：2012-09-12
申请人： James Alexander McCombe , Steven John Clohset , Jason Rupert Redgrave , Luke Tilman Peterson
发明人： James Alexander McCombe , Steven John Clohset , Jason Rupert Redgrave , Luke Tilman Peterson
IPC分类号： G06T1/20
CPC分类号： G06T1/60 , G06T1/20 , G06T15/06 , G06T2210/52
摘要： Aspects include a multistage collector to receive outputs from plural processing elements. Processing elements may comprise (each or collectively) a plurality of clusters, with one or more ALUs that may perform SIMD operations on a data vector and produce outputs according to the instruction stream being used to configure the ALU(s). The multistage collector includes substituent components each with at least one input queue, a memory, a packing unit, and an output queue; these components can be sized to process groups of input elements of a given size, and can have multiple input queues and a single output queue. Some components couple to receive outputs from the ALUs and others receive outputs from other components. Ultimately, the multistage collector can output groupings of input elements. Each grouping of elements (e.g., at input queues, or stored in the memories of component) can be formed based on matching of index elements.
摘要翻译：方面包括用于接收来自多个处理元件的输出的多级收集器。处理元件可以包括（每个或集体）多个聚类，其中一个或多个ALU可以对数据向量执行SIMD操作，并根据用于配置ALU的指令流产生输出。多级收集器包括各自具有至少一个输入队列，存储器，打包单元和输出队列的取代组件; 这些组件的大小可以处理给定大小的输入元素组，并且可以具有多个输入队列和单个输出队列。一些组件耦合以接收来自ALU和其他组件的输出，从其他组件接收输出。最终，多级收集器可以输出输入元素的分组。可以基于索引元素的匹配来形成每个元素组（例如，在输入队列处或存储在组件的存储器中）。

5. 发明申请

US20100332523A1 SYSTEMS AND METHODS FOR PHOTON MAP QUERYING 有权
标题翻译：光电地图查询的系统和方法
公开(公告)号：US20100332523A1
公开(公告)日：2010-12-30
申请号：US12825728
申请日：2010-06-29
申请人： Luke Tilman Peterson , James Alexander McCombe
发明人： Luke Tilman Peterson , James Alexander McCombe
IPC分类号： G06F17/30
CPC分类号： G06F17/30533 , G06F17/30592
摘要： In one aspect, photon queries are answered using systems and methods of traversal of collections of photon queries through an acceleration structure, to identify photons meeting a specification of a given query. Such systems and methods can be extended to satisfying similarity queries in an n-dimensional parameter space. Queries can be associated with code (or pointers to code) that are run to achieve closure of that query. Queries can cause further queries to be emitted. Arbitrary data can be passed from one query to another; for example, parameters defined internally to the code modules themselves (e.g., the parameters do not need to have a definition or meaning to the systems or within the methods).
摘要翻译：在一个方面，使用通过加速结构遍历光子查询的集合的系统和方法回答光子查询，以识别符合给定查询的规范的光子。这样的系统和方法可以扩展到在n维参数空间中满足相似性查询。查询可以与运行以实现该查询关闭的代码（或代码指针）相关联。查询可能会导致进一步的查询被排除。任意数据可以从一个查询传递给另一个查询; 例如，在代码模块本身内部定义的参数（例如，参数不需要具有对系统的定义或含义或在方法内）。

6. 发明申请

US20090096789A1 METHOD, APPARATUS, AND COMPUTER READABLE MEDIUM FOR LIGHT ENERGY ACCOUNTING IN RAY TRACING 有权
标题翻译：方法，装备和计算机可读性介质，用于光线跟踪中的能源会计
公开(公告)号：US20090096789A1
公开(公告)日：2009-04-16
申请号：US11872593
申请日：2007-10-15
申请人： Luke Tilman Peterson , James Alexander McCombe , Ryan R. Salsbury
发明人： Luke Tilman Peterson , James Alexander McCombe , Ryan R. Salsbury
IPC分类号： G06T15/50
CPC分类号： G06T15/06
摘要： For ray tracing systems, described methods, media, apparatuses provide for accounting of light energy that will be collected at pixels of a 2-D representation without recursive closure of a tree of ray/primitive intersections, and also provide for adaptivity in ray tracing based on importance indicators of each ray, such as a weight, which may be carried in data structures representative of the rays. Examples of such adaptivity may include determining a number of children to issue for shading an identified intersecting primitive, culling rays, and adding rays to achieve more accurate sampling, if desired. All such adaptivity may be triggered with goal-based indicators, such as a threshold value representative of rendering progress to a time-based goal, such as a frame rate.
摘要翻译：对于光线跟踪系统，所描述的方法，介质，设备提供将会在2-D表示的像素处收集的光能的记录，而不是光线/原始相交的树的递归闭合，并且还提供基于光线跟踪的适应性关于每个射线的重要性指标，例如重量，其可以在代表射线的数据结构中携带。这种适应性的示例可以包括确定若干儿童发出遮蔽所识别的相交原始图像，剔除光线和添加光线以实现更准确的采样，如果需要的话。可以利用基于目标的指示符来触发所有这样的自适应，例如代表将进度渲染到基于时间的目标（例如帧速率）的阈值。

7. 发明授权

US08717357B2 Systems and methods for 3-D scene acceleration structure creation and updating 有权
标题翻译： 3-D场景加速结构创建和更新的系统和方法
公开(公告)号：US08717357B2
公开(公告)日：2014-05-06
申请号：US13567033
申请日：2012-08-04
申请人： James Alexander McCombe , Aaron Dwyer , Luke Tilman Peterson , Neils Nesse
发明人： James Alexander McCombe , Aaron Dwyer , Luke Tilman Peterson , Neils Nesse
IPC分类号： G06T15/08
CPC分类号： G06T15/06 , G06T15/08 , G06T15/80 , G06T17/00 , G06T17/10 , G06T2210/12 , G06T2210/32
摘要： Systems and methods for producing an acceleration structure provide for subdividing a 3-D scene into a plurality of volumetric portions, which have different sizes, each being addressable using a multipart address indicating a location and a relative size of each volumetric portion. A stream of primitives is processed by characterizing each according to one or more criteria, selecting a relative size of volumetric portions for use in bounding the primitive, and finding a set of volumetric portions of that relative size which bound the primitive. A primitive ID is stored in each location of a cache associated with each volumetric portion of the set of volumetric portions. A cache location is selected for eviction, responsive to each cache eviction decision made during the processing. An element of an acceleration structure according to the contents of the evicted cache location is generated, responsive to the evicted cache location.
摘要翻译：用于产生加速结构的系统和方法提供将3-D场景细分为具有不同大小的多个体积部分，每个体积部分可使用指示每个体积部分的位置和相对大小的多部分地址来寻址。通过根据一个或多个标准表征每个图元来处理图元流，选择用于界定图元的体积部分的相对大小，以及找到结合原始图像的相对尺寸的一组体积部分。原始ID存储在与该组体积部分的每个体积部分相关联的高速缓存的每个位置中。响应于在处理期间进行的每个缓存驱逐决定，选择缓存位置进行驱逐。响应于被驱逐的高速缓存位置，生成根据驱逐的高速缓存位置的内容的加速结构的元素。

8. 发明授权

US08692834B2 Graphics processor with non-blocking concurrent architecture 有权
标题翻译：具有非阻塞并发架构的图形处理器
公开(公告)号：US08692834B2
公开(公告)日：2014-04-08
申请号：US13567091
申请日：2012-08-06
申请人： Luke Tilman Peterson , James Alexander McCombe , Steven John Clohset , Jason Rupert Redgrave
发明人： Luke Tilman Peterson , James Alexander McCombe , Steven John Clohset , Jason Rupert Redgrave
IPC分类号： G06F15/80 , G06F15/00 , G06T1/00
CPC分类号： G06T15/005 , G06F9/5033 , G06F9/505 , G06F9/52 , G06F15/8007 , G06T1/20 , G06T1/60 , G06T15/06 , G06T2200/28
摘要： In some aspects, systems and methods provide for forming groupings of a plurality of independently-specified computation workloads, such as graphics processing workloads, and in a specific example, ray tracing workloads. The workloads include a scheduling key, which is one basis on which the groupings can be formed. Workloads grouped together can all execute from the same source of instructions, one or more different private data elements. Such workloads can recursively instantiate other workloads that reference the same private data elements. In some examples, the scheduling key can be used to identify a data element to be used by all the workloads of a grouping. Memory conflicts to private data elements are handled through scheduling of non-conflicted workloads or specific instructions an deferring conflicted workloads instead of locking memory locations.
摘要翻译：在一些方面，系统和方法提供用于形成多个独立指定的计算工作负荷（诸如图形处理工作负载）以及在具体示例中的光线跟踪工作负载的分组。工作负载包括一个调度密钥，这是可以形成分组的一个基础。分组在一起的工作负载都可以从相同的指令来源执行，一个或多个不同的私有数据元素。这样的工作负载可以递归地实例化引用相同私有数据元素的其他工作负载。在一些示例中，调度密钥可用于标识要由分组的所有工作负载使用的数据元素。与私有数据元素的内存冲突通过调度非冲突的工作负载或特定指令来处理，推迟冲突的工作负载而不是锁定内存位置。

9. 发明申请

US20120324458A1 SCHEDULING HETEROGENOUS COMPUTATION ON MULTITHREADED PROCESSORS 审中-公开
标题翻译：在多处理器上调度异构计算
公开(公告)号：US20120324458A1
公开(公告)日：2012-12-20
申请号：US13368682
申请日：2012-02-08
申请人： Luke Tilman PETERSON , James Alexander McCombe
发明人： Luke Tilman PETERSON , James Alexander McCombe
IPC分类号： G06F9/46
CPC分类号： G06F9/5038 , G06F9/3828 , G06F9/3851 , G06F9/3891 , G06F2209/506
摘要： Aspects include computation systems that can identify computation instances that are not capable of being reentrant, or are not reentrant capable on a target architecture, or are non-reentrant as a result of having a memory conflict in a particular execution situation. A system can have a plurality of computation units, each with an independently schedulable SIMD vector. Computation instances can be defined by a program module, and a data element(s) that may be stored in a local cache for a particular computation unit. Each local cache does not maintain coherency controls for such data elements. During scheduling, a scheduler can maintain a list of running (or runnable) instances, and attempt to schedule new computation instances by determining whether any new computation instance conflicts with a running instance and responsively defer scheduling. Memory conflict checks can be conditioned on a flag or other indication of the potential for non-reentrancy.
摘要翻译：方面包括计算系统，其可以识别不能够在目标架构上能够重入或不可重入的计算实例，或者由于在特定执行情况下存在内存冲突而不能重入。系统可以具有多个计算单元，每个具有可独立调度的SIMD向量。计算实例可以由程序模块和可存储在特定计算单元的本地高速缓存中的数据元素来定义。每个本地缓存不保持这些数据元素的一致性控制。在调度期间，调度程序可以维护运行（或可运行）实例的列表，并尝试通过确定任何新的计算实例是否与正在运行的实例冲突并响应延迟调度来调度新的计算实例。内存冲突检查可以根据标志或其他指示来进行非重新注册。

10. 发明申请

US20120249553A1 ARCHITECTURES FOR CONCURRENT GRAPHICS PROCESSING OPERATIONS 有权
标题翻译：相关图形处理操作的架构
公开(公告)号：US20120249553A1
公开(公告)日：2012-10-04
申请号：US13525300
申请日：2012-06-16
申请人： Luke Tilman Peterson , James Alexander McCombe , Ryan R. Salsbury , Stephen Purcell
发明人： Luke Tilman Peterson , James Alexander McCombe , Ryan R. Salsbury , Stephen Purcell
IPC分类号： G06T15/06 , G06T1/20
CPC分类号： G06T15/06 , G06T15/005 , G06T15/50 , G06T15/80
摘要： Ray tracing, and more generally, graphics operations taking place in a 3-D scene, involve a plurality of constituent graphics operations. Scheduling of graphics operations for concurrent execution on a computer may increase throughput. In aspects herein, constituent graphics operations are scheduled in groups, having members selected according to disclosed aspects. Processing for specific graphics operations in a group can be deferred if all the operations in the group cannot be further tested concurrently. Graphics operations that have been deferred are recombined into two or more different groups and ultimately complete processing, through a required number of iterations of such process. In one application, the performance of the graphics operations perform a search in which respective 1:1 matches between different types of geometric shapes involved in the 3-D scene are identified. For example, closest intersections between rays and scene geometry can be identified by processing scheduled according to disclosed aspects.
摘要翻译：光线跟踪，更一般地，在3-D场景中发生的图形操作涉及多个组成图形操作。在计算机上并行执行的图形操作的调度可以增加吞吐量。在这些方面中，组件图形操作按组进行调度，具有根据公开的方面选择的成员。如果组中的所有操作无法进一步并发测试，则可以推迟对组中特定图形操作的处理。已经推迟的图形操作被重新组合成两个或更多个不同的组，并且通过所需数量的这种过程的迭代最终完成处理。在一个应用中，图形操作的性能执行搜索，其中识别在3-D场景中涉及的不同类型的几何形状之间相应的1：1匹配。例如，可以通过根据所公开的方面调度的处理来识别光线和场景几何之间的最接近的交点。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式