Invention Grant
US09038079B2 Reducing cross queue synchronization on systems with low memory latency across distributed processing nodes 有权
减少跨分布式处理节点内存延迟低的系统的交叉队列同步

Reducing cross queue synchronization on systems with low memory latency across distributed processing nodes
Abstract:
A method for efficient dispatch/completion of a work element within a multi-node data processing system. The method comprises: selecting specific processing units from among the processing nodes to complete execution of a work element that has multiple individual work items that may be independently executed by different ones of the processing units; generating an allocated processor unit (APU) bit mask that identifies at least one of the processing units that has been selected; placing the work element in a first entry of a global command queue (GCQ); associating the APU mask with the work element in the GCQ; and responsive to receipt at the GCQ of work requests from each of the multiple processing nodes or the processing units, enabling only the selected specific ones of the processing nodes or the processing units to be able to retrieve work from the work element in the GCQ.
Information query
Patent Agency Ranking
0/0