Compute optimizations for low precision machine learning operations

Invention Grant

US11948224B2 Compute optimizations for low precision machine learning operations 有权

Please log in to see more content

Patent Title: Compute optimizations for low precision machine learning operations
Application No.: US17978573

Application Date: 2022-11-01
Publication No.: US11948224B2

Publication Date: 2024-04-02
Inventor: Elmoustapha Ould-Ahmed-Vall , Sara S. Baghsorkhi , Anbang Yao , Kevin Nealis , Xiaoming Chen , Altug Koker , Abhishek R. Appu , John C. Weast , Mike B. Macpherson , Dukhwan Kim , Linda L. Hurd , Ben J. Ashbaugh , Barath Lakshmanan , Liwei Ma , Joydeep Ray , Ping T. Tang , Michael S. Strickland
Applicant: Intel Corporation
Applicant Address: US CA Santa Clara
Assignee: Intel Corporation
Current Assignee: Intel Corporation
Current Assignee Address: US CA Santa Clara
Agency: Jaffery Watson Mendonsa & Hamilton LLP
Main IPC: G06T1/20
IPC: G06T1/20 ; G06F3/14 ; G06F7/483 ; G06F9/30 ; G06F9/38 ; G06F9/50 ; G06N3/044 ; G06N3/045 ; G06N3/063 ; G06N3/08 ; G06N3/084 ; G06N20/00 ; G06T1/60 ; G06T15/00

Compute optimizations for low precision machine learning operations

Abstract:

One embodiment provides an apparatus comprising a memory stack including multiple memory dies and a parallel processor including a plurality of multiprocessors. Each multiprocessor has a single instruction, multiple thread (SIMT) architecture, the parallel processor coupled to the memory stack via one or more memory interfaces. At least one multiprocessor comprises a multiply-accumulate circuit to perform multiply-accumulate operations on matrix data in a stage of a neural network implementation to produce a result matrix comprising a plurality of matrix data elements at a first precision, precision tracking logic to evaluate metrics associated with the matrix data elements and indicate if an optimization is to be performed for representing data at a second stage of the neural network implementation, and a numerical transform unit to dynamically perform a numerical transform operation on the matrix data elements based on the indication to produce transformed matrix data elements at a second precision.

Public/Granted literature

US20230061670A1 COMPUTE OPTIMIZATIONS FOR LOW PRECISION MACHINE LEARNING OPERATIONS Public/Granted day:2023-03-02

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06T	一般的图像数据处理或产生
G06T1/00	通用图像数据处理
G06T1/20	.处理器架构; 处理器配置，例如流水线