Multi-mode low-precision inner-product computation circuits for massively parallel neural inference engine

Invention Grant

US11270196B2 Multi-mode low-precision inner-product computation circuits for massively parallel neural inference engine 有权

Please log in to see more content

Patent Title: Multi-mode low-precision inner-product computation circuits for massively parallel neural inference engine
Application No.: US16653366

Application Date: 2019-10-15
Publication No.: US11270196B2

Publication Date: 2022-03-08
Inventor: Jun Sawada , Filipp A. Akopyan , Rathinakumar Appuswamy , John V. Arthur , Andrew S. Cassidy , Pallab Datta , Steven K. Esser , Myron D. Flickner , Dharmendra S. Modha , Tapan K. Nayak , Carlos O. Otero
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Applicant Address: US NY Armonk
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee Address: US NY Armonk
Agency: Foley Hoag, LLP
Agent Erik A. Huestis; Stephen J. Kenny
Main IPC: G06N3/063
IPC: G06N3/063 ; G06N3/04

Multi-mode low-precision inner-product computation circuits for massively parallel neural inference engine

Abstract:

Neural inference chips for computing neural activations are provided. In various embodiments, the neural inference chip is adapted to: receive an input activation tensor comprising a plurality of input activations; receive a weight tensor comprising a plurality of weights; Booth recode each of the plurality of weights into a plurality of Booth-coded weights, each Booth coded value having an order; multiply the input activation tensor by the Booth coded weights, yielding a plurality of results for each input activation, each of the plurality of results corresponding to the orders of the Booth-coded weights; for each order of the Booth-coded weights, sum the corresponding results, yielding a plurality of partial sums, one for each order; and compute a neural activation from a sum of the plurality of partial sums.

Public/Granted literature

US20210110245A1 MULTI-MODE LOW-PRECISION INNER-PRODUCT COMPUTATION CIRCUITS FOR MASSIVELY PARALLEL NEURAL INFERENCE ENGINE Public/Granted day:2021-04-15

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/06	..物理实现，即神经网络、神经元或神经元部分的硬件实现
G06N3/063	...采用电的