Fast deep neural network feature transformation via optimized memory bandwidth utilization

Invention Grant

US10013652B2 Fast deep neural network feature transformation via optimized memory bandwidth utilization 有权

Please log in to see more content

Patent Title: Fast deep neural network feature transformation via optimized memory bandwidth utilization
Application No.: US14699778

Application Date: 2015-04-29
Publication No.: US10013652B2

Publication Date: 2018-07-03
Inventor: Jan Vlietinck , Stephan Kanthak , Rudi Vuerinckx , Christophe Ris
Applicant: Nuance Communications, Inc.
Applicant Address: US MA Burlington
Assignee: Nuance Communications, Inc.
Current Assignee: Nuance Communications, Inc.
Current Assignee Address: US MA Burlington
Agency: Hamilton, Brook, Smith & Reynolds, P.C.
Main IPC: G10L15/00
IPC: G10L15/00 ; G06N3/08 ; G06N3/04 ; G10L15/16 ; G10L15/06 ; G10L15/02

Fast deep neural network feature transformation via optimized memory bandwidth utilization

Abstract:

Deep Neural Networks (DNNs) with many hidden layers and many units per layer are very flexible models with a very large number of parameters. As such, DNNs are challenging to optimize. To achieve real-time computation, embodiments disclosed herein enable fast DNN feature transformation via optimized memory bandwidth utilization. To optimize memory bandwidth utilization, a rate of accessing memory may be reduced based on a batch setting. A memory, corresponding to a selected given output neuron of a current layer of the DNN, may be updated with an incremental output value computed for the selected given output neuron as a function of input values of a selected few non-zero input neurons of a previous layer of the DNN in combination with weights between the selected few non-zero input neurons and the selected given output neuron, wherein a number of the selected few corresponds to the batch setting.

Public/Granted literature

US20160322042A1 FAST DEEP NEURAL NETWORK FEATURE TRANSFORMATION VIA OPTIMIZED MEMORY BANDWIDTH UTILIZATION Public/Granted day:2016-11-03

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）