Invention Grant
- Patent Title: Fast deep neural network feature transformation via optimized memory bandwidth utilization
-
Application No.: US14699778Application Date: 2015-04-29
-
Publication No.: US10013652B2Publication Date: 2018-07-03
- Inventor: Jan Vlietinck , Stephan Kanthak , Rudi Vuerinckx , Christophe Ris
- Applicant: Nuance Communications, Inc.
- Applicant Address: US MA Burlington
- Assignee: Nuance Communications, Inc.
- Current Assignee: Nuance Communications, Inc.
- Current Assignee Address: US MA Burlington
- Agency: Hamilton, Brook, Smith & Reynolds, P.C.
- Main IPC: G10L15/00
- IPC: G10L15/00 ; G06N3/08 ; G06N3/04 ; G10L15/16 ; G10L15/06 ; G10L15/02

Abstract:
Deep Neural Networks (DNNs) with many hidden layers and many units per layer are very flexible models with a very large number of parameters. As such, DNNs are challenging to optimize. To achieve real-time computation, embodiments disclosed herein enable fast DNN feature transformation via optimized memory bandwidth utilization. To optimize memory bandwidth utilization, a rate of accessing memory may be reduced based on a batch setting. A memory, corresponding to a selected given output neuron of a current layer of the DNN, may be updated with an incremental output value computed for the selected given output neuron as a function of input values of a selected few non-zero input neurons of a previous layer of the DNN in combination with weights between the selected few non-zero input neurons and the selected given output neuron, wherein a number of the selected few corresponds to the batch setting.
Public/Granted literature
- US20160322042A1 FAST DEEP NEURAL NETWORK FEATURE TRANSFORMATION VIA OPTIMIZED MEMORY BANDWIDTH UTILIZATION Public/Granted day:2016-11-03
Information query