Invention Grant
- Patent Title: Online maximum-likelihood mean and variance normalization for speech recognition
-
Application No.: US14640912Application Date: 2015-03-06
-
Publication No.: US09280979B2Publication Date: 2016-03-08
- Inventor: Daniel Willett
- Applicant: Nuance Communications, Inc.
- Applicant Address: US MA Burlington
- Assignee: Nuance Communications, Inc.
- Current Assignee: Nuance Communications, Inc.
- Current Assignee Address: US MA Burlington
- Agency: Banner & Witcoff, Ltd.
- Main IPC: G10L19/02
- IPC: G10L19/02 ; G10L15/02 ; G10L15/08 ; G10L15/20 ; G10L19/00 ; G10L15/34

Abstract:
A feature transform for speech recognition is described. An input speech utterance is processed to produce a sequence of representative speech vectors. A time-synchronous speech recognition pass is performed using a decoding search to determine a recognition output corresponding to the speech input. The decoding search includes, for each speech vector after some first threshold number of speech vectors, estimating a feature transform based on the preceding speech vectors in the utterance and partial decoding results of the decoding search. The current speech vector is then adjusted based on the current feature transform, and the adjusted speech vector is used in a current frame of the decoding search.
Public/Granted literature
- US20150221320A1 Online Maximum-Likelihood Mean and Variance Normalization for Speech Recognition Public/Granted day:2015-08-06
Information query