Online maximum-likelihood mean and variance normalization for speech recognition

Invention Grant

US09280979B2 Online maximum-likelihood mean and variance normalization for speech recognition 有权

Please log in to see more content

Patent Title: Online maximum-likelihood mean and variance normalization for speech recognition
Application No.: US14640912

Application Date: 2015-03-06
Publication No.: US09280979B2

Publication Date: 2016-03-08
Inventor: Daniel Willett
Applicant: Nuance Communications, Inc.
Applicant Address: US MA Burlington
Assignee: Nuance Communications, Inc.
Current Assignee: Nuance Communications, Inc.
Current Assignee Address: US MA Burlington
Agency: Banner & Witcoff, Ltd.
Main IPC: G10L19/02
IPC: G10L19/02 ; G10L15/02 ; G10L15/08 ; G10L15/20 ; G10L19/00 ; G10L15/34

Online maximum-likelihood mean and variance normalization for speech recognition

Abstract:

A feature transform for speech recognition is described. An input speech utterance is processed to produce a sequence of representative speech vectors. A time-synchronous speech recognition pass is performed using a decoding search to determine a recognition output corresponding to the speech input. The decoding search includes, for each speech vector after some first threshold number of speech vectors, estimating a feature transform based on the preceding speech vectors in the utterance and partial decoding results of the decoding search. The current speech vector is then adjusted based on the current feature transform, and the adjusted speech vector is used in a current frame of the decoding search.

Public/Granted literature

US20150221320A1 Online Maximum-Likelihood Mean and Variance Normalization for Speech Recognition Public/Granted day:2015-08-06

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L19/00	用于冗余度下降情形（例如在声码器中）的语音或音频信号分析-合成技术；语音或音频信号编码或解码，采用源滤波器模型或心理声学分析（乐器中的入G10H）
G10L19/02	.利用频谱分析，例如变换声码器或子频带声码器