Invention Grant
- Patent Title: Aligning a transcript to audio data
- Patent Title (中): 将抄本与音频数据对齐
-
Application No.: US12238257Application Date: 2008-09-25
-
Publication No.: US08131545B1Publication Date: 2012-03-06
- Inventor: Pedro J. Moreno , Christopher Alberti
- Applicant: Pedro J. Moreno , Christopher Alberti
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G10L15/26
- IPC: G10L15/26

Abstract:
The subject matter of this specification can be implemented in, among other things, a computer-implemented method including receiving audio data and a transcript of the audio data. The method further includes generating a language model including a factor automaton that includes automaton states and arcs, each of the automaton arcs corresponding to a language element from the transcript. The method further includes receiving language elements recognized from the received audio data and times at which each of the recognized language elements occur in the audio data. The method further includes comparing the recognized language elements to one or more of the language elements from the factor automaton to identify times at which the one or more of the language elements from the transcript occur in the audio data. The method further includes aligning a portion of the transcript with a portion of the audio data using the identified times.
Information query