Invention Grant
- Patent Title: Training and/or using a language selection model for automatically determining language for speech recognition of spoken utterance
-
Application No.: US16959037Application Date: 2019-11-27
-
Publication No.: US11410641B2Publication Date: 2022-08-09
- Inventor: Li Wan , Yang Yu , Prashant Sridhar , Ignacio Lopez Moreno , Quan Wang
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Middleton Reutlinger
- International Application: PCT/US2019/063643 WO 20191127
- International Announcement: WO2020/113031 WO 20200604
- Main IPC: G10L15/00
- IPC: G10L15/00

Abstract:
Methods and systems for training and/or using a language selection model for use in determining a particular language of a spoken utterance captured in audio data. Features of the audio data can be processed using the trained language selection model to generate a predicted probability for each of N different languages, and a particular language selected based on the generated probabilities. Speech recognition results for the particular language can be utilized responsive to selecting the particular language of the spoken utterance. Many implementations are directed to training the language selection model utilizing tuple losses in lieu of traditional cross-entropy losses. Training the language selection model utilizing the tuple losses can result in more efficient training and/or can result in a more accurate and/or robust model—thereby mitigating erroneous language selections for spoken utterances.
Public/Granted literature
Information query