Invention Grant
- Patent Title: Data sorting for generating RNN-T models
-
Application No.: US17580846Application Date: 2022-01-21
-
Publication No.: US12027153B2Publication Date: 2024-07-02
- Inventor: Takashi Fukuda , Tohru Nagano
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Tutunjian & Bitetto, P.C.
- Agent Robert Richard Aragona
- Main IPC: G10L15/02
- IPC: G10L15/02 ; G06F7/24 ; G10L15/06

Abstract:
A computer-implemented method for preparing training data for a speech recognition model is provided including obtaining a plurality of sentences from a corpus, dividing each phoneme in each sentence of the plurality of sentences into three hidden states, calculating, for each sentence of the plurality of sentences, a score based on a variation in duration of the three hidden states of each phoneme in the sentence, and sorting the plurality of sentences by using the calculated scores.
Public/Granted literature
- US20230237987A1 DATA SORTING FOR GENERATING RNN-T MODELS Public/Granted day:2023-07-27
Information query