Invention Grant
US08386264B2 Speech data retrieval apparatus, speech data retrieval method, speech data retrieval program and computer usable medium having computer readable speech data retrieval program embodied therein
有权
语音数据检索装置,语音数据检索方法,语音数据检索程序以及其中包含计算机可读语音数据检索程序的计算机可用介质
- Patent Title: Speech data retrieval apparatus, speech data retrieval method, speech data retrieval program and computer usable medium having computer readable speech data retrieval program embodied therein
- Patent Title (中): 语音数据检索装置,语音数据检索方法,语音数据检索程序以及其中包含计算机可读语音数据检索程序的计算机可用介质
-
Application No.: US12593636Application Date: 2008-04-11
-
Publication No.: US08386264B2Publication Date: 2013-02-26
- Inventor: Takaaki Hori , I. Lee Hetherington , Timothy J. Hazen , James R. Glass
- Applicant: Takaaki Hori , I. Lee Hetherington , Timothy J. Hazen , James R. Glass
- Applicant Address: JP US MA Cambridge
- Assignee: Nippon Telegraph and Telephone Corporation,Massachusetts Institute of Technology
- Current Assignee: Nippon Telegraph and Telephone Corporation,Massachusetts Institute of Technology
- Current Assignee Address: JP US MA Cambridge
- Agency: Kilpatrick Townsend & Stockton LLP
- International Application: PCT/JP2008/057554 WO 20080411
- International Announcement: WO2008/130018 WO 20081030
- Main IPC: G10L15/00
- IPC: G10L15/00

Abstract:
A speech data retrieval apparatus (10) includes a speech database (1), a speech recognition unit (2), a confusion network creation unit (3), an inverted index table creation unit (4), a query input unit (6), a query conversion unit (7) and a label string check unit (8). The speech recognition unit (2) reads speech data from the speech database (1), carries out a speech recognition process with respect to the read speech data, and outputs a result of speech recognition process as a lattice in which a phoneme, a syllable, or a word is a base unit. The confusion network creation unit (3) creates a confusion network based on the output lattice and outputs the result of speech recognition process as the confusion network. The inverted index table creation unit (4) creates an inverted index table based on the output confusion network. The query input unit (6) receives a query input by a user, carries out a speech recognition process with respect to the received query, and outputs a result of speech recognition process as a character string. The query conversion unit (7) converts the output character string into a label string in which a phoneme, a syllable, or a word is a base unit. The label string check unit (8) checks the label string against the inverted index table and retrieves speech data which is included in both of the label string and the speech database (1).
Public/Granted literature
Information query