-
公开(公告)号:US09940931B2
公开(公告)日:2018-04-10
申请号:US15201188
申请日:2016-07-01
Applicant: Amazon Technologies, Inc.
Inventor: Marc White , Igor Roditis Jablokov , Victor Roman Jablokov
CPC classification number: G10L15/26 , G06F3/0236 , G10L15/183 , G10L15/19 , G10L15/22 , G10L15/30 , G10L2015/0631
Abstract: A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, that the client device, a result, transcribed at the first remote server using an automatic speech recognition system (“ASR”), from the audio message; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier.
-
公开(公告)号:US09583107B2
公开(公告)日:2017-02-28
申请号:US14517720
申请日:2014-10-17
Applicant: Amazon Technologies, Inc.
Inventor: James Richard Terrell, II , Marc White , Igor Roditis Jablokov
CPC classification number: G10L15/26 , G10L15/01 , G10L15/22 , G10L2015/221
Abstract: A method of providing speech transcription performance indication includes receiving, at a user device data representing text transcribed from an audio stream by an ASR system, and data representing a metric associated with the audio stream; displaying, via the user device, said text; and via the user device, providing, in user-perceptible form, an indicator of said metric. Another method includes displaying, by a user device, text transcribed from an audio stream by an ASR system; and via the user device, providing, in user-perceptible form, an indicator of a level of background noise of the audio stream. Another method includes receiving data representing an audio stream; converting said data representing an audio stream to text via an ASR system; determining a metric associated with the audio stream; transmitting data representing said text to a user device; and transmitting data representing said metric to the user device.
Abstract translation: 提供语音转录性能指示的方法包括在用户设备处接收表示由ASR系统从音频流转录的文本的数据和表示与音频流相关联的度量的数据; 经由所述用户设备显示所述文本; 并且经由用户设备,以用户可感知的形式提供所述度量的指示符。 另一方法包括由用户设备显示由ASR系统从音频流转录的文本; 并且经由用户设备,以用户可感知的形式提供音频流的背景噪声水平的指示符。 另一种方法包括接收表示音频流的数据; 通过ASR系统将表示音频流的所述数据转换为文本; 确定与所述音频流相关联的度量; 将表示所述文本的数据发送到用户设备; 以及将表示所述度量的数据发送到用户设备。
-
3.
公开(公告)号:US20170004831A1
公开(公告)日:2017-01-05
申请号:US15201188
申请日:2016-07-01
Applicant: Amazon Technologies, Inc.
Inventor: Marc White , Igor Roditis Jablokov , Victor Roman Jablokov
IPC: G10L15/26 , G10L15/22 , G10L15/183 , G10L15/30
CPC classification number: G10L15/26 , G06F3/0236 , G10L15/183 , G10L15/19 , G10L15/22 , G10L15/30 , G10L2015/0631
Abstract: A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, that the client device, a result, transcribed at the first remote server using an automatic speech recognition system (“ASR”), from the audio message; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier.
Abstract translation: 一种便于更新语言模型的方法包括在客户端设备经由麦克风接收对应于用户语音的音频消息; 将音频消息传送到第一远程服务器; 从所述音频消息接收使用自动语音识别系统(“ASR”)在所述第一远程服务器处转录的结果,所述客户端设备, 在客户端设备从用户接收结果的肯定; 在客户端设备处存储与对应于音频消息的标识符相关联的结果; 以及与所述标识符一起与所述第二远程服务器通信所存储的结果。
-
4.
公开(公告)号:US09384735B2
公开(公告)日:2016-07-05
申请号:US14341054
申请日:2014-07-25
Applicant: Amazon Technologies, Inc.
Inventor: Marc White , Igor Roditis Jablokov , Victor Roman Jablokov
CPC classification number: G10L15/26 , G06F3/0236 , G10L15/183 , G10L15/19 , G10L15/22 , G10L15/30 , G10L2015/0631
Abstract: A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, that the client device, a result, transcribed at the first remote server using an automatic speech recognition system (“ASR”), from the audio message; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier.
Abstract translation: 一种便于更新语言模型的方法包括在客户端设备经由麦克风接收对应于用户语音的音频消息; 将音频消息传送到第一远程服务器; 从所述音频消息接收使用自动语音识别系统(“ASR”)在所述第一远程服务器处转录的结果,所述客户端设备, 在客户端设备从用户接收结果的肯定; 在客户端设备处存储与对应于音频消息的标识符相关联的结果; 以及与所述标识符一起与所述第二远程服务器通信所存储的结果。
-
-
-