-
公开(公告)号:US09940931B2
公开(公告)日:2018-04-10
申请号:US15201188
申请日:2016-07-01
Applicant: Amazon Technologies, Inc.
Inventor: Marc White , Igor Roditis Jablokov , Victor Roman Jablokov
CPC classification number: G10L15/26 , G06F3/0236 , G10L15/183 , G10L15/19 , G10L15/22 , G10L15/30 , G10L2015/0631
Abstract: A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, that the client device, a result, transcribed at the first remote server using an automatic speech recognition system (“ASR”), from the audio message; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier.
-
公开(公告)号:US20160217786A1
公开(公告)日:2016-07-28
申请号:US14685528
申请日:2015-04-13
Applicant: Amazon Technologies, Inc.
Inventor: Victor R. Jablokov , Igor R. Jablokov , Marc White
IPC: G10L15/26
CPC classification number: G10L15/26 , G06Q30/0251 , G10L13/043 , G10L15/30 , H04L51/066 , H04L51/38
Abstract: Methods, systems, and software for converting the audio input of a user of a handheld client device or mobile phone into a textual representation by means of a backend server accessed by the device through a communications network. The text is then inserted into or used by an application of the client device to send a text message, instant message, email, or to insert a request into a web-based application or service. In one embodiment, the method includes the steps of initializing or launching the application on the device; recording and transmitting the recorded audio message from the client device to the backend server through a client-server communication protocol; converting the transmitted audio message into the textual representation in the backend server; and sending the converted text message back to the client device or forwarding it on to an alternate destination directly from the server.
Abstract translation: 用于通过通过通信网络由设备访问的后端服务器将手持式客户端设备或移动电话的用户的音频输入转换为文本表示的方法,系统和软件。 然后将文本插入或由客户端设备的应用程序使用,以发送文本消息,即时消息,电子邮件,或将请求插入基于Web的应用程序或服务。 在一个实施例中,该方法包括在设备上初始化或启动应用的步骤; 通过客户机 - 服务器通信协议将记录的音频消息从客户端设备记录并发送到后端服务器; 将发送的音频消息转换成后端服务器中的文本表示; 并将转换的文本消息发送回客户端设备,或将其直接从服务器转发到备用目的地。
-
公开(公告)号:US09583107B2
公开(公告)日:2017-02-28
申请号:US14517720
申请日:2014-10-17
Applicant: Amazon Technologies, Inc.
Inventor: James Richard Terrell, II , Marc White , Igor Roditis Jablokov
CPC classification number: G10L15/26 , G10L15/01 , G10L15/22 , G10L2015/221
Abstract: A method of providing speech transcription performance indication includes receiving, at a user device data representing text transcribed from an audio stream by an ASR system, and data representing a metric associated with the audio stream; displaying, via the user device, said text; and via the user device, providing, in user-perceptible form, an indicator of said metric. Another method includes displaying, by a user device, text transcribed from an audio stream by an ASR system; and via the user device, providing, in user-perceptible form, an indicator of a level of background noise of the audio stream. Another method includes receiving data representing an audio stream; converting said data representing an audio stream to text via an ASR system; determining a metric associated with the audio stream; transmitting data representing said text to a user device; and transmitting data representing said metric to the user device.
Abstract translation: 提供语音转录性能指示的方法包括在用户设备处接收表示由ASR系统从音频流转录的文本的数据和表示与音频流相关联的度量的数据; 经由所述用户设备显示所述文本; 并且经由用户设备,以用户可感知的形式提供所述度量的指示符。 另一方法包括由用户设备显示由ASR系统从音频流转录的文本; 并且经由用户设备,以用户可感知的形式提供音频流的背景噪声水平的指示符。 另一种方法包括接收表示音频流的数据; 通过ASR系统将表示音频流的所述数据转换为文本; 确定与所述音频流相关联的度量; 将表示所述文本的数据发送到用户设备; 以及将表示所述度量的数据发送到用户设备。
-
公开(公告)号:US09542944B2
公开(公告)日:2017-01-10
申请号:US14685528
申请日:2015-04-13
Applicant: Amazon Technologies, Inc.
Inventor: Victor R. Jablokov , Igor R. Jablokov , Marc White
CPC classification number: G10L15/26 , G06Q30/0251 , G10L13/043 , G10L15/30 , H04L51/066 , H04L51/38
Abstract: Methods, systems, and software for converting the audio input of a user of a handheld client device or mobile phone into a textual representation by means of a backend server accessed by the device through a communications network. The text is then inserted into or used by an application of the client device to send a text message, instant message, email, or to insert a request into a web-based application or service. In one embodiment, the method includes the steps of initializing or launching the application on the device; recording and transmitting the recorded audio message from the client device to the backend server through a client-server communication protocol; converting the transmitted audio message into the textual representation in the backend server; and sending the converted text message back to the client device or forwarding it on to an alternate destination directly from the server.
Abstract translation: 用于通过通过通信网络由设备访问的后端服务器将手持式客户端设备或移动电话的用户的音频输入转换为文本表示的方法,系统和软件。 然后将文本插入或由客户端设备的应用程序使用,以发送文本消息,即时消息,电子邮件,或将请求插入基于Web的应用程序或服务。 在一个实施例中,该方法包括在设备上初始化或启动应用的步骤; 通过客户机 - 服务器通信协议将记录的音频消息从客户端设备记录并发送到后端服务器; 将发送的音频消息转换成后端服务器中的文本表示; 并将转换的文本消息发送回客户端设备,或将其直接从服务器转发到备用目的地。
-
5.
公开(公告)号:US20170004831A1
公开(公告)日:2017-01-05
申请号:US15201188
申请日:2016-07-01
Applicant: Amazon Technologies, Inc.
Inventor: Marc White , Igor Roditis Jablokov , Victor Roman Jablokov
IPC: G10L15/26 , G10L15/22 , G10L15/183 , G10L15/30
CPC classification number: G10L15/26 , G06F3/0236 , G10L15/183 , G10L15/19 , G10L15/22 , G10L15/30 , G10L2015/0631
Abstract: A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, that the client device, a result, transcribed at the first remote server using an automatic speech recognition system (“ASR”), from the audio message; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier.
Abstract translation: 一种便于更新语言模型的方法包括在客户端设备经由麦克风接收对应于用户语音的音频消息; 将音频消息传送到第一远程服务器; 从所述音频消息接收使用自动语音识别系统(“ASR”)在所述第一远程服务器处转录的结果,所述客户端设备, 在客户端设备从用户接收结果的肯定; 在客户端设备处存储与对应于音频消息的标识符相关联的结果; 以及与所述标识符一起与所述第二远程服务器通信所存储的结果。
-
6.
公开(公告)号:US09384735B2
公开(公告)日:2016-07-05
申请号:US14341054
申请日:2014-07-25
Applicant: Amazon Technologies, Inc.
Inventor: Marc White , Igor Roditis Jablokov , Victor Roman Jablokov
CPC classification number: G10L15/26 , G06F3/0236 , G10L15/183 , G10L15/19 , G10L15/22 , G10L15/30 , G10L2015/0631
Abstract: A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, that the client device, a result, transcribed at the first remote server using an automatic speech recognition system (“ASR”), from the audio message; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier.
Abstract translation: 一种便于更新语言模型的方法包括在客户端设备经由麦克风接收对应于用户语音的音频消息; 将音频消息传送到第一远程服务器; 从所述音频消息接收使用自动语音识别系统(“ASR”)在所述第一远程服务器处转录的结果,所述客户端设备, 在客户端设备从用户接收结果的肯定; 在客户端设备处存储与对应于音频消息的标识符相关联的结果; 以及与所述标识符一起与所述第二远程服务器通信所存储的结果。
-
-
-
-
-