Speech recognition and transcription among users having heterogeneous protocols

Invention Grant

US09934786B2 Speech recognition and transcription among users having heterogeneous protocols (Status: In force)

Patent Title: Speech recognition and transcription among users having heterogeneous protocols
Application No.: US15400732

Application Date: 2017-01-06
Publication No.: US09934786B2

Publication Date: 2018-04-03
Inventor: Joseph H. Miglietta, Michael K. Davis
Applicant: Advanced Voice Recognition Systems, Inc.
Applicant Address: US AZ Scottsdale
Assignee: Advanced Voice Recognition Systems, Inc.
Current Assignee: Advanced Voice Recognition Systems, Inc.
Current Assignee Address: US AZ Scottsdale
Agency: Ascenda Law Group, PC
Main IPC: G10L21/00
IPC: G10L21/00 ; G10L15/30 ; G06F17/28 ; G10L15/26 ; G06F17/30 ; H04M3/493

Abstract:

A system is disclosed for facilitating free form dictation, including directed dictation and constrained recognition and/or structured transcription, among users having heterogeneous native (legacy) protocols for generating, transcribing, and exchanging recognized and transcribed speech. The system includes at least one system transaction manager having a “system protocol,” to receive a verified, streamed speech information request from at least one authorized user employing a first legacy user protocol. The speech information request, which includes spoken text and system commands, is generated using a user interface capable of bi-directional communication with the system transaction manager and supporting dictation applications, including prompts to direct user dictation in response to user system protocol commands and system transaction manager commands. A speech recognition and/or transcription engine (ASR), in communication with the system transaction manager, receives the speech information request from the system transaction manager, generates a transcribed response, which can include a formatted transcription, and transmits the response to the system transaction manager. The system transaction manager routes the response to one or more of the users employing a second protocol, which may be the same as or different from the first protocol. In another embodiment, the system employs a virtual sound driver for streaming free form dictation to any ASR, regardless of the ASR's ability to recognize and/or transcribe spoken text from any input source such as, for example, a live microphone or line input. In another embodiment, the system employs a buffer to facilitate the system's use of ASRs requiring input data to be in batches, while providing the user with an uninterrupted, seamless dictating experience.
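
The routing flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the class names (SystemMessage, ProtocolAdapter, SystemTransactionManager), the two example legacy formats (a JSON message and a pipe-delimited string), and the stub ASR are all assumptions made for illustration. Speech is represented as plain text so the example stays self-contained.

```python
import json
from dataclasses import dataclass
from typing import Callable, Dict, Iterable


@dataclass
class SystemMessage:
    """Message normalized to a hypothetical common 'system protocol'."""
    user_id: str
    body: str  # speech content in a request, transcript in a response


@dataclass
class ProtocolAdapter:
    """Translates between one legacy user protocol and the system protocol."""
    to_system: Callable[[str, str], SystemMessage]
    from_system: Callable[[SystemMessage], str]


# Two invented legacy formats, used only to show heterogeneous protocols.
json_adapter = ProtocolAdapter(
    to_system=lambda uid, raw: SystemMessage(uid, json.loads(raw)["speech"]),
    from_system=lambda msg: json.dumps({"from": msg.user_id, "text": msg.body}),
)
pipe_adapter = ProtocolAdapter(
    to_system=lambda uid, raw: SystemMessage(uid, raw.split("|", 1)[1]),
    from_system=lambda msg: f"{msg.user_id}|{msg.body}",
)


def stub_asr(speech: str) -> str:
    """Stand-in for a real ASR engine: tidies text instead of decoding audio."""
    return speech.strip().capitalize() + "."


class SystemTransactionManager:
    """Receives a request in one protocol, calls the ASR, routes the response."""

    def __init__(self, asr: Callable[[str], str], authorized: Iterable[str]):
        self._asr = asr
        self._authorized = set(authorized)
        self._adapters: Dict[str, ProtocolAdapter] = {}

    def register(self, user_id: str, adapter: ProtocolAdapter) -> None:
        self._adapters[user_id] = adapter

    def handle_request(self, sender: str, native_request: str,
                       recipients: Iterable[str]) -> Dict[str, str]:
        # Only authorized users may submit speech information requests.
        if sender not in self._authorized:
            raise PermissionError(f"user {sender!r} is not authorized")
        # Normalize the sender's legacy format into the system protocol.
        request = self._adapters[sender].to_system(sender, native_request)
        # Obtain the transcribed response from the ASR engine.
        response = SystemMessage(sender, self._asr(request.body))
        # Deliver the response to each recipient in that recipient's protocol.
        return {r: self._adapters[r].from_system(response) for r in recipients}
```

In this sketch, a request submitted by a user on the JSON format can be delivered to a user on the pipe-delimited format, because both are translated through the shared SystemMessage type. The virtual sound driver and batching buffer mentioned in the abstract are not modeled here.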

Public/Granted literature

US20170116993A1 SPEECH RECOGNITION AND TRANSCRIPTION AMONG USERS HAVING HETEROGENEOUS PROTOCOLS Public/Granted day: 2017-04-27

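IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L21/00	Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility (G10L19/00 takes precedence)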