Invention Grant
- Patent Title: Speech recognition and transcription among users having heterogeneous protocols
-
Application No.: US15400732Application Date: 2017-01-06
-
Publication No.: US09934786B2Publication Date: 2018-04-03
- Inventor: Joseph H. Miglietta , Michael K. Davis
- Applicant: Advanced Voice Recognition Systems, Inc.
- Applicant Address: US AZ Scottsdale
- Assignee: Advanced Voice Recognition Systems, Inc.
- Current Assignee: Advanced Voice Recognition Systems, Inc.
- Current Assignee Address: US AZ Scottsdale
- Agency: Ascenda Law Group, PC
- Main IPC: G10L21/00
- IPC: G10L21/00 ; G10L15/30 ; G06F17/28 ; G10L15/26 ; G06F17/30 ; H04M3/493

Abstract:
A system is disclosed for facilitating free form dictation, including directed dictation and constrained recognition and/or structured transcription among users having heterogeneous native (legacy) protocols for generating, transcribing, and exchanging recognized and transcribed speech. The system includes at least one system transaction manager having a “system protocol,” to receive a verified, streamed speech information request from at least one authorized user employing a first legacy user protocol. The speech information request which includes spoken text and system commands is generated using a user interface capable of bi-directional communication with the system transaction manager and supporting dictation applications, including prompts to direct user dictation in response to user system protocol commands and systems transaction manager commands. A speech recognition and/or transcription engine (ASR), in communication with the systems transaction manager, receives the speech information request from the system transaction manager, generates a transcribed response, which can include a formatted transcription, and transmits the response to the system transaction manager. The system transaction manager routes the response to one or more of the users employing a second protocol, which may be the same as or different than the first protocol. In another embodiment, the system employs a virtual sound driver for streaming free form dictation to any ASR, regardless of the ASR's ability to recognize and/or transcribe spoken text from any input source such as, for example, a live microphone or line input. In another embodiment, the system employs a buffer to facilitate the system's use of ASRs requiring input data to be in batches, while providing the user with an uninterrupted, seamless dictating experience.
Public/Granted literature
- US20170116993A1 SPEECH RECOGNITION AND TRANSCRIPTION AMONG USERS HAVING HETEROGENEOUS PROTOCOLS Public/Granted day:2017-04-27
Information query