SPEECH RECOGNITION AND TRANSCRIPTION AMONG USERS HAVING HETEROGENEOUS PROTOCOLS

    公开(公告)号:US20170116993A1

    公开(公告)日:2017-04-27

    申请号:US15400732

    申请日:2017-01-06

    Abstract: A system is disclosed for facilitating free form dictation, including directed dictation and constrained recognition and/or structured transcription among users having heterogeneous native (legacy) protocols for generating, transcribing, and exchanging recognized and transcribed speech. The system includes at least one system transaction manager having a “system protocol,” to receive a verified, streamed speech information request from at least one authorized user employing a first legacy user protocol. The speech information request which includes spoken text and system commands is generated using a user interface capable of bi-directional communication with the system transaction manager and supporting dictation applications, including prompts to direct user dictation in response to user system protocol commands and systems transaction manager commands. A speech recognition and/or transcription engine (ASR), in communication with the systems transaction manager, receives the speech information request from the system transaction manager, generates a transcribed response, which can include a formatted transcription, and transmits the response to the system transaction manager. The system transaction manager routes the response to one or more of the users employing a second protocol, which may be the same as or different than the first protocol. In another embodiment, the system employs a virtual sound driver for streaming free form dictation to any ASR, regardless of the ASR's ability to recognize and/or transcribe spoken text from any input source such as, for example, a live microphone or line input. In another embodiment, the system employs a buffer to facilitate the system's use of ASRs requiring input data to be in batches, while providing the user with an uninterrupted, seamless dictating experience.

    DYNAMIC SPEECH RECOGNITION AND TRANSCRIPTION AMONG USERS HAVING HETEROGENEOUS PROTOCOLS
    2.
    发明申请
    DYNAMIC SPEECH RECOGNITION AND TRANSCRIPTION AMONG USERS HAVING HETEROGENEOUS PROTOCOLS 审中-公开
    具有异质性协议的用户的动态语音识别和转录

    公开(公告)号:US20150348552A1

    公开(公告)日:2015-12-03

    申请号:US14821786

    申请日:2015-08-10

    Abstract: A system is disclosed for facilitating free form dictation, including directed dictation and constrained recognition and/or structured transcription among users having heterogeneous native (legacy) protocols for generating, transcribing, and exchanging recognized and transcribed speech. The system includes at least one system transaction manager having a “system protocol,” to receive a verified, streamed speech information request from at least one authorized user employing a first legacy user protocol. The speech information request which includes spoken text and system commands is generated using a user interface capable of bi-directional communication with the system transaction manager and supporting dictation applications, including prompts to direct user dictation in response to user system protocol commands and systems transaction manager commands. A speech recognition and/or transcription engine (ASR), in communication with the systems transaction manager, receives the speech information request from the system transaction manager, generates a transcribed response, which can include a formatted transcription, and transmits the response to the system transaction manager. The system transaction manager routes the response to one or more of the users employing a second protocol, which may be the same as or different than the first protocol. In another embodiment, the system employs a virtual sound driver for streaming free form dictation to any ASR, regardless of the ASR's ability to recognize and/or transcribe spoken text from any input source such as, for example, a live microphone or line input. In another embodiment, the system employs a buffer to facilitate the system's use of ASRs requiring input data to be in batches, while providing the user with an uninterrupted, seamless dictating experience.

    Abstract translation: 公开了一种用于促进自由形式听写的系统,包括在具有用于生成,转录和交换已识别和转录的语音的异构原生(传统))协议的用户之间的定向听写和约束识别和/或结构化转录。 该系统包括具有“系统协议”的至少一个系统事务管理器,用于从采用第一传统用户协议的至少一个授权用户接收经过验证的流式语音信息请求。 包括语音文本和系统命令的语音信息请求使用能够与系统事务管理器进行双向通信的用户界面和支持听写应用程序来生成,包括响应于用户系统协议命令和系统事务管理器来指示用户听写的提示 命令。 与系统事务管理器通信的语音识别和/或转录引擎(ASR)从系统事务管理器接收语音信息请求,生成可以包括格式转录的转录响应,并将响应发送到系统 交易经理。 系统事务管理器将响应路由到使用第二协议的一个或多个用户,该第二协议可以与第一协议相同或不同。 在另一个实施例中,系统采用虚拟声音驱动器,用于将任意ASR的自由形式听写流式传输,而不管ASR如何从任何输入源(例如现场麦克风或线路输入)识别和/或转录口语文本的能力。 在另一个实施例中,系统使用缓冲器来促进系统使用需要输入数据的ASR,同时为用户提供不间断的无缝指令体验。

    SPEECH RECOGNITION AND TRANSCRIPTION AMONG USERS HAVING HETEROGENEOUS PROTOCOLS

    公开(公告)号:US20130346079A1

    公开(公告)日:2013-12-26

    申请号:US13928381

    申请日:2013-06-27

    Abstract: A system for facilitating free form dictation, including directed dictation and constrained recognition and/or structured transcription among users having heterogeneous protocols for generating, transcribing, and exchanging recognized and transcribed speech. The system includes a system transaction manager having a “system protocol,” to receive a speech information request from an authorized user. The speech information request is generated using a user interface capable of bi-directional communication with the system transaction manager and supporting dictation applications. A speech recognition and/or transcription engine (ASR), in communication with the system transaction manager, receives the speech information request, generates a transcribed response, and transmits the response to the system transaction manager. The system transaction manager routes the response to one or more of the users. In another embodiment, the system employs a virtual sound driver for streaming free form dictation to any ASR.

    SPEECH RECOGNITION AND TRANSCRIPTION AMONG USERS HAVING HETEROGENEOUS PROTOCOLS
    4.
    发明申请
    SPEECH RECOGNITION AND TRANSCRIPTION AMONG USERS HAVING HETEROGENEOUS PROTOCOLS 有权
    具有异质性协议的用户的语音识别和转录

    公开(公告)号:US20130339016A1

    公开(公告)日:2013-12-19

    申请号:US13928383

    申请日:2013-06-27

    Abstract: A system for facilitating free form dictation, including directed dictation and constrained recognition and/or structured transcription among users having heterogeneous protocols for generating, transcribing, and exchanging recognized and transcribed speech. The system includes a system transaction manager having a “system protocol,” to receive a speech information request from an authorized user. The speech information request is generated using a user interface capable of bi-directional communication with the system transaction manager and supporting dictation applications. A speech recognition and/or transcription engine (ASR), in communication with the system transaction manager, receives the speech information request, generates a transcribed response, and transmits the response to the system transaction manager. The system transaction manager routes the response to one or more of the users. In another embodiment, the system employs a virtual sound driver for streaming free form dictation to any ASR.

    Abstract translation: 一种用于促进自由形式听写的系统,包括在具有用于生成,转录和交换已识别和转录的语音的异构协议的用户之间的定向听写和约束识别和/或结构化转录。 该系统包括具有“系统协议”的系统事务管理器,用于从授权用户接收语音信息请求。 使用能够与系统事务管理器和支持听写应用程序进行双向通信的用户界面来生成语音信息请求。 与系统事务管理器通信的语音识别和/或转录引擎(ASR)接收语音信息请求,产生转录的响应,并将该响应发送给系统事务管理器。 系统事务管理器将响应路由到一个或多个用户。 在另一个实施例中,系统使用虚拟声音驱动器来将任何ASR的自由形式听写流。

    Speech recognition and transcription among users having heterogeneous protocols

    公开(公告)号:US09934786B2

    公开(公告)日:2018-04-03

    申请号:US15400732

    申请日:2017-01-06

    Abstract: A system is disclosed for facilitating free form dictation, including directed dictation and constrained recognition and/or structured transcription among users having heterogeneous native (legacy) protocols for generating, transcribing, and exchanging recognized and transcribed speech. The system includes at least one system transaction manager having a “system protocol,” to receive a verified, streamed speech information request from at least one authorized user employing a first legacy user protocol. The speech information request which includes spoken text and system commands is generated using a user interface capable of bi-directional communication with the system transaction manager and supporting dictation applications, including prompts to direct user dictation in response to user system protocol commands and systems transaction manager commands. A speech recognition and/or transcription engine (ASR), in communication with the systems transaction manager, receives the speech information request from the system transaction manager, generates a transcribed response, which can include a formatted transcription, and transmits the response to the system transaction manager. The system transaction manager routes the response to one or more of the users employing a second protocol, which may be the same as or different than the first protocol. In another embodiment, the system employs a virtual sound driver for streaming free form dictation to any ASR, regardless of the ASR's ability to recognize and/or transcribe spoken text from any input source such as, for example, a live microphone or line input. In another embodiment, the system employs a buffer to facilitate the system's use of ASRs requiring input data to be in batches, while providing the user with an uninterrupted, seamless dictating experience.

    Speech recognition and transcription among users having heterogeneous protocols
    6.
    发明授权
    Speech recognition and transcription among users having heterogeneous protocols 有权
    具有异构协议的用户之间的语音识别和转录

    公开(公告)号:US09142217B2

    公开(公告)日:2015-09-22

    申请号:US13928383

    申请日:2013-06-27

    Abstract: A system for facilitating free form dictation, including directed dictation and constrained recognition and/or structured transcription among users having heterogeneous protocols for generating, transcribing, and exchanging recognized and transcribed speech. The system includes a system transaction manager having a “system protocol,” to receive a speech information request from an authorized user. The speech information request is generated using a user interface capable of bi-directional communication with the system transaction manager and supporting dictation applications. A speech recognition and/or transcription engine (ASR), in communication with the system transaction manager, receives the speech information request, generates a transcribed response, and transmits the response to the system transaction manager. The system transaction manager routes the response to one or more of the users. In another embodiment, the system employs a virtual sound driver for streaming free form dictation to any ASR.

    Abstract translation: 一种用于促进自由形式听写的系统,包括在具有用于生成,转录和交换已识别和转录的语音的异构协议的用户之间的定向听写和约束识别和/或结构化转录。 该系统包括具有“系统协议”的系统事务管理器,用于从授权用户接收语音信息请求。 使用能够与系统事务管理器和支持听写应用程序进行双向通信的用户界面来生成语音信息请求。 与系统事务管理器通信的语音识别和/或转录引擎(ASR)接收语音信息请求,产生转录的响应,并将该响应发送给系统事务管理器。 系统事务管理器将响应路由到一个或多个用户。 在另一个实施例中,系统使用虚拟声音驱动器来将任何ASR的自由形式听写流。

Patent Agency Ranking