System and method for a dialogue response generation system

Invention Grant

US11264009B2 System and method for a dialogue response generation system (Active)

Patent Title: System and method for a dialogue response generation system
Application No.: US16569679

Application Date: 2019-09-13
Publication No.: US11264009B2

Publication Date: 2022-03-01
Inventor: Chiori Hori, Anoop Cherian, Tim Marks, Takaaki Hori
Applicant: Mitsubishi Electric Research Laboratories, Inc.
Applicant Address: US MA Cambridge
Assignee: Mitsubishi Electric Research Laboratories, Inc.
Current Assignee: Mitsubishi Electric Research Laboratories, Inc.
Current Assignee Address: US MA Cambridge
Agent: Gennadiy Vinokur; Hironori Tsukamoto
Main IPC: G10L15/06
IPC: G10L15/06; G10L15/02; G10L15/22; G10L19/00

Abstract:

A computer-implemented method for training a dialogue response generation system, and the dialogue response generation system itself, are provided. The method includes arranging a first multimodal encoder-decoder for dialogue response generation or video description having a first input and a first output, wherein the first multimodal encoder-decoder has been pretrained on training audio-video datasets with training video description sentences; arranging a second multimodal encoder-decoder for dialogue response generation having a second input and a second output; providing first audio-visual datasets with first corresponding video description sentences to the first input of the first multimodal encoder-decoder, wherein the first multimodal encoder-decoder generates first output values based on the first audio-visual datasets with the first corresponding video description sentences; and providing the first audio-visual datasets, excluding the first corresponding video description sentences, to the second multimodal encoder-decoder. The second multimodal encoder-decoder then generates second output values based on the first audio-visual datasets without the first corresponding video description sentences.
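
The data routing described in the abstract can be illustrated with a minimal Python sketch. Everything in it is hypothetical and chosen only for illustration: the toy feature vectors, the one-layer `toy_encoder_decoder` stand-in, the weight values, and the function names. The final step, which compares the two outputs with a cross-entropy, is an assumption about how such outputs might be used during training; the abstract itself only states that both sets of output values are generated.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def toy_encoder_decoder(features, weights):
    """Hypothetical stand-in for a multimodal encoder-decoder:
    one linear layer followed by a softmax over a tiny vocabulary."""
    scores = [sum(w * x for w, x in zip(row, features)) for row in weights]
    return softmax(scores)

def split_inputs(sample):
    """Build the two inputs: the first includes the description features,
    the second excludes them."""
    first_input = sample["audio"] + sample["video"] + sample["description"]
    second_input = sample["audio"] + sample["video"]
    return first_input, second_input

def cross_entropy(target, predicted):
    """Assumed comparison between the two output distributions
    (not stated in the abstract)."""
    return -sum(t * math.log(p) for t, p in zip(target, predicted))

# Toy sample: made-up feature values for one audio-visual clip.
sample = {
    "audio": [0.2, 0.1],
    "video": [0.5, 0.3],
    "description": [1.0, 0.0],
}
first_input, second_input = split_inputs(sample)

# Made-up weights; a pretrained first model would supply its own.
first_weights = [
    [0.1, 0.1, 0.1, 0.1, 0.1, 0.1],
    [0.2, -0.1, 0.3, 0.0, 0.5, -0.2],
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.0],
]
second_weights = [
    [0.1, 0.1, 0.1, 0.1],
    [0.2, -0.1, 0.3, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]

first_output = toy_encoder_decoder(first_input, first_weights)
second_output = toy_encoder_decoder(second_input, second_weights)
loss = cross_entropy(first_output, second_output)

print("first output values: ", first_output)
print("second output values:", second_output)
print("assumed discrepancy: ", loss)
```

The point of the sketch is only the asymmetry of the inputs: the first model sees the description sentences alongside the audio-visual features, while the second model receives the same audio-visual features with the descriptions removed.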

Public/Granted literature

US20210082398A1 System and Method for a Dialogue Response Generation System Public/Granted day: 2021-03-18

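IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/06	.Creation of reference templates; training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice (G10L15/14 takes precedence)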