首页> 外文期刊>ACM transactions on intelligent systems and technology >Aspect-Aware Response Generation for Multimodal Dialogue System
【24h】

Aspect-Aware Response Generation for Multimodal Dialogue System

机译:多模式对话系统的方面感知响应生成

获取原文
获取原文并翻译 | 示例
           

摘要

Multimodality in dialogue systems has opened up new frontiers for the creation of robust conversational agents. Any multimodal system aims at bridging the gap between language and vision by leveraging diverse and often complementary information from image, audio, and video, as well as text. For every task-oriented dialog system, different aspects of the product or service are crucial for satisfying the user's demands. Based upon the aspect, the user decides upon selecting the product or service. The ability to generate responses with the specified aspects in a goal-oriented dialogue setup facilitates user satisfaction by fulfilling the user's goals. Therefore, in our current work, we propose the task of aspect controlled response generation in a multimodal task-oriented dialog system. We employ a multimodal hierarchical memory network for generating responses that utilize information from both text and images. As there was no readily available data for building such multimodal systems, we create a Multi-Domain Multi-Modal Dialog (MDMMD++) dataset. The dataset comprises the conversations having both text and images belonging to the four different domains, such as hotels, restaurants, electronics, and furniture. Quantitative and qualitative analysis on the newly created MDMMD++ dataset shows that the proposed methodology outperforms the baseline models for the proposed task of aspect controlled response generation.
机译:对话系统中的多模型已开辟了创建强大的会话代理的新边界。任何多模式系统都旨在通过利用图像,音频和视频以及文本来促进多样化和通常互补信息来弥合语言和愿景之间的差距。对于每个面向任务的对话系统,产品或服务的不同方面对于满足用户的需求至关重要。基于该方面,用户在选择产品或服务时决定。通过在面向目标的对话设置中与指定方面生成响应的能力促进了通过满足用户的目标来满足。因此,在我们当前的工作中,我们提出了在多模式任务导向的对话系统中的方面控制响应生成的任务。我们采用多模式分层存储网络,用于生成利用文本和图像信息的响应。由于没有用于构建此类多模式系统的可用数据,因此我们创建了一个多域多模态对话框(MDMMD ++)数据集。该数据集包括具有属于四个不同域的文本和图像的对话,例如酒店,餐馆,电子设备和家具。关于新创建的MDMMD ++数据集的定量和定性分析表明,所提出的方法优于基线模型,以了解方面控制响应生成的建议任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号