首页> 外文会议>International Conference on Web Engineering >VISH: Does Your Smart Home Dialogue System Also Need Training Data?
【24h】

VISH: Does Your Smart Home Dialogue System Also Need Training Data?

机译:VISH:您的智能家居对话系统是否还需要培训数据?

获取原文

摘要

The main objective of smart homes is to improve the quality of life and comfort of their inhabitants through automation systems and ambient intelligence. Voice-based interaction like dialogue systems is the current emerging trend in these systems. Natural Language Understanding (NLU) model can identify the end-users' intentions in the utterances provided to spoken dialogue systems. The utility of dialogue systems is reliant on the quality of NLU models, which is in turn significantly dependent on the availability of a high-quality and sufficiently large corpus for training, containing diverse utterance structures. However, building such corpora is a complex task even for companies possessing significant human and infrastructure resources. On the other hand, the existing corpora for the smart home domain are either concerned with web services, focus on direct goals only, follow static command structure, or are not publicly available in English language which limits the development of goal-oriented dialogue systems for smart homes. In this paper, we propose a generic method to create training data for the NLU component using a generative grammar-based approach. Our method outputs, Voice Interaction in Smart Home (VISH) dataset consisting of five million unique utterances for the smart home. This dataset can greatly facilitate research in the area of voice-based dialogue systems for smart homes. We evaluate the approach by using VISH to train several state-of-the-art NLU models. Our experiment results demonstrate the capability of the corpus to support the development of goal-oriented voice-based dialogue systems in the context of smart homes.
机译:智能家居的主要目标是通过自动化系统和环境智能来改善居民的生活质量和舒适度。诸如对话系统之类的基于语音的交互是这些系统中的当前新兴趋势。自然语言理解(NLU)模型可以通过提供给语音对话系统的语音识别最终用户的意图。对话系统的实用性依赖于NLU模型的质量,而NLU模型的质量又很大程度上取决于高质量和足够大的语料库的可用性,该语料库包含各种发声结构。但是,即使对于拥有大量人力和基础设施资源的公司而言,建立这样的语料库也是一项复杂的任务。另一方面,用于智能家居领域的现有语料库要么与Web服务有关,仅关注直接目标,遵循静态命令结构,要么无法公开使用英语,这限制了面向目标的对话系统的开发。智能家居。在本文中,我们提出了一种通用的方法,该方法使用基于生成语法的方法为NLU组件创建训练数据。我们的方法输出是“智能家居中的语音交互”(VISH)数据集,其中包含500万个智能家居独特的话语。该数据集可以极大地促进智能家居基于语音的对话系统领域的研究。我们通过使用VISH训练几种最新的NLU模型来评估该方法。我们的实验结果证明了语料库在智能家居环境中支持基于目标的基于语音的对话系统开发的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号