首页> 外文会议>IEEE International Symposium on Robot and Human Interactive Communication >Exploring data augmentation methods in reverberant human-robot voice communication
【24h】

Exploring data augmentation methods in reverberant human-robot voice communication

机译:探索混响人机语音通信中的数据增强方法

获取原文

摘要

Collecting training data is not an easy task especially in situation involving robots that require tremendous physical effort. The ability to augment data through synthetic means is a convenient tool to solve this problem. Therefore it is important to evaluate the extent of the usefulness of augmented data. In this paper, we will explore data augmentation schemes in reverberant environment and investigate a method to effectively select data. We experiment in a real reverberant environment condition and investigate both the traditional automatic speech recognition (ASR) system based on gaussian mixture model-hidden markov model (GMM-HMM) and the most current system based on Deep Neural Networks (i.e, HMM-DNN). Our results show that the combination of data augmentation and data selection, further improves system performance. In our experiments, we used real test data in a reverberant hands-free human-robot communication scenario.
机译:收集训练数据并不是一件容易的事,尤其是在涉及需要大量体力的机器人的情况下。通过综合手段增强数据的能力是解决此问题的便捷工具。因此,评估增强数据的有用程度很重要。在本文中,我们将探索混响环境中的数据增强方案,并研究有效选择数据的方法。我们在真实的混响环境条件下进行实验,并研究了基于高斯混合模型-隐马尔可夫模型(GMM-HMM)的传统自动语音识别(ASR)系统和基于深度神经网络(即HMM-DNN)的最新系统)。我们的结果表明,将数据扩充和数据选择相结合,可以进一步提高系统性能。在我们的实验中,我们在混响的免提人机交互场景中使用了真实的测试数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号