首页> 外文会议>Proceedings of the Seventh Annual ACM/IEEE International Conference on Human-Robot Interaction >Multi-party human-robot interaction with distant-talking speech recognition
【24h】

Multi-party human-robot interaction with distant-talking speech recognition

机译:多方人机交互与远距离语音识别

获取原文
获取原文并翻译 | 示例

摘要

Speech is one of the most natural medium for human communication, which makes it vital to human-robot interaction. In real environments where robots are deployed, distant-talking speech recognition is difficult to realize due to the effects of reverberation. This leads to the degradation of speech recognition and understanding, and hinders a seamless human-robot interaction. To minimize this problem, traditional speech enhancement techniques optimized for human perception are adopted to achieve robustness in humanrobot interaction. However, human and machine perceive speech differently: An improvement in speech recognition performance may not automatically translate to an improvement in human-robot interaction experience (as perceived by the users). In this paper, we propose a method in optimizing speech enhancement techniques specifically to improve automatic speech recognition (ASR) with emphasis on the human-robot interaction experience. Experimental results using real reverberant data in a multi-party conversation, show that the proposed method improved human-robot interaction experience in severe reverberant conditions compared to the traditional techniques.
机译:语音是人类交流的最自然的媒介之一,这使其对人机交互至关重要。在部署机器人的实际环境中,由于混响的影响,难以实现远距离语音识别。这导致语音识别和理解能力下降,并阻碍了人机之间的无缝交互。为了最小化此问题,采用了针对人类感知优化的传统语音增强技术,以实现人机交互的鲁棒性。但是,人与机器对语音的感知有所不同:语音识别性能的提高可能不会自动转换为人机交互体验的改进(如用户所感知)。在本文中,我们提出了一种优化语音增强技术的方法,专门用于改进自动语音识别(ASR),重点是人机交互体验。在多方对话中使用真实混响数据的实验结果表明,与传统技术相比,该方法改善了在严重混响条件下的人机交互体验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号