Multi-party human-robot interaction with distant-talking speech recognition

Abstract

Speech is one of the most natural media for human communication, which makes it vital to human-robot interaction. In the real environments where robots are deployed, distant-talking speech recognition is difficult to realize because of reverberation. Reverberation degrades speech recognition and understanding, and hinders seamless human-robot interaction. To mitigate this problem, traditional speech enhancement techniques optimized for human perception are adopted to achieve robustness in human-robot interaction. However, humans and machines perceive speech differently: an improvement in speech recognition performance may not automatically translate to an improvement in the human-robot interaction experience (as perceived by the users). In this paper, we propose a method for optimizing speech enhancement techniques specifically to improve automatic speech recognition (ASR), with emphasis on the human-robot interaction experience. Experimental results using real reverberant data in a multi-party conversation show that the proposed method improved the human-robot interaction experience in severely reverberant conditions compared to the traditional techniques.

Bibliographic record

Source
Proceedings of the Seventh Annual ACM/IEEE International Conference on Human-Robot Interaction | 2012 | pp. 439-446 | 8 pages
Conference location: Boston (MA)
Authors
Gomez Randy; Nakamura Keisuke; Kawahara Tatsuya; Nakadai Kazuhiro
Author affiliations

Academic Center for Computing and Media Studies, Kyoto University, Sakyo-ku, Kyoto, Japan

Original format: PDF
Language: English
Chinese Library Classification: Artificial intelligence theory
Date added to database: 2022-08-26 14:06:11
