首页> 外文会议>IEEE/RSJ International Conference on Intelligent Robots and Systems >Assessment of general applicability of robot audition system by recognizing three simultaneous speeches
【24h】

Assessment of general applicability of robot audition system by recognizing three simultaneous speeches

机译:通过识别三次同时演讲评估机器人试镜系统的一般适用性

获取原文

摘要

Robot audition is a critical technology in creating an intelligent robot operating in daily environments. We have developed such a robot audition system by using a new interface between sound source separation and automatic speech recognition (ASR). A mixture of speeches captured with a pair of microphones installed in the ear positions of a humanoid is separated into each speech by using active direction-pass filter (ADPF). The ADPF extracts a sound source originating from a specific direction in real-time by using interaural phase and intensity differences. The separated speech is recognized by a speech recognizer based on the missing feature theory (MFT). By using a missing feature mask, the MFT based ASR neglects distorted and missing features caused during the speech separation. A missing feature mask for each separated speech is generated in speech separation and is sent to the ASR with the separated speech. Thus, this new integration improves the performance of ASR. However, the generality of this robot audition system has not been assessed so far. In this paper, we assess its general applicability by implementing it on the three humanoids, i.e., ASIMO of Honda, SIG2, and Replie of Kyoto University. By using three simultaneous speeches as benchmarks, the robot audition system improved the performance of ASR over 50% in every humanoid, and thus its general applicability was confirmed.
机译:机器人试镜是在日常环境中创造智能机器人的关键技术。我们通过使用声源分离和自动语音识别(ASR)之间的新界面开发了这样一个机器人试镜。通过使用主动方向通滤光器(ADPF)将用一对麦克风捕获的混合用一对麦克风捕获的麦克风分离为每个语音。通过使用互连和强度差异,ADPF在实时提取源自特定方向的声源。基于缺失的特征理论(MFT),由语音识别器识别分离的语音。通过使用缺少的功能掩码,基于MFT的ASR忽略了语音分离期间引起的扭曲和缺失的功能。在语音分离中生成每个分隔语音的缺失的功能掩码,并通过分离的语音发送到ASR。因此,这种新的集成可以提高ASR的性能。然而,到目前为止尚未评估该机器人试镜系统的一般性。在本文中,我们通过在三个人型器,即Honda,Sig2,Sig2和京都大学的副本中实施普遍适用性。通过使用三个同时演讲作为基准,机器人试听系统在每个人形方向上提高了ASR超过50%的性能,因此确认了其一般适用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号