首页> 外文会议>Intelligent Robots and Systems, 2004. (IROS 2004). Proceedings. 2004 IEEE/RSJ International Conference on >Assessment of general applicability of robot audition system by recognizing three simultaneous speeches
【24h】

Assessment of general applicability of robot audition system by recognizing three simultaneous speeches

机译:通过同时识别三个语音来评估机器人试听系统的普遍适用性

获取原文

摘要

Robot audition is a critical technology in creating an intelligent robot operating in daily environments. We have developed such a robot audition system by using a new interface between sound source separation and automatic speech recognition (ASR). A mixture of speeches captured with a pair of microphones installed in the ear positions of a humanoid is separated into each speech by using active direction-pass filter (ADPF). The ADPF extracts a sound source originating from a specific direction in real-time by using interaural phase and intensity differences. The separated speech is recognized by a speech recognizer based on the missing feature theory (MFT). By using a missing feature mask, the MFT based ASR neglects distorted and missing features caused during the speech separation. A missing feature mask for each separated speech is generated in speech separation and is sent to the ASR with the separated speech. Thus, this new integration improves the performance of ASR. However, the generality of this robot audition system has not been assessed so far. In this paper, we assess its general applicability by implementing it on the three humanoids, i.e., ASIMO of Honda, SIG2, and Replie of Kyoto University. By using three simultaneous speeches as benchmarks, the robot audition system improved the performance of ASR over 50% in every humanoid, and thus its general applicability was confirmed.
机译:机器人试听是创建在日常环境中运行的智能机器人的一项关键技术。通过使用声源分离和自动语音识别(ASR)之间的新接口,我们已经开发了这样的机器人试听系统。通过使用有源方向通滤波器(ADPF),可以将用安装在人形机器人耳朵位置的一对麦克风捕获的语音混合成每个语音。 ADPF通过使用双耳相位和强度差异实时提取源自特定方向的声源。语音识别器根据缺失特征理论(MFT)识别分离出的语音。通过使用缺少的特征蒙版,基于MFT的ASR可以忽略语音分离过程中导致的失真和缺少的特征。在语音分离中生成每个分离语音的缺失特征掩码,并将其与分离语音一起发送到ASR。因此,这种新的集成提高了ASR的性能。但是,到目前为止,尚未评估该机器人试听系统的一般性。在本文中,我们通过将其应用于本田的ASIMO,本田的ASIMO,SIG2和京都大学的Replie这三个类人动物来评估其普遍适用性。通过同时使用三个语音作为基准,该机器人试听系统在每个类人动物中将ASR的性能提高了50%以上,从而确认了其普遍适用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号