首页> 外文会议>IEEE/RSJ International Conference on Intelligent Robots and Systems >Missing-Feature based Speech Recognition for Two Simultaneous Speech Signals Separated by ICA with a pair of Humanoid Ears
【24h】

Missing-Feature based Speech Recognition for Two Simultaneous Speech Signals Separated by ICA with a pair of Humanoid Ears

机译:基于ICA分开的两个同时语音信号的语音识别与一对人形耳朵分开

获取原文

摘要

Robot audition is a critical technology in making robots symbiosis with people. Since we hear a mixture of sounds in our daily lives, sound source localization and separation, and recognition of separated sounds are three essential capabilities. Sound source localization has been recently studied well for robots, while the other capabilities still need extensive studies. This paper reports the robot audition system with a pair of omni-directional microphones embedded in a humanoid to recognize two simultaneous talkers. It first separates sound sources by Independent Component Analysis (ICA) with single-input multiple-output (SIMO) model. Then, spectral distortion for separated sounds is estimated to identify reliable and unreliable components of the spectrogram. This estimation generates the missing feature masks as spectrographic masks. These masks are then used to avoid influences caused by spectral distortion in automatic speech recognition based on missing-feature method. The novel ideas of our system reside in estimates of spectral distortion of temporal-frequency domain in terms of feature vectors. In addition, we point out that the voice-activity detection (VAD) is effective to overcome the weak point of ICA against the changing number of talkers. The resulting system outperformed the baseline robot audition system by 15 percent.
机译:机器人试镜是一种与人民制作机器人共生的关键技术。由于我们在日常生活中听到声音的混合,声音源定位和分离,并且分离声音的识别是三个必不可少的能力。最近对机器人进行了很好的研究源定位,而其他能力仍需要广泛的研究。本文报告了机器人试镜系统,其中嵌入在人形方形中的一对全方位麦克风以识别两个同步讲话者。它首先通过独立分量分析(ICA)分离声源,单输入多输出(SIMO)模型。然后,估计分离声音的光谱失真以识别频谱图的可靠和不可靠的组件。此估计生成丢失的特征掩码作为光谱掩模。然后使用这些掩模来避免基于缺失特征方法在自动语音识别中引起的影响。我们的系统的新颖思想驻留在特征向量方面的时间频域的光谱失真估计。此外,我们指出,语音活动检测(VAD)有效地克服ICA的弱点,而不是变化的谈话者。由此产生的系统优于基线机器人试听系统15%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号