首页> 外文会议>International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics >Review of Ideal Binary and Ratio Mask Estimation Techniques for Monaural Speech Separation
【24h】

Review of Ideal Binary and Ratio Mask Estimation Techniques for Monaural Speech Separation

机译:单声道语音分离的理想二进制和比率掩码估计技术综述

获取原文

摘要

Monaural speech separation is the process of separating the target speech from a noisy speech mixture recorded using single microphone. It can be used in wide range of applications including mobile telephony, hearing aid design and robust automatic speech and speaker recognition (ASR). Recently, researchers use computational auditory scene analysis (CASA) technique to successfully separate the target speech from the monaural noisy speech mixture. In CASA based monaural speech separation techniques, Ideal binary mask (IBM) and Ideal ratio mask (IRM) has been proposed as a computational goal to improve the speech intelligibility and speech quality. This paper reviews and reports various research works carried out using CASA techniques with IBM and IRM to improve speech intelligibility and quality. The experimental results show that CASA systems using IBM improves the speech intelligibility and using IRM improves the speech quality.
机译:单声道语音分离是将目标语音与使用单个麦克风录制的嘈杂语音混合分离出来的过程。它可以用于广泛的应用中,包括移动电话,助听器设计以及强大的自动语音和说话者识别(ASR)。最近,研究人员使用计算听觉场景分析(CASA)技术成功地将目标语音与单声道嘈杂语音混合分离出来。在基于CASA的单声道语音分离技术中,已经提出了理想二进制掩码(IBM)和理想比率掩码(IRM)作为提高语音清晰度和语音质量的计算目标。本文回顾并报告了使用CASA技术与IBM和IRM进行的各种研究工作,以提高语音清晰度和质量。实验结果表明,使用IBM的CASA系统可以提高语音清晰度,使用IRM可以提高语音质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号