首页> 外文期刊>Circuits, systems, and signal processing >Improving Speech Intelligibility in Monaural Segregation System by Fusing Voiced and Unvoiced Speech Segments
【24h】

Improving Speech Intelligibility in Monaural Segregation System by Fusing Voiced and Unvoiced Speech Segments

机译:通过融合浊音和清音言论改善单声道隔离系统的语音清晰度

获取原文
获取原文并翻译 | 示例

摘要

Improving the speech intelligibility remains a challenging problem in digital hearing aids. This research work proposes a new speech segregation algorithm to improve the speech intelligibility by effectively fusing the voiced and unvoiced segment of the speech signal using the genetic algorithm. The voiced speech segments are obtained using perceptual speech cues such as auto-correlation, cross-channel correlation and pitch. Similarly, the unvoiced speech segments are obtained using another perceptual speech cue onset/offset after subtracting the voiced segments. The speech onset- and offset-based segregation process actually produce segments for both voiced and unvoiced. The unvoiced speech segments are obtained by subtracting the voiced speech segments from the segments obtained using speech onset and offset. The unvoiced speech segments obtained using onset and offset may contain interference. This research work proposes a scheme to remove those interferences from the unvoiced speech segments and effectively fuse the segments of voiced and unvoiced speech using the genetic algorithm. The performance of the proposed algorithm is evaluated using the intelligibility measures such as CSII, NCM and STOI. The experimental results show that the proposed algorithm significantly improves the speech intelligibility with an average of 0.23 for CSII, 0.20 for NCM and 0.16 for STOI as compared with other existing systems.
机译:改善语音识别性仍然是数字助听器的具有挑战性问题。该研究工作提出了一种新的语音分离算法,通过有效地利用遗传算法有效地融合语音信号的浊音和清晰的段来提高语音可懂度。使用感知语音提示获得浊音语音段,例如自动相关,交叉通道相关和间距。类似地,在减去声音段后,使用另一感知语音提示发作/偏移来获得发音段。语音遗传和基于偏移的分离过程实际上为浊音和清音产生段。通过从使用语音发作和偏移获得的段中减去浊音语音段来获得清音语音段。使用发作和偏移获得的发音段可能包含干扰。该研究工作提出了一种计划,以便使用遗传算法有效地融合浊音和清音言论的段。使用CSII,NCM和STOI等可用性措施来评估所提出的算法的性能。实验结果表明,该算法显着提高了CSII的平均值0.23的语音可懂度,与其他现有系统相比,STOI为0.20,为0.16。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号