首页> 外文会议>2017 International Conference on Wireless Communications, Signal Processing and Networking >Frequency compression of speech for improving speech perception in sensorineural hearing loss: FBS approach
【24h】

Frequency compression of speech for improving speech perception in sensorineural hearing loss: FBS approach

机译:语音频率压缩以改善感觉神经性听力损失中的语音感知:FBS方法

获取原文
获取原文并翻译 | 示例

摘要

Filter-bank summation (FBS) method is a commonly used technique for multi-band processing of speech and audio signals, especially in digital hearing aids. In multi-band speech processing techniques, filter bank summation provides the convenient way of processing the auditory information present in different bands based on their perceptual significance. People with hearing loss have issue in perception of speech due to widening of the auditory filter bandwidth leading to increased frequency masking. Previous studies have shown that spectral splitting of speech signal for binaural dichotic presentation helps to reduce the effect of frequency masking. Also studies showed that using multiband frequency compression it is possible to compensate the effect of widened auditory filters. This paper presents a filter bank summation method to perform dichotic spectral splitting of input speech signal followed by frequency compression to enhance speech perception for hearing impaired. In the present study, the speech signal is split into eighteen frequency bands ranging from 0-5000 Hz based on auditory critical bandwidths and frequency samples of every band compressed in the direction of center of each band using spectral segment frequency mapping technique. Performance of the algorithm was evaluated using MOS test for subjective assessment of speech quality and Perceptual Evaluation of Speech Quality (PESQ) scores for objective assessment of speech quality. The results showed a significant improvement in speech quality as indicated by MOS and PESQ scores for the SNR values in the range of -6 dB to 6 dB.
机译:滤波器组求和(FBS)方法是用于语音和音频信号多频带处理的一种常用技术,尤其是在数字助听器中。在多频带语音处理技术中,滤波器组求和提供了一种基于其感知意义来处理不同频带中存在的听觉信息的便捷方法。听力受损的人由于听觉滤波器带宽的扩大导致语音掩盖,导致频率掩蔽增加。先前的研究表明,语音信号的频谱分裂用于双耳双耳的表达有助于降低频率掩蔽的影响。研究还表明,使用多频带频率压缩可以补偿加宽的听觉滤波器的影响。本文提出了一种滤波器组求和方法,对输入的语音信号进行双色频谱分裂,然后进行频率压缩,以增强听力障碍者的语音感知能力。在本研究中,语音频谱基于听觉临界带宽和使用频谱段频率映射技术在每个频带的中心方向压缩的每个频带的频率样本,分为0-5000 Hz范围内的18个频带。使用MOS测试对语音质量进行主观评估,对语音质量进行感知评估(PESQ)评分,对语音质量进行客观评估,从而评估算法的性能。结果表明,对于SNR值在-6 dB至6 dB范围内的MOS和PESQ得分,语音质量有了显着改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号