首页> 外文学位 >Speech Analysis and Single Channel Enhancement to Improve Speech Intelligibility for Cochlear Implant Recipients
【24h】

Speech Analysis and Single Channel Enhancement to Improve Speech Intelligibility for Cochlear Implant Recipients

机译:语音分析和单通道增强功能可提高人工耳蜗植入者的语音清晰度

获取原文
获取原文并翻译 | 示例

摘要

Cochlear implant (CI) devices are able to help deaf individuals recover hearing ability by surgically inserting electrode arrays into the inner ear, to stimulate the auditory nerve and transmit the sound to the auditory cortex in the brain. CI listeners achieve high speech intelligibility in quiet environments, however their speech perception degrades dramatically when presented in noisy backgrounds. This is especially true in fluctuating noise, such as competing-speaker or babble noise, where CI users have difficulties in speech understanding. One of the reasons is that low spectral resolution provided by CI encoding strategies is insufficient to distinguish speech components from noise. In this dissertation, we propose a new speech enhancement solution to improve speech intelligibility for CI recipients in noise.;Speech energy is primarily carried in the harmonic structure located at multiple integer harmonics of the fundamental frequency. In order to combat noise, we propose to use harmonic structure as the frequency domain cues to estimate the degraded noise. The proposed speech enhancement is based on harmonic structure estimation combined with a traditional statistical based leveraged solution. This dissertation has investigated robust fundamental frequency estimation in noise, along with integrating such novel in formulate to improve harmonic based speech enhancement in both stationary and non-stationary noise scenarios.;Noise-robust pitch estimation is proposed based on temporal harmonic structure in local time-frequency (TF) segments. To reduce the noise affect, we take advantage of the sparsity of speech signal that only the high signal to noise ratio (SNR) TF segments are used for pitch estimation. Robust harmonic features are investigated for neural network classification based pitch estimation. The harmonic features map the pitch candidates into a more separable space for classification. Experimental results show that the proposed pitch estimation method improves global pitch error in noise. Next, harmonic structure estimation is combined with the traditional statistical based method to speech enhancement. Noise estimation is performed based on harmonic structures. The estimated noise variance is employed in a traditional MMSE framework for a prior and posterior SNR estimation to obtain a gain function for the target speech. Listening experiments with CI subjects demonstrated improved speech intelligibility for both stationary and non-stationary noise.
机译:耳蜗植入(CI)装置能够通过外科手术将电极阵列插入内耳来帮助聋人恢复听力,从而刺激听觉神经并将声音传输到大脑的听觉皮层。 CI聆听者在安静的环境中可实现较高的语音清晰度,但是在嘈杂的背景中呈现时,其语音感知能力会急剧下降。这在CI用户难以理解语音的波动噪声(例如,竞争性扬声器或ba不休的噪声)中尤其如此。原因之一是CI编码策略提供的低频谱分辨率不足以区分语音分量和噪声。本文提出了一种新的语音增强解决方案,用以提高噪声中CI接收者的语音清晰度。语音能量主要携带在基频的多个整数谐波处的谐波结构中。为了对抗噪声,我们建议使用谐波结构作为频域提示来估计降级的噪声。所提出的语音增强是基于谐波结构估计与传统的基于统计的杠杆解决方案相结合的。本文研究了噪声中鲁棒的基频估计,并结合了这种新颖的公式,以改善在平稳和非平稳噪声情况下基于谐波的语音增强。;提出了基于时域谐波结构的时域噪声鲁棒基音估计-频率(TF)段。为了减少噪声影响,我们利用语音信号的稀疏性,即仅将高信噪比(SNR)TF段用于音调估计。研究了基于神经网络分类的基音估计的鲁棒谐波特征。谐波特征将候选音调映射到更可分离的空间中进行分类。实验结果表明,提出的基音估计方法可以改善噪声中的整体基音误差。接下来,谐波结构估计与传统的基于统计的方法相结合来增强语音。噪声估计是基于谐波结构执行的。估计的噪声方差在传统的MMSE框架中用于先验和后验SNR估计,以获得目标语音的增益函数。 CI主体的听力实验表明,对于固定噪声和非固定噪声,语音清晰度都得到了改善。

著录项

  • 作者

    Wang, Dongmei.;

  • 作者单位

    The University of Texas at Dallas.;

  • 授予单位 The University of Texas at Dallas.;
  • 学科 Biomedical engineering.;Electrical engineering.
  • 学位 Ph.D.
  • 年度 2017
  • 页码 123 p.
  • 总页数 123
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 康复医学;
  • 关键词

  • 入库时间 2022-08-17 11:54:31

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号