...
首页> 外文期刊>The Journal of the Acoustical Society of America >Recognition of spectrally asynchronous speech by normal-hearing listeners and Nucleus-22 cochlear implant users
【24h】

Recognition of spectrally asynchronous speech by normal-hearing listeners and Nucleus-22 cochlear implant users

机译:正常听众和Nucleus-22人工耳蜗使用者对频谱异步语音的识别

获取原文
获取原文并翻译 | 示例
           

摘要

This experiment examined the effects of spectral resolution and fine spectral structure on recognition of spectrally asynchronous sentences by normal-hearing and cochlear implant listeners. Sentence recognition was measured in six normal-hearing subjects listening to either full-spectrum or noise-band processors and five Nucleus-22 cochlear implant listeners fitted with 4-channel continuous interleaved sampling (CIS) processors. For the full-spectrum processor, the speech signals were divided into either 4 or 16 channels. For the noise-band processor, after band-pass filtering into 4 or 16 channels, the envelope of each channel was extracted and used to modulate noise of the same bandwidth as the analysis band, thus eliminating the fine spectral structure available in the full-spectrum processor. For the 4-channel CIS processor, the amplitude envelopes extracted from four bands were transformed to electric currents by a power function and the resulting electric currents were used to modulate pulse trains delivered to four electrode pairs. For all processors, the output of each channel was time-shifted relative to other channels, varying the channel delay across channels from 0 to 240 ms (in 40-ms steps). Within each delay condition, all channels were desynchronized such that the cross-channel delays between adjacent channels were maximized, thereby avoiding local pockets of channel synchrony. Results show no significant difference between the 4- and 16-channel full-spectrum speech processor for normal-hearing listeners. Recognition scores dropped significantly only when the maximum delay reached 200 ms for the 4-channel processor and 240 ms for the 16-channel processor. When fine spectral structure were removed in the noise-band processor, sentence recognition dropped significantly when the maximum delay was 160 ms for the 16-channel noise-band processor and 40 ms for the 4-channel noise-band processor. There was no significant difference between implant listeners using the 4-channel CIS processor and normal-hearing listeners using the 4-channel noise-band processor. The results imply that when fine spectral structures are not available, as in the implant listener's case, increased spectral resolution is important for overcoming cross-channel asynchrony in speech signals.
机译:该实验检查了频谱分辨率和精细频谱结构对正常听觉和人工耳蜗听众识别频谱异步句子的影响。对六名听力正常且听觉全谱或噪声频带处理器的受试者和五名配备4通道连续交错采样(CIS)处理器的Nucleus-22人工耳蜗监听器进行了句子识别。对于全谱处理器,语音信号被分为4或16个通道。对于噪声带处理器,在对4或16个通道进行带通滤波后,每个通道的包络被提取出来并用于调制与分析带相同带宽的噪声,从而消除了全频段中可用的精细频谱结构。频谱处理器。对于4通道CIS处理器,从四个频段提取的幅度包络通过幂函数转换为电流,所得电流用于调制输送到四个电极对的脉冲序列。对于所有处理器,每个通道的输出都相对于其他通道进行时移,从而将通道之间的通道延迟从0更改为240毫秒(以40毫秒为步长)。在每个延迟条件内,所有通道均不同步,从而使相邻通道之间的跨通道延迟最大化,从而避免了本地通道的通道同步。结果表明,正常听力的4通道和16通道全谱语音处理器之间没有显着差异。仅当最大延迟达到4通道处理器的200 ms和16通道处理器的240 ms时,识别分数才会显着下降。如果在噪声带处理器中删除了精细的频谱结构,则当16通道噪声带处理器的最大延迟为160 ms,而4通道噪声带处理器的最大延迟为40 ms时,句子识别会明显下降。使用4通道CIS处理器的植入式监听器与使用4通道噪声带处理器的正常听力式监听器之间没有显着差异。结果表明,当没有精细的频谱结构时(如在植入式监听器中的情况),提高频谱分辨率对于克服语音信号中的跨通道异步非常重要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号