Journal: Ear and Hearing

Effect of speaking rate on recognition of synthetic and natural speech by normal-hearing and cochlear implant listeners


Abstract

OBJECTIVE: Most studies have evaluated cochlear implant (CI) performance using "clear" speech materials, which are highly intelligible and well articulated. CI users may encounter much greater variability in speech patterns in the "real world," including synthetic speech. In this study, the authors measured sentence recognition with multiple talkers and speaking rates, and with naturally produced and synthetic speech, in listeners with normal hearing (NH) and in CI users.

DESIGN: NH and CI subjects were asked to recognize naturally produced or synthetic sentences presented at a slow, normal, or fast speaking rate. Natural speech was produced by one male and one female talker; synthetic speech was generated to simulate a male and a female talker. For natural speech, the speaking rate was changed by time-scaling the recordings while preserving voice pitch and formant frequency information. For synthetic speech, the speaking rate was adjusted within the speech synthesis engine. NH subjects were tested while listening to unprocessed speech or to an eight-channel acoustic CI simulation. CI subjects were tested while listening with their clinical processors at the recommended microphone sensitivity and volume settings.

RESULTS: The NH group performed significantly better than the CI-simulation group, and the CI-simulation group performed significantly better than the CI group. For all subject groups, sentence recognition was significantly better with natural speech than with synthetic speech. The performance deficit with synthetic speech was relatively small for NH subjects listening to unprocessed speech, but it was much greater for CI subjects and for CI-simulation subjects. There was a significant effect of talker gender, with slightly better performance with the female talker for CI subjects and slightly better performance with the male talker for the CI simulations. For all subject groups, sentence recognition was significantly poorer only at the fast rate. CI performance was very poor (approximately 10% correct) at the fast rate.

CONCLUSIONS: CI listeners are susceptible to variability in speech patterns caused by speaking rate and production style (natural versus synthetic). CI performance with clear speech materials may overestimate performance in real-world listening conditions. The poorer CI performance may be due to factors other than reduced spectro-temporal resolution, such as the quality of electric stimulation, duration of deafness, or cortical processing. Optimizing the input or training may improve CI users' tolerance for variability in speech patterns.
