Effect of speaking rate on recognition of synthetic and natural speech by normal-hearing and cochlear implant listeners

Ji, C.; Galvin, J. J.; Xu, A.; Fu, Q.-J.

Abstract

OBJECTIVE: Most studies have evaluated cochlear implant (CI) performance using "clear" speech materials, which are highly intelligible and well articulated. CI users may encounter much greater variability in speech patterns in the "real world," including synthetic speech. In this study, the authors measured sentence recognition with multiple talkers and speaking rates, and with naturally produced and synthetic speech, in listeners with normal hearing (NH) and CIs.

DESIGN: NH and CI subjects were asked to recognize naturally produced or synthetic sentences, presented at a slow, normal, or fast speaking rate. Natural speech was produced by one male and one female talker; synthetic speech was generated to simulate a male and a female talker. For natural speech, the speaking rate was time-scaled while preserving voice pitch and formant frequency information. For synthetic speech, the speaking rate was adjusted within the speech synthesis engine. NH subjects were tested while listening to unprocessed speech or to an eight-channel acoustic CI simulation. CI subjects were tested while listening with their clinical processors and the recommended microphone sensitivity and volume settings.

RESULTS: The NH group performed significantly better than did the CI-simulation group, and the CI-simulation group performed significantly better than did the CI group. For all subject groups, sentence recognition was significantly better with natural speech than with synthetic speech. The performance deficit with synthetic speech was relatively small for NH subjects listening to unprocessed speech. However, the performance deficit with synthetic speech was much greater for CI subjects and for CI-simulation subjects. There was a significant effect of talker gender, with slightly better performance with the female talker for CI subjects and slightly better performance with the male talker for the CI simulations. For all subject groups, sentence recognition was significantly poorer only at the fast rate. CI performance was very poor (approximately 10% correct) at the fast rate.

CONCLUSIONS: CI listeners are susceptible to variability in speech patterns caused by speaking rate and production style (natural versus synthetic). CI performance with clear speech materials may overestimate performance in real-world listening conditions. The poorer CI performance may be due to factors other than reduced spectro-temporal resolution, such as the quality of electric stimulation, duration of deafness, or cortical processing. Optimizing the input or training may improve CI users' tolerance for variability in speech patterns.

Bibliographic details

Source: Ear and Hearing, 2013, No. 3, 11 pages
Authors: Ji, C.; Galvin, J. J.; Xu, A.; Fu, Q.-J.
Original format: PDF
Language: English
CLC classification: Otorhinolaryngology
