首页> 美国卫生研究院文献>The Journal of the Acoustical Society of America >Intelligibility of naturally produced and synthesized Mandarin speech by cochlear implant listeners
【2h】

Intelligibility of naturally produced and synthesized Mandarin speech by cochlear implant listeners

机译:人工耳蜗听者自然产生和合成的普通话语音的可理解性

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Mandarin is a tonal language, and it is important to preserve lexical tone information in synthesized speech. With natural speech, Chinese cochlear implant (CI) users have difficulty perceiving voice pitch cues important for lexical tone perception; it is unclear whether this difficulty persists in Mandarin synthesized speech. In this study, intelligibility of naturally produced and synthesized Mandarin speech was measured in Chinese CI listeners; intelligibility was also measured in a control group of normal-hearing (NH) listeners. Five synthesized voices were selected to represent different talker genders (male, female, child), speaking rates (normal, slow), and speaking styles (emotional, accent). The data showed that while modern Mandarin text-to-speech (TTS) systems can provide perfect speech intelligibility for NH listeners, overall intelligibility was much poorer for CI than for NH listeners. CI performance was significantly poorer with synthesized speech than with natural speech (p < 0.001). CI listeners were highly sensitive to the “extra-atypical” synthesized emotional and accented speech. Performance with each of the synthesized speech types was significantly correlated with performance with natural speech in CI users (p < 0.01 in all cases). While modern TTS systems offer educational and communication benefits to CI users and hearing-impaired individuals, the selection of synthesized voices should be carefully considered in education applications of TTS for hearing-impaired individuals, especially CI children, since poor intelligibility performance may affect language learning.
机译:普通话是一种声调语言,在合成语音中保留词汇音调信息非常重要。使用自然语音时,中国人工耳蜗(CI)用户难以感知对于词汇音调感知很重要的语音提示。目前尚不清楚这种困难在普通话合成语音中是否仍然存在。在这项研究中,对中文CI听众测量了自然产生和合成的普通话的清晰度。在正常听(NH)听者的对照组中也测量了清晰度。选择了五种合成声音来代表不同的说话者性别(男性,女性,儿童),说话速度(正常,慢音)和说话风格(情绪,口音)。数据显示,尽管现代普通话文本语音转换(TTS)系统可以为NH听众提供完美的语音清晰度,但CI的整体清晰度比NH听众差得多。合成语音的CI性能显着低于自然语音(p <0.001)。 CI听众对“非典型”合成情感和口音非常敏感。在CI用户中,每种合成语音的性能与自然语音的性能显着相关(在所有情况下,p <0.01)。尽管现代的TTS系统为CI用户和听力障碍者提供了教育和交流上的好处,但在TTS针对听力障碍者(尤其是CI儿童)的教育应用中,应仔细考虑合成语音的选择,因为较差的清晰度可能会影响语言学习。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号