首页> 外文期刊>The Journal of the Acoustical Society of America >Intelligibility of naturally produced and synthesized Mandarin speech by cochlear implant listeners
【24h】

Intelligibility of naturally produced and synthesized Mandarin speech by cochlear implant listeners

机译:Cochlear植入听众自然产生和合成普通话的可懂度

获取原文
获取原文并翻译 | 示例
       

摘要

Mandarin is a tonal language, and it is important to preserve lexical tone information in synthesized speech. With natural speech, Chinese cochlear implant (CI) users have difficulty perceiving voice pitch cues important for lexical tone perception; it is unclear whether this difficulty persists in Mandarin synthesized speech. In this study, intelligibility of naturally produced and synthesized Mandarin speech was measured in Chinese CI listeners; intelligibility was also measured in a control group of normal-hearing (NH) listeners. Five synthesized voices were selected to represent different talker genders (male, female, child), speaking rates (normal, slow), and speaking styles (emotional, accent). The data showed that while modern Mandarin text-to-speech (TTS) systems can provide perfect speech intelligibility for NH listeners, overall intelligibility was much poorer for CI than for NH listeners. CI performance was significantly poorer with synthesized speech than with natural speech (p 0.001). CI listeners were highly sensitive to the "extra-atypical" synthesized emotional and accented speech. Performance with each of the synthesized speech types was significantly correlated with performance with natural speech in CI users (p 0.01 in all cases). While modern TTS systems offer educational and communication benefits to CI users and hearing-impaired individuals, the selection of synthesized voices should be carefully considered in education applications of TTS for hearing-impaired individuals, especially CI children, since poor intelligibility performance may affect language learning. (C) 2018 Acoustical Society of America.
机译:普通话是一种音调语言,并且在合成演讲中保存词法语气信息非常重要。通过自然演讲,中文耳蜗植入物(CI)用户难以感知语音音调对词汇音调感知很重要;目前尚不清楚这种难度是否持续普通话综合演讲。在这项研究中,在中国CI听众中测量了天然生产和合成的普通话语音的可理解性;在正常听觉(NH)听众的对照组中也测量了可懂度。选择五个合成的声音代表不同的谈话者的性别(男性,女性,儿童),发言(正常,慢)和说话方式(情绪化,口音)。数据显示,虽然现代普通话文本语音(TTS)系统可以为NH侦听器提供完美的语音清晰度,但总体上可理解性比NH听众更差。 CI性能明显较差,合成语音比自然语音(P <0.001)。 CI侦听器对“额外非典型”合成的情感和令人沮丧的演讲非常敏感。与每个合成语音类型的性能与CI用户中的自然语音的性能显着相关(在所有情况下P <0.01)。虽然现代TTS系统为CI用户和听力受损的个人提供教育和沟通好处,但应在听力受损个人的TTS的教育应用中仔细考虑合成声音的选择,尤其是CI儿童,因为无能性绩效可能会影响语言学习。 (c)2018年声学学会。

著录项

  • 来源
  • 作者单位

    Capital Med Univ Beijing TongRen Hosp Dept Otolaryngol Head &

    Neck Surg Beijing 100730 Peoples R China;

    Capital Med Univ Beijing TongRen Hosp Dept Otolaryngol Head &

    Neck Surg Beijing 100730 Peoples R China;

    Capital Med Univ Beijing TongRen Hosp Dept Otolaryngol Head &

    Neck Surg Beijing 100730 Peoples R China;

    Capital Med Univ Beijing TongRen Hosp Dept Otolaryngol Head &

    Neck Surg Beijing 100730 Peoples R China;

    Capital Med Univ Beijing TongRen Hosp Dept Otolaryngol Head &

    Neck Surg Beijing 100730 Peoples R China;

    House Ear Res Inst Los Angeles CA 90057 USA;

    Univ Calif Los Angeles David Geffen Sch Med Dept Head &

    Neck Surg Los Angeles CA 90095 USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 声学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号