首页> 外文会议>Cognitive computing - ICCC 2018 >Localized Mandarin Speech Synthesis Services for Enterprise Scenarios
【24h】

Localized Mandarin Speech Synthesis Services for Enterprise Scenarios

机译:针对企业场景的本地化普通话语音合成服务

获取原文
获取原文并翻译 | 示例

摘要

Speech interaction systems have been gaining popularity in recent years. For these systems, the performance of speech synthesis has become a key factor to determine quality of service (QoS) and user experience in real-world speech interaction systems. How to improve the efficiency of speech synthesis has become a hot topic and represents one of the main streams in specific scenarios of human-computer interactions. In this paper, we propose a low-latency hidden Markov model (HMM)-based localized Mandarin speech synthesis architecture which uses a shared global variance for all the Gaussian mixture models (GMMs). Through this strategy, the memory consumption for loading the acoustic model has been reduced greatly. We also encapsulate the speech synthesis as a service using epoll mechanism so that the synthesis engine can be initialized by preloading the text analysis model and acoustic model, and can be invoked by multiple processes simultaneously, thus further improving the efficiency of speech synthesis. Experimental results demonstrate that our proposed method can significantly reduce the time latency while maintaining voice quality of synthesized speeches.
机译:近年来,语音交互系统已经越来越流行。对于这些系统,语音合成的性能已成为确定实际语音交互系统中的服务质量(QoS)和用户体验的关键因素。如何提高语音合成的效率已经成为一个热门话题,并且代表了人机交互特定场景下的主流之一。在本文中,我们提出了一种基于低延迟隐马尔可夫模型(HMM)的本地化普通话语音合成体系结构,该体系结构对所有高斯混合模型(GMM)使用共享的全局方差。通过这种策略,大大减少了加载声学模型的内存消耗。我们还使用epoll机制将语音合成封装为服务,以便可以通过预加载文本分析模型和声学模型来初始化合成引擎,并且可以同时通过多个进程调用合成引擎,从而进一步提高了语音合成的效率。实验结果表明,我们提出的方法可以在保持合成语音的语音质量的同时显着减少时间延迟。

著录项

  • 来源
    《Cognitive computing - ICCC 2018》|2018年|130-143|共14页
  • 会议地点 Seattle(US)
  • 作者单位

    Research Institute of Web Information, Tsinghua University, Beijing, China,National Engineering Research Center for Supporting Software of Enterprise Internet Services, Hong Kong, China,Kingdee Research, Kingdee International Software Group Company Limited, Shenzhen, China;

    National Engineering Research Center for Supporting Software of Enterprise Internet Services, Hong Kong, China,Kingdee Research, Kingdee International Software Group Company Limited, Shenzhen, China;

    Research Institute of Web Information, Tsinghua University, Beijing, China;

    National Engineering Research Center for Supporting Software of Enterprise Internet Services, Hong Kong, China,Kingdee Research, Kingdee International Software Group Company Limited, Shenzhen, China;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Localized Mandarin speech synthesis; Hidden Markov model; Low latency;

    机译:本地化普通话语音合成;隐马尔可夫模型;低延迟;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号