【24h】

An investigation on linguistic features for Mandarin prosody generation

机译:普通话韵律产生的语言特征研究

获取原文
获取原文并翻译 | 示例

摘要

This paper seeks to investigate the usability of two fully-automatic machine-extracted linguistic features from an unlimited text input, in a prosody generation of Mandarin text-to-speech system (MTTS). One is the base-phrase chunk feature, labeled by a conditional random field (CRF)-based base-phrase chunker. Another is the punctuation confidence (PC), calculated for each lexical word (LW) boundary from input text tagged with Chinese word boundaries, part of speech (POS) and base-phrase chunk, measuring the likelihood of inserting a punctuation mark (PM) at a word boundary. Owing to the fact that a PM in text is highly correlated with a prosodic break, and base-phrases play an important role in human language understanding, the two features potentially could provide useful information for prosody generation. To examine potential usefulness of the proposed linguistic features, the performances of neural network-based prosody generator - with and without the proposed features - were evaluated. Both objective and subjective tests showed that the prosody generator with the proposed linguistic features performed better than the one without the proposed features. So the proposed PC and base-phrase chunking information are promising features for Mandarin prosody generation.
机译:本文旨在研究韵文生成的普通话语音合成系统(MTTS)中从无限文本输入中提取的两种全自动机器提取的语言功能的可用性。一种是基本短语组块功能,由基于条件随机字段(CRF)的基本短语组块器标记。另一个是标点符号置信度(PC),它是根据标记有中文单词边界,词性(POS)和基本短语块的输入文本中的每个词法(LW)边界计算出来的,测量插入标点符号(PM)的可能性在单词边界。由于文本中的PM与韵律中断高度相关,并且基本短语在人类语言理解中起着重要作用,因此这两个功能可能会为韵律生成提供有用的信息。为了检查所提出的语言功能的潜在实用性,评估了基于神经网络的韵律生成器(具有和不具有所提出的功能)的性能。客观测试和主观测试均表明,具有所提议的语言特征的韵律产生器比没有所提议的特征的韵律产生器表现更好。因此,建议的PC和基本短语分块信息是普通话韵律生成的有前途的功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号