American Journal of Applied Sciences

Corpus Design for Malay Corpus-based Speech Synthesis System | Science Publications



Abstract

Problem statement: The speech corpus is one of the major components of corpus-based synthesis. Its quality and coverage affect the quality of the synthesized speech.

Approach: This study proposes a corpus design for a Malay corpus-based speech synthesis system. It covers the design criteria for corpus-based speech synthesis, the design of the Malay corpus-based database, and the concatenation engine of the Malay corpus-based synthesis system. A 10-million digital text corpus for the Malay language was collected from Malay internet news. This text corpus was analyzed using word frequency counts to identify the high-frequency words used to design the sentences for the speech corpus.

Results: Altogether, 381 sentences were designed for the speech corpus using 70% of the high-frequency words from the 10-million text corpus. The speech corpus consists of 16,826 phoneme units, and its total storage size is 37.6 Mb. All phone units are phonetically transcribed to preserve the phonetic context of their origin, which will be used for the phonetic context unit. The speech corpus was labeled at the phoneme level and used for variable-length continuous phoneme-based concatenation.

Conclusion/Recommendation: This study proposes a platform for designing speech corpora, especially for Malay text-to-speech, which can be further enhanced to support greater coverage and higher naturalness of synthetic speech.
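The word frequency count mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the tokenization rule, and the toy Malay sample text are all assumptions made here for illustration, and the abstract does not say how words were tokenized or how the high-frequency cut-off was chosen.

```python
import re
from collections import Counter

# Hypothetical tokenizer: lowercase alphabetic words, keeping hyphenated
# reduplications such as "kanak-kanak" as one word (an assumption).
WORD_PATTERN = re.compile(r"[a-z]+(?:-[a-z]+)*")


def word_frequencies(text):
    """Count how often each word occurs in the text."""
    return Counter(WORD_PATTERN.findall(text.lower()))


def high_frequency_words(freqs, n):
    """Return the n most frequent words, most frequent first."""
    return [word for word, _ in freqs.most_common(n)]


def coverage(sentence, vocabulary):
    """Fraction of the words in a sentence that belong to the vocabulary."""
    words = WORD_PATTERN.findall(sentence.lower())
    if not words:
        return 0.0
    return sum(word in vocabulary for word in words) / len(words)


# Toy sample standing in for the news text corpus (illustrative only).
sample = "saya makan nasi dan saya minum air dan saya tidur"
freqs = word_frequencies(sample)
top_words = high_frequency_words(freqs, 2)
print(top_words)  # ['saya', 'dan']
```

A helper such as `coverage` could then be used to check how much of a candidate sentence is made up of the selected high-frequency words before the sentence is added to the recording script.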
