首页> 外文期刊>Procedia Computer Science >A Tool to Solve Sentence Segmentation Problem on Preparing Speech Database for Indonesian Text-to-speech System
【24h】

A Tool to Solve Sentence Segmentation Problem on Preparing Speech Database for Indonesian Text-to-speech System

机译:为印尼文字转语音系统准备语音数据库时解决句子分割问题的工具

获取原文

摘要

Creating a training data ready to be used for developing a text-to-speech (TTS) system can be a difficult task, since sometimes the recorded audio data is not the same with the prepared texts. To overcome differences between audio and text data, we developed a tool to segment audio data into sentences. As it is known, doing sentence segmentation of audio data manually needs efforts and resources. This paper presents a solution for alleviating problems encountered during segmentation process of audio data for developing an Indonesian TTS system. The tool was developed based on a fact that bahasa Indonesia is a syllable-timed language. We found that our tool reduces resources needed for segmenting Indonesian audio data.
机译:创建准备好用于开发文本语音转换(TTS)系统的训练数据可能是一项艰巨的任务,因为有时记录的音频数据与准备好的文本并不相同。为了克服音频和文本数据之间的差异,我们开发了一种将音频数据分段为句子的工具。众所周知,手动进行音频数据的句子分段需要付出努力和资源。本文提出了一种解决方案,可减轻开发印度尼西亚TTS系统的音频数据分割过程中遇到的问题。该工具是基于印度尼西亚语是一种音节定时语言而开发的。我们发现我们的工具减少了分割印度尼西亚音频数据所需的资源。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号