Automatic Speech Corpus Construction from Broadcasting Speech Databases

机译：从广播语音数据库自动构建语音语料库

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The speech corpus often needs to be constructed frequently for the diversified speech synthesis. This paper discusses our efforts on construction of speech corpus automatically from broadcasting speech databases for trainable Text-To-Speech (TTS) system. We present a new framework of automatic speech corpus construction from broadcasting speech databases. We select the clean speech audios from the broadcasting audios with a music detector which is based on speech/music discrimination. An automatic speech sentence segmentation system is used to generate the sentence database from the clean speech audios. At last, a text corpus construction method selects appropriate sentences speech which is maximizing the coverage of the sentence databaseȁ9;s diphones. Experiments show that our method can generate a good speech corpus rapidly with minimum manual intervention.

机译：为了多样化的语音合成，经常需要构建语音语料库。本文讨论了我们从可训练的文本语音转换（TTS）系统的广播语音数据库自动构建语音语料库的努力。我们从广播语音数据库中提出了一种自动语音语料库构建的新框架。我们使用基于语音/音乐辨别力的音乐检测器从广播音频中选择干净的语音音频。自动语音句子分割系统用于从干净的语音音频生成句子数据库。最后，一种语料库构建方法选择适当的句子语音，以最大程度地覆盖句子数据库9个双音素。实验表明，我们的方法能够以最少的人工干预快速生成良好的语音语料库。

著录项

来源
《2010 International Conference on Computational Intelligence and Security》|2010年|p.639-643|共5页
会议地点
作者
Zhang Wei; Du Ranran; Pang Minhui; Wang Qiuhong;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词
automatic speech sentence segmentation; speech corpus; speech synthesis; text selection;

机译：自动语音句子分割;语音语料库;语音合成;文本选择;

相似文献

外文文献
中文文献
专利

1. ALGERIAN ARABIC SPEECH DATABASE (ALGASD): CORPUS DESIGN AND AUTOMATIC SPEECH RECOGNITION APPLICATION [J] . Ghania Droua-Hamdani, Sid Ahmed Selouani, IMalika Boudraa The Arabian journal for science and engineering . 2010,第2C期

机译：阿尔及利亚阿拉伯语语音数据库（ALGASD）：语料库设计和自动语音识别应用
2. Chhattisgarhi speech corpus for research and development in automatic speech recognition [J] . Narendra D. Londhe, Ghanahshyam B. Kshirsagar International journal of speech technology . 2018,第2期

机译：Chhattisgarhi语音语料库，用于自动语音识别的研究与开发
3. An automatic speech recognition system for spontaneous Punjabi speech corpus [J] . Yogesh Kumar, Navdeep Singh International journal of speech technology . 2017,第2期

机译：自发旁遮普语语料库的自动语音识别系统
4. Automatic Speech Corpus Construction from Broadcasting Speech Databases [C] . Zhang Wei, Du Ranran, Pang Minhui, International Conference on Computational Intelligence and Security . 2010

机译：从广播语音数据库自动语音语料库构建
5. Advances in Audiovisual Speech Processing for Robust Voice Activity Detection and Automatic Speech Recognition [D] . Tao, Fei. 2018

机译：用于鲁棒语音活动检测和自动语音识别的视听语音处理方面的进展
6. SUST Bangla Emotional Speech Corpus (SUBESCO): An audio-only emotional speech corpus for Bangla [O] . Sadia Sultana, M. Shahidur Rahman, M. Reza Selim, 2021

机译：Sull Bangla情感语音语料库（Subesco）：孟加拉的一个音频情绪语音语音
7. Development of Isolated Numeric Speech Corpus for Swahili Language for Development of Automatic Speech Recognition System [O] . Aaron M. Oirere, Ratnadeep R. Deshmukh, Pukhraj P. Shrishrimal 2014

机译：斯瓦希里语孤立数字语音语料库的开发，用于语音自动识别系统的开发
8. RSRE (Royal Signals and Radar Establishment) Speech Database Recordings (1983). Part 2. Recording Made for Automatics Speech Recognition Assessment and Research [R] . Russell, M. J., Moore, R. K., Tomlinson, M. J., 1984

机译：RsRE（皇家信号和雷达建立）语音数据库记录（1983年）。第2部分。用于自动语音识别评估和研究的录音

Automatic Speech Corpus Construction from Broadcasting Speech Databases

摘要

著录项

相似文献

相关主题

期刊订阅