Towards Fully Automatic Annotation of Audiobooks for TTS

Abstract

Building speech corpora is a first and crucial step for every text-to-speech synthesis system. Nowadays, the use of statistical models implies the use of very large corpora that need to be recorded, transcribed, annotated and segmented to be usable. The variety of corpora necessary for recent applications (content, style, etc.) makes the use of existing digital audio resources very attractive. Among the available resources, audiobooks are of particular interest because of their quality. Within this framework, we propose a complete acquisition, segmentation and annotation chain for audiobooks that aims to be fully automatic. The proposed process relies on a data structure, Roots, that establishes the relations between the different annotation levels, each represented as a sequence of items. This methodology has been applied successfully to 11 hours of speech extracted from an audiobook. A manual check on part of the corpus shows the efficiency of the process.

Bibliographic details

Source: International Conference on Language Resources and Evaluation | 2012 | pp. 975-980 | 6 pages
Authors: Olivier Boeffard; Laure Charonnat; Sébastien Le Maguer; Damien Lolive; Gaëlle Vidal
Original format: PDF
Keywords: Audiobook; annotation; phone segmentation

Similar literature

1. Metagenomic binning reconstruction coupled with automatic pipeline annotation and giant viruses: A potential source of mistake in annotations [J] . Julien Andreani, Bernard La Scola . Virus Research: An International Journal of Molecular and Cellular Virology . 2018

2. Modality annotation for Portuguese: from manual annotation to automatic labeling [J] . Amália Mendes, Iris Hendrickx, Luciana Ávila, Linguistic Issues in Language Technology . 2016, no. 0

3. Query Mining for Automatic Annotation and Annotation Based Image Retrieval Using Hidden Markov Model [J] . Shahidha M Meeran, Bineesh V . International Journal of Innovative Research in Science, Engineering and Technology . 2014, no. 5

4. Towards Fully Automatic Annotation of Audiobooks for TTS [C] . Olivier Boeffard, Laure Charonnat, Sébastien Le Maguer, LREC-2012 . 2012

5. Audiobooks and attitudes: An examination of school librarians' perspectives. [D] . Brock, Rosemarie M. 2013

6. Multi-expert annotation of Crohn’s disease images of the small bowel for automatic detection using a convolutional recurrent attention neural network [O] . Astrid de Maissin, Remi Vallée, Mathurin Flamant, 2021

7. End-to-End Automatic Speech Translation of Audiobooks [O] . Alexandre Berard, Laurent Besacier, Ali Can Kocabiyikoglu, 2018
