Bootstrapping Text-to-Speech for speech processing in languages without an orthography

机译：引导文本到语音进行语音处理，而无需拼写法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speech synthesis technology has reached the stage where given a well-designed corpus of audio and accurate transcription an at least understandable synthesizer can be built without necessarily resorting to new innovations. However many languages do not have a well-defined writing system but such languages could still greatly benefit from speech systems. In this paper we consider the case where we have a (potentially large) single speaker database but have no transcriptions and no standardized way to write transcriptions. To address this scenario we propose a method that allows us to bootstrap synthetic voices purely from speech data. We use a novel combination of automatic speech recognition and automatic word segmentation for the bootstrapping. Our experimental results on speech corpora in two languages, English and German, show that synthetic voices that are built using this method are close to understandable. Our method is language-independent and can thus be used to build synthetic voices from a speech corpus in any new language.

机译：语音合成技术已到达阶段，其中可以建立一个设计的音频和准确转录精力和准确转录的阶段，可以建立至少可理解的合成器，而无需诉诸新的创新。然而，许多语言没有明确定义的写作系统，但这种语言仍然可以从语音系统中受益匪浅。在本文中，我们考虑到我们拥有（潜在大的）单个扬声器数据库但没有转录，没有标准化的方式来编写转录的情况。要解决此方案，我们提出了一种方法，允许我们纯粹从语音数据引导合成声音。我们使用自动语音识别和自动字分割的新组合进行引导。我们在两种语言，英语和德语的演讲语料库上的实验结果表明，使用此方法构建的合成声音靠近可理解。我们的方法是独立的语言，因此可以用于从任何新语言中从语音语料库构建合成声音。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2013年|7992-7996|共5页
会议地点
作者
Sitaram Sunayana; Palkar Sukhada; Chen Yun-Nung; Parlikar Alok;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Languages without an Orthography; Speech Synthesis; Synthesis without Text;

机译：没有拼字法的语言;语音合成;没有文本的合成;

相似文献

外文文献
中文文献
专利

1. An Approach to Proper Speech Segmentation for Quality Improvement in Concatenative Text-To-Speech System for Indian Languages [J] . SANGHAMITRA MOHANTY, SUMAN BHATTACHARYA, SUMIT BOSE, International Journal of Computer Processing of Oriental Languages . 2005,第1期

机译：适当的语音分割方法以提高印度语言的级联文本转语音系统的质量
2. Constructing text-to-speech systems for languages with unknown pronunciations [J] . Kei Sawada, Kei Hashimoto, Keiichiro Oura, Acoustical science and technology . 2018,第2期

机译：为未知发音的语言构建文本语音转换系统
3. Efficient Model for Numerical Text-To-Speech Synthesis System in Marathi, Hindi and English Languages [J] . G. D. Ramteke, R. J. Ramteke International Journal of Image, Graphics and Signal Processing . 2017,第3期

机译：马拉地语，北印度语和英语语言的数字语音合成系统的有效模型
4. BOOTSTRAPPING TEXT-TO-SPEECH FOR SPEECH PROCESSING IN LANGUAGES WITHOUT AN ORTHOGRAPHY [C] . Sunayana Sitaram, Sukhada Palkar, Yun-Nung Chen, International Conference on Acoustics, Speech and Signal Processing . 2013

机译：在没有正射图的语言中引导文本到语音处理
5. Universalizing universal design: Applying text-to-speech technology to English language learners' process writing. [D] . Kirstein, Marjorie. 2006

机译：通用设计的通用化：将文字转语音技术应用于英语学习者的过程写作。
6. NL reading skills mediate the relationship between NL phonological processing skills and a foreign language (FL) reading skills in students with and without dyslexia: a case of a NL (Polish) and FL (English) with different degrees of orthographic consistency [O] . Marta Łockiewicz, Martyna Jaskulska -1

机译：母语阅读技能可调节有或没有阅读障碍的学生的母语音素处理能力与外语（FL）阅读技能之间的关系：以正字法一致性程度不同的NL（波兰语）和FL（英语）为例
7. NL reading skills mediate the relationship between NL phonological processing skills and a foreign language (FL) reading skills in students with and without dyslexia: a case of a NL (Polish) and FL (English) with different degrees of orthographic consistency [O] . Marta Łockiewicz, Martyna Jaskulska 2019

机译：NL阅读技巧在没有综合障碍的学生中介绍了NL语音处理技能和外语（FL）阅读技巧的关系：一种NL（波兰）和FL（英语）的案例，具有不同程度的正交符合
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

Bootstrapping Text-to-Speech for speech processing in languages without an orthography

摘要

著录项

相似文献

相关主题

期刊订阅