首页> 外文会议>INTERSPEECH 2012 >Synthetic References for Template-based ASR using Posterior Features

【24h】

Synthetic References for Template-based ASR using Posterior Features

机译：使用后部特征的基于模板的ASR的合成引用

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recently, the use of phoneme class-conditional probabilities as features (posterior features) for template-based ASR has been proposed. These features have been found to generalize well to unseen data and yield better systems than standard spectralbased features. In this paper, motivated by the high quality of current text-to-speech systems and the robustness of posterior features toward undesired variability, we investigate the use of synthetic speech to generate reference templates. The use of synthetic speech in template-based ASR not only allows to address the issue of in-domain data collection but also expansion of vocabulary. Using 75- and 600-word task-independent and speaker-independent setup on Phonebook database, we investigate different synthetic voices produced by the Festival HTS-based synthesizer trained on CMU ARCTIC databases. Our study shows that synthetic speech templates can yield performance cornparable to the natural speech templates, especially with synthetic voices that have high intelligibility.

机译：最近，基于模板的ASR使用音素类条件概率为特征（后功能）已经提出。这些功能已被发现以及推广到看不见的数据，并产生比标准spectralbased功能更好的系统。在本文中，由当前的文本到语音系统的高品质和对不需要的变异后的功能鲁棒性动机，我们调查使用合成语音的生成参考模板。在基于模板的ASR使用合成语音，不仅可以解决域数据采集的问题，也是扩大词汇量。使用75-和电话簿数据库600字的任务无关和独立扬声器的设置，我们调查由经过培训的CMU ARCTIC数据库节HTS为基础的合成器产生不同的合成声音。我们的研究表明，合成语音的模板也能产生性能cornparable到自然语音模板，特别是具有高清晰度合成声音。

著录项

来源
《INTERSPEECH 2012》|2012年||共4页
会议地点
作者
Serena Soldo; Mathew Magimai.-Doss; Hervé Bourlard;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 73.4136083;
关键词
Speech recognition; template-based approach; posterior features; synthetic reference templates;

机译：语音识别;基于模板的方法;后部特征;合成参考模板;

相似文献

外文文献
中文文献
专利

1. Acoustical Assessment of Voice Disorder With Continuous Speech Using ASR Posterior Features [J] . Liu Yuanyuan, Lee Tan, Law Thomas, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2019,第6期

机译：使用ASR后验特征对连续语音的语音障碍进行声学评估
2. Acoustical Assessment of Voice Disorder With Continuous Speech Using ASR Posterior Features [J] . Liu Yuanyuan, Lee Tan, Law Thomas, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2019,第6期

机译：使用ASR后部特征的语音障碍的声学评估
3. Equal correlation peak optimization of the filter-feature-based synthetic discriminant reference image [J] . Sheng Zhong, Shutian Liu, Xueru Zhang Optics Communications: A Journal Devoted to the Rapid Publication of Short Contributions in the Field of Optics and Interaction of Light with Matter . 1998,第4a6期

机译：基于滤波器特征的合成判别参考图像的等相关峰优化
4. Synthetic References for Template-based ASR using Posterior Features [C] . Serena Soldo, Mathew Magimai.-Doss, Herve Bourlardu Annual conference of the International Speech Communication Association . 2012

机译：基于后验特征的基于模板的ASR的合成参考
5. Characterization of the Rheological and Swelling Properties of Synthetic Alkali Silicate Gels in Order to Predict Their Behavior in ASR Damaged Concrete. [D] . Vayghan, Asghar Gholizadeh. 2017

机译：表征合成碱式硅酸盐凝胶的流变和膨胀特性，以预测其在ASR损坏的混凝土中的行为。
6. Template-based C8-SCORPION: a protein 8-state secondary structure prediction method using structural information and context-based features [O] . Ashraf Yaseen, Yaohang Li 2014

机译：基于模板的C8-SCORPION：使用结构信息和基于上下文的特征的蛋白质8状态二级结构预测方法
7. POSTERIOR FEATURES FOR TEMPLATE-BASED ASR [O] . Serena Soldo, Mathew Magimai. -doss, Joel Pinto, 2013

机译：基于模板的asR的后置特征
8. Inhalation Health Effect Reference Values for Manganese (CASRN 7439-96-5-Manganese) and Compounds (CASRN 1344-43-0; 1317-35-7; and 1129-60-5). [R] . 2012

机译：锰（CasRN 7439-96-5-锰）和化合物（CasRN 1344-43-0; 1317-35-7;和1129-60-5）的吸入健康效应参考值。

Synthetic References for Template-based ASR using Posterior Features

摘要

著录项

相似文献

相关主题

期刊订阅