Synthetic References for Template-based ASR using Posterior Features

机译：基于后验特征的基于模板的ASR的合成参考

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recently, the use of phoneme class-conditional probabilities as features (posterior features) for template-based ASR has been proposed. These features have been found to generalize well to unseen data and yield better systems than standard spectral-based features. In this paper, motivated by the high quality of current text-to-speech systems and the robustness of posterior features toward undesired variability, we investigate the use of synthetic speech to generate reference templates. The use of synthetic speech in template-based ASR not only allows to address the issue of in-domain data collection but also expansion of vocabulary. Using 75- and 600-word task-independent and speaker-independent setup on Phonebook database, we investigate different synthetic voices produced by the Festival HTS-based synthesizer trained on CMU ARCTIC databases. Our study shows that synthetic speech templates can yield performance comparable to the natural speech templates, especially with synthetic voices that have high intelligibility.

机译：最近，已经提出了将音素类别条件概率用作基于模板的ASR的特征（后验特征）。已经发现这些功能可以很好地推广到看不见的数据，并且比基于标准频谱的功能可以提供更好的系统。在本文中，受当前文本到语音系统的高质量以及后部特征对不希望的可变性的鲁棒性的影响，我们研究了使用合成语音来生成参考模板的情况。在基于模板的ASR中使用合成语音不仅可以解决域内数据收集问题，而且还可以扩展词汇量。使用电话簿数据库上的75字和600字独立于任务和与说话者无关的设置，我们研究了由在CMU ARCTIC数据库上训练的基于Festival HTS的合成器产生的不同合成声音。我们的研究表明，合成语音模板可以产生与自然语音模板相当的性能，尤其是对于具有高清晰度的合成语音而言。

著录项

来源
《Annual conference of the International Speech Communication Association》|2012年|2143-2146|共4页
会议地点
作者
Serena Soldo; Mathew Magimai.-Doss; Herve Bourlardu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Speech recognition; template-based approach; posterior features; synthetic reference templates;

机译：语音识别;基于模板的方法;后部特征;综合参考模板;
入库时间 2022-08-26 15:11:04

相似文献

外文文献
中文文献
专利

1. Acoustical Assessment of Voice Disorder With Continuous Speech Using ASR Posterior Features [J] . Liu Yuanyuan, Lee Tan, Law Thomas, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2019,第6期

机译：使用ASR后验特征对连续语音的语音障碍进行声学评估
2. Acoustical Assessment of Voice Disorder With Continuous Speech Using ASR Posterior Features [J] . Liu Yuanyuan, Lee Tan, Law Thomas, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2019,第6期

机译：使用ASR后部特征的语音障碍的声学评估
3. Equal correlation peak optimization of the filter-feature-based synthetic discriminant reference image [J] . Sheng Zhong, Shutian Liu, Xueru Zhang Optics Communications: A Journal Devoted to the Rapid Publication of Short Contributions in the Field of Optics and Interaction of Light with Matter . 1998,第4a6期

机译：基于滤波器特征的合成判别参考图像的等相关峰优化
4. Synthetic References for Template-based ASR using Posterior Features [C] . Serena Soldo, Mathew Magimai.-Doss, Hervé Bourlard INTERSPEECH 2012 . 2012

机译：使用后部特征的基于模板的ASR的合成引用
5. Characterization of the Rheological and Swelling Properties of Synthetic Alkali Silicate Gels in Order to Predict Their Behavior in ASR Damaged Concrete. [D] . Vayghan, Asghar Gholizadeh. 2017

机译：表征合成碱式硅酸盐凝胶的流变和膨胀特性，以预测其在ASR损坏的混凝土中的行为。
6. Template-based C8-SCORPION: a protein 8-state secondary structure prediction method using structural information and context-based features [O] . Ashraf Yaseen, Yaohang Li 2014

机译：基于模板的C8-SCORPION：使用结构信息和基于上下文的特征的蛋白质8状态二级结构预测方法
7. POSTERIOR FEATURES FOR TEMPLATE-BASED ASR [O] . Serena Soldo, Mathew Magimai. -doss, Joel Pinto, 2013

机译：基于模板的asR的后置特征
8. Inhalation Health Effect Reference Values for Manganese (CASRN 7439-96-5-Manganese) and Compounds (CASRN 1344-43-0; 1317-35-7; and 1129-60-5). [R] . 2012

机译：锰（CasRN 7439-96-5-锰）和化合物（CasRN 1344-43-0; 1317-35-7;和1129-60-5）的吸入健康效应参考值。

Synthetic References for Template-based ASR using Posterior Features

摘要

著录项

相似文献

相关主题

期刊订阅