Robust speaking rate estimation using broad phonetic class recognition

机译：使用广泛的语音类别识别功能进行可靠的语音估计

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Robust speaking rate estimation can be useful in automatic speech recognition and speaker identification, and accurate, automatic measures of speaking rate are also relevant for research in linguistics, psychology, and social sciences. In this study we built a broad phonetic class recognizer for speaking rate estimation. We tested the recognizer on a variety of data sets, including laboratory speech, telephone conversations, foreign accented speech, and speech in different languages, and we found that the recognizer's estimates are robust under these sources of variation. We also found that the acoustic models of the broad phonetic classes are more robust than those of the monophones for syllable detection.

机译：健壮的语速估计在自动语音识别和说话者识别中很有用，准确，自动的语速测量也与语言学，心理学和社会科学领域的研究有关。在这项研究中，我们建立了一个广泛的语音分类识别器，用于语音速率估计。我们在各种数据集上对识别器进行了测试，包括实验室语音，电话交谈，外来语音和不同语言的语音，我们发现在这些变化来源下，识别器的估计是可靠的。我们还发现，针对音节检测，广泛的语音类别的声学模型比单音素的声学模型更健壮。

著录项

来源
《IEEE International Conference on Acoustics Speech and Signal;ICASSP 2010》|2010年|p.4222-4225|共4页
会议地点 Dallas, TX(US);Dallas, TX(US)
作者
Yuan, Jiahong; Liberman, Mark;
展开▼
作者单位

University of Pennsylvania USA;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Speaking rate estimation; broad phonetic class; robustness; syllable detection;

机译：说话率估算；广泛的语音课；健壮性音节检测;
入库时间 2022-08-26 14:40:25

相似文献

外文文献
中文文献
专利

1. Phonetically optimized speaker modeling for robust speaker recognition [J] . Bong-Jin Lee, Jeung-Yoon Choi, Hong-Goo Kang The Journal of the Acoustical Society of America . 2009,第3期

机译：通过语音优化的说话人建模，可实现可靠的说话人识别
2. Feature classification criterion for missing features mask estimation in robust speaker recognition - Springer [J] . Dayana Ribas González, José Ramón Calvo de Lara Signal, Image and Video Processing . 2014,第2期

机译：健壮的说话人识别中缺少特征蒙版估计的特征分类标准-Springer
3. Information theoretic optimal vocal tract region selection from real time magnetic resonance images for broad phonetic class recognition [J] . Abhay Prasad, Prasanta Kumar Ghosh Computer speech and language . 2016,第sepa期

机译：从实时磁共振图像中选择信息理论的最佳声道区域，以进行广泛的语音分类识别
4. A Comparison of Broad Phonetic and Acoustic Unitsfor Noise Robust Segment-Based Phonetic Recognition [C] . Tara N. Sainath, Victor Zue International Speech Communication Association . 2008

机译：基于噪声稳健段的语音识别的广泛语音和声学单元的比较
5. Acoustic modeling and speaker normalization strategies with application to robust in-vehicle speech recognition and dialect classification. [D] . Yapanel, Umit. 2005

机译：声学建模和说话人归一化策略及其在强大的车载语音识别和方言分类中的应用。
6. Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition [O] . Sridhar Krishna Nemala, Kailash Patil, Mounya Elhilali -1

机译：识别消息和使者：仿生频谱分析可增强语音和说话者识别能力
7. Robust Speaking Rate Estimation Using Broad Phonetic Class Recognition [O] . Yuan, Jiahong, Liberman, Mark 2010

机译：使用广泛的语音类别识别进行稳健的说话率估计

Robust speaking rate estimation using broad phonetic class recognition

摘要

著录项

相似文献

相关主题

期刊订阅