首页> 外文会议>IEEE International Conference on Acoustics Speech and Signal;ICASSP 2010 >Robust speaking rate estimation using broad phonetic class recognition
【24h】

Robust speaking rate estimation using broad phonetic class recognition

机译:使用广泛的语音类别识别功能进行可靠的语音估计

获取原文

摘要

Robust speaking rate estimation can be useful in automatic speech recognition and speaker identification, and accurate, automatic measures of speaking rate are also relevant for research in linguistics, psychology, and social sciences. In this study we built a broad phonetic class recognizer for speaking rate estimation. We tested the recognizer on a variety of data sets, including laboratory speech, telephone conversations, foreign accented speech, and speech in different languages, and we found that the recognizer's estimates are robust under these sources of variation. We also found that the acoustic models of the broad phonetic classes are more robust than those of the monophones for syllable detection.
机译:健壮的语速估计在自动语音识别和说话者识别中很有用,准确,自动的语速测量也与语言学,心理学和社会科学领域的研究有关。在这项研究中,我们建立了一个广泛的语音分类识别器,用于语音速率估计。我们在各种数据集上对识别器进行了测试,包括实验室语音,电话交谈,外来语音和不同语言的语音,我们发现在这些变化来源下,识别器的估计是可靠的。我们还发现,针对音节检测,广泛的语音类别的声学模型比单音素的声学模型更健壮。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号