首页> 外文会议>International Conference on Text, Speech and Dialogue >Intelligibility Is More Than a Single Word: Quantification of Speech Intelligibility by ASR and Prosody
【24h】

Intelligibility Is More Than a Single Word: Quantification of Speech Intelligibility by ASR and Prosody

机译:可懂度不仅仅是一个单词:ASR和韵律的语音可懂度量化

获取原文

摘要

In this paper we examine the quality of the prediction of intelligibility scores of human experts. Furthermore, we investigate the differences between subjective expert raters who evaluated speech disorders of laryngectomees and children with cleft lip and palate. We use the recognition rate of a word recognizer and prosodic features to predict the intelligibility score of each individual expert. For each expert and the mean opinion of all experts we present the best features to model their scoring behavior according to the mean rank obtained during a 10-fold cross-validation. In this manner all individual speech experts were modeled with a correlation coefficient of at least r > .75. The mean opinion of all raters is predicted with a correlation of r =.90 for the laryngectomees and r =.86 for the children.
机译:在本文中,我们研究了人类专家可懂度分数的预测的质量。此外,我们调查了评估喉部和唇腭裂的喉部和儿童的语音障碍的主观专家评级之间的差异。我们使用单词识别器和韵律特征的识别率来预测每个专家的可明智性评分。对于每个专家以及所有专家的平均意见,我们展示了根据在10倍交叉验证期间获得的平均等级来建模得分行为的最佳功能。以这种方式,所有单个语音专家都以至少r> .75的相关系数进行建模。所有评价者的平均意见是预测r = .90对于喉部的r = .90和儿童的r = .86。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号