首页> 外文会议>IEEE Workshop on Spoken Language Technology >The influence of automatic speech recognition accuracy on the performance of an automated speech assessment system
【24h】

The influence of automatic speech recognition accuracy on the performance of an automated speech assessment system

机译:自动语音识别准确性对自动语音评估系统性能的影响

获取原文

摘要

The effectiveness of automated scoring systems for evaluating spoken language proficiency depends greatly on the quality of the automatic speech recognition (ASR) output that is used to calculate the features for the scoring model. In this paper, we examine the effects of ASR word error rate (WER) on the scores produced by a system for automated scoring of non-native English speaking proficiency, as well as on the scoring model features (especially content features) in order to demonstrate the impact of ASR improvements on the performance of the automated speech assessment system. Five different sets of transcriptions with varying degrees of WER ranging from 0% to 52% (including four sets of ASR hypotheses and manual transcriptions) were obtained for a dataset of spoken responses from a pilot administration of an assessment of non-native English speaking proficiency. The experimental results show that higher performing ASR leads to better performance in the automated assessment system; furthermore, the correlation between human and automated scores drops substantially with an increase in WER from 10.7% to 28.9%, whereas the correlation changes little within the following two ranges of WERs: 0% to 10.7% and 28.9% to 52%. A detailed analysis of the features used in the scoring model shows that the ASR errors have a bigger impact on the content features than the delivery and language use features.
机译:自动评分系统评价口语语言能力的有效性在很大程度上取决于被用于计算评分模型的特征的自动语音识别(ASR)的输出的质量。在本文中,我们将考察对母语非英语的自动评分系统产生的分数ASR字错误率(WER)的影响来说熟练程度,以及对评分模型的特点(尤其是内容特征),以展示ASR改进的自动语音评估体系的性能的影响。获得用于语音响应的数据集从非英语为母语的熟练程度进行评估的导频给药五组不同的具有不同程度的WER的范围从0%至52%(包括四组ASR假说和手动转录的)转录的。实验结果表明,较高的执行ASR导致在自动评估系统更好的性能;此外,人的和自动化的分数之间的相关性从10.7%WER增加到28.9%显着下降,而相关WERS以下两个范围内变化不大:0%〜10.7%和28.9%至52%。的详细分析使用的功能在评分模型显示,ASR误差对内容功能比交付和语言使用的功能产生更大的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号