首页> 外文会议>Computational linguistics for linguistic complexity >Automatic Speech Recognition Errors as a Predictor of L2 Listening Difficulties
【24h】

Automatic Speech Recognition Errors as a Predictor of L2 Listening Difficulties

机译:自动语音识别错误可预测L2听力困难

获取原文
获取原文并翻译 | 示例

摘要

This paper investigates the use of automatic speech recognition (ASR) errors as indicators of the second language (L2) learners' listening difficulties and in doing so strives to overcome the shortcomings of Partial and Synchronized Caption (PSC) system. PSC is a system that generates a partial caption including difficult words detected based on high speech rate, low frequency, and specificity. To improve the choice of words in this system, and explore a better method to detect speech challenges, ASR errors were investigated as a model of the L2 listener, hypothesizing that some of these errors are similar to those of language learners' when transcribing the videos. To investigate this hypothesis, ASR errors in transcription of several TED talks were analyzed and compared with PSC's selected words. Both the overlapping and mismatching cases were analyzed to investigate possible improvement for the PSC system. Those ASR errors that were not detected by PSC as cases of learners' difficulties were further analyzed and classified into four categories: homophones, minimal pairs, breached boundaries and negatives. These errors were embedded into the baseline PSC to make the enhanced version and were evaluated in an experiment with L2 learners. The results indicated that the enhanced version, which encompasses the ASR errors addresses most of the L2 learners' difficulties and better assists them in comprehending challenging video segments as compared with the baseline.
机译:本文研究使用自动语音识别(ASR)错误作为第二语言(L2)学习者听力困难的指标,并以此来克服部分和同步字幕(PSC)系统的缺点。 PSC是一种生成部分字幕的系统,其中包括基于高语音速率,低频和特定性而检测到的困难单词。为了改善该系统中单词的选择并探索一种更好的方法来检测语音挑战,我们对ASR错误作为第二级听众的模型进行了调查,并假设其中一些错误与录制视频时语言学习者的错误相似。 。为了研究这个假设,分析了几个TED演讲的转录中的ASR错误,并将其与PSC选定的单词进行了比较。分析了重叠和不匹配的情况,以研究PSC系统可能的改进。那些由PSC未检测到的作为学习者困难情况的ASR错误将进一步分析,并分为四类:同音字,最小对,违反边界和否定字。这些错误被嵌入到基线PSC中以构成增强版本,并在L2学习者的实验中进行了评估。结果表明,包含ASR错误的增强版本可以解决大多数L2学习者的困难,并且与基线相比,可以更好地帮助他们理解具有挑战性的视频片段。

著录项

  • 来源
  • 会议地点 Osaka(JP)
  • 作者单位

    Graduate School of Informatics, Kyoto University Yoshida-Honmachi, Sakyo-ku, Kyoto 606-8501, Japan;

    Graduate School of Informatics, Kyoto University Yoshida-Honmachi, Sakyo-ku, Kyoto 606-8501, Japan;

    Graduate School of Informatics, Kyoto University Yoshida-Honmachi, Sakyo-ku, Kyoto 606-8501, Japan;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号