首页> 外文会议>Spoken Language Technology Workshop >Investigating the Downstream Impact of Grapheme-Based Acoustic Modeling on Spoken Utterance Classification
【24h】

Investigating the Downstream Impact of Grapheme-Based Acoustic Modeling on Spoken Utterance Classification

机译:调查石墨对声学建模对话语分类的下游影响

获取原文

摘要

Automatic speech recognition (ASR) and natural language understanding are critical components of spoken language understanding (SLU) systems. One obstacle to providing services with SLU systems in multiple languages is the cost associated with acquiring all of the language-specific resources required for ASR in each language. Modeling graphemes eliminates the need to obtain a pronunciation dictionary which maps from speech sounds to words and is one way to reduce ASR resource dependencies when rapidly developing ASR in new languages. However, little is known about the downstream impact on SLU task performance when selecting graphemes as the acoustic modeling unit. This work investigates acoustic modeling for the ASR component of an SLU system using grapheme-based approaches together with convolutional and recurrent neural network architectures. We evaluate both ASR word accuracy and spoken utterance classification (SUC) accuracy for English, Italian and Spanish language tasks and find that it is possible to achieve SUC accuracy that is comparable to conventional phoneme-based systems which leverage a pronunciation dictionary.
机译:自动语音识别(ASR)和自然语言理解是语言理解(SLU)系统的关键组成部分。以多种语言提供与SLU系统提供服务的障碍是与在每种语言中获取ASR所需的所有语言特定资源相关的成本。建模图形消除了获取发音词典的需要,从语音声音映射到单词,并且是减少在新语言中快速开发ASR的ASR资源依赖性的一种方法。然而,在选择图形作为声学建模单元时,关于SLU任务性能的下游影响很少。该工作调查了使用基于石墨的方法与卷积和经常性神经网络架构的SLU系统的ASR组件的声学建模。我们评估英语,意大利语和西班牙语任务的ASR字准确性和口语准确性(SUC)准确性,并发现可以实现与传统的基于音素的系统相当的成功精度,该系统利用发音词典。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号