首页> 外文会议>Workshop on e-Commerce and NLP >ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling
【24h】

ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling

机译:基于跨话语语境和多任务语言建模的电子商务聊天机器人ASR自适应

获取原文

摘要

Automatic Speech Recognition (ASR) robustness toward slot entities are critical in e-commerce voice assistants that involve monetary transactions and purchases. Along with effective domain adaptation, it is intuitive that cross utterance contextual cues play an important role in disambiguating domain specific content words from speech. In this paper, we investigate various techniques to improve contextualization, content word robustness and domain adaptation of a Transformer-XL neural language model (NLM) to rescore ASR N-best hypotheses. To improve contextualization, we utilize turn level dialogue acts along with cross utterance context carry over. Additionally, to adapt our domain-general NLM towards e-commerce on-the-fly. we use embeddings derived from a finetuned masked LM on in-domain data. Finally, to improve robustness towards in-domain content words, we propose a multi-task model that can jointly perform content word detection and language modeling tasks. Compared to a non-contextual LSTM LM baseline, our best performing NLM rescorer results in a content WER reduction of 19.2% on e-commerce audio test set and a slot labeling Fl improvement of 6.4%.
机译:自动语音识别(ASR)对插槽实体的鲁棒性在涉及货币交易和购买的电子商务语音助手中至关重要。除了有效的领域适应,跨话语语境线索在消除特定领域内容词的歧义方面起着重要作用。在本文中,我们研究了各种技术,以提高Transformer XL神经语言模型(NLM)的语境化、内容词鲁棒性和领域适应能力,从而重新审视ASR N最佳假设。为了提高语境化程度,我们利用了转折层面的对话行为以及跨话语语境的延续。此外,使我们的领域通用NLM适应动态电子商务。我们在域内数据上使用从微调屏蔽LM派生的嵌入。最后,为了提高对域内内容词的鲁棒性,我们提出了一个多任务模型,可以联合执行内容词检测和语言建模任务。与无语境的LSTM LM基线相比,我们表现最好的NLM rescorer使电子商务音频测试集的内容WER减少了19.2%,时隙标签Fl提高了6.4%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号