首页> 外文OA文献 >Discriminative methods for statistical spoken dialogue systems
【2h】

Discriminative methods for statistical spoken dialogue systems

机译:统计口语对话系统的判别方法

摘要

Dialogue promises a natural and effective method for users to interact with and obtain information from computer systems. Statistical spoken dialogue systems are able to disambiguate in the presence of errors by maintaining probability distributions over what they believe to be the state of a dialogue. However, traditionally these distributions have been derived using generative models, which do not directly optimise for the criterion of interest and cannot easily exploit arbitrary information that may potentially be useful. This thesis presents how discriminative methods can overcome these problems in Spoken Language Understanding (SLU) and Dialogue State Tracking (DST).A robust method for SLU is proposed, based on features extracted from the full posterior distribution of recognition hypotheses encoded in the form of word confusion networks. This method uses discriminative classifiers, trained on unaligned input/output pairs. Performance is evaluated on both an off-line corpus, and on-line in a live user trial. It is shown that a statistical discriminative approach to SLU operating on the full posterior ASR output distribution can substantially improve performance in terms of both accuracy and overall dialogue reward. Furthermore, additional gains can be obtained by incorporating features from the system's output.For DST, a new word-based tracking method is presented that maps directly from the speech recognition results to the dialogue state without using an explicit semantic decoder. The method is based on a recurrent neural network structure that is capable of generalising to unseen dialogue state hypotheses, and requires very little feature engineering. The method is evaluated in the second and third Dialog State Tracking Challenges, as well as in a live user trial. The results demonstrate consistently high performance across all of the off-line metrics and a substantial increase in the quality of the dialogues in the live trial. The proposed method is shown to be readily applied to expanding dialogue domains, by exploiting robust features and a new method for online unsupervised adaptation. It is shown how the neural network structure can be adapted to output structured joint distributions, giving an improvement over estimating the dialogue state as a product of marginal distributions.
机译:对话为用户提供了一种自然有效的方法,可以与计算机系统进行交互并从计算机系统中获取信息。统计口语对话系统能够通过在他们认为是对话状态的位置上保持概率分布来消除存在错误时的歧义。但是,传统上,这些分布是使用生成模型导出的,生成模型无法直接针对感兴趣的标准进行优化,也无法轻松利用可能有用的任意信息。本文提出了判别方法如何克服口语理解(SLU)和对话状态跟踪(DST)中的这些问题。基于从识别假设的全部后验分布中提取的特征,提出了一种鲁棒的SLU方法。单词混淆网络。此方法使用在未对齐的输入/输出对上训练的判别式分类器。在离线语料库和实时用户试用中都可以评估性能。结果表明,对SLU进行完全后验ASR输出分配的统计判别方法可以从准确性和总体对话奖励方面显着提高性能。此外,还可以通过合并系统输出中的功能来获得额外收益。对于DST,提出了一种新的基于单词的跟踪方法,该方法无需使用显式语义解码器即可将语音识别结果直接映射到对话状态。该方法基于递归神经网络结构,该结构能够推广到看不见的对话状态假设,并且几乎不需要特征工程。在第二个和第三个“对话状态跟踪挑战”中以及在实时用户试用中都对该方法进行了评估。结果表明,所有离线指标的性能始终如一,并且实时测试中对话的质量大大提高。通过利用健壮的功能和一种用于在线无监督自适应的新方法,该提议的方法很容易应用于扩展对话域。它显示了如何将神经网络结构调整为输出结构化的关节分布,从而相对于将对话状态估计为边际分布的乘积进行了改进。

著录项

  • 作者

    Henderson Matthew S.;

  • 作者单位
  • 年度 2015
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号