...
首页> 外文期刊>SIGKDD explorations >Resolving Abstract Anaphora Implicitly in Conversational Assistants using a Hierarchically-stacked RNN
【24h】

Resolving Abstract Anaphora Implicitly in Conversational Assistants using a Hierarchically-stacked RNN

机译:使用分层堆叠的RNN隐式地在会话助手中解析抽象的Anaphora

获取原文
获取原文并翻译 | 示例
           

摘要

Recent proliferation of conversational systems has resulted in an increased demand for more natural dialogue systems, capable of more sophisticated interactions than merely providing factual answers. This is evident from usage pattern of a conversational system deployed within our organization. Users not only expect it to perform co-reference resolution of anaphora, but also of the antecedent or posterior facts presented by users with respect to their query. Presence of such facts in a conversation sometimes modifies the answer of main query, e.g., answer to 'how many sick leave do I get?' would be different when a fact 'I am on contract' is also present. Sometimes there is a need to collectively resolve three or four such facts. In this paper, we propose a novel solution which uses hierarchical neural network, comprising of BiLSTM layer and a maxpool layer that is hierarchically stacked to first obtain a representation of each user utterance and then to obtain a representation for sequence of utterances. This representation is used to identify users' intention. We also improvise this model by using skip connections in the second network to allow better gradient flow. Our model, not only a) resolves the antecedent and posterior facts, but also b) performs better even on self-contained queries. It is also c) faster to train, making it the most promising approach for use in our environment where frequent training and tuning is needed. It slightly outperforms the benchmark on a publicly available dataset, and e) performs better than obvious baselines approaches on our datasets.
机译:最近的会话系统的扩散导致对更自然对话系统的需求增加,能够更复杂的互动,而不是仅提供事实答案。这是从我们组织内部署的会话系统的使用模式明显。用户不仅预计它可以执行申请人的共同参考分辨率,也是用户对其查询呈现的前进或后部事实。在谈话中存在这些事实有时会修改主要查询的答案,例如,回答“我得到了多少病假?”当事实'我在合同上'也存在时会有所不同。有时需要共同解决三个或四个这样的事实。在本文中,我们提出了一种使用分层神经网络的新解决方案,包括Bilstm层和一个均基地堆叠以首先获得每个用户话语的表示,然后获得话语序列的表示。此表示用于识别用户的意图。我们还通过使用第二网络中的跳过连接来即兴创作该模型,以允许更好的梯度流。我们的模型,不仅是a)解决了先发病人和后部事实,而且b)即使在自包含的查询中也表现出更好。培训速度速度更快,使其在需要频繁培训和调整的环境中使用最有希望的方法。它略显略高于公共数据集上的基准,而e)在我们数据集上的明显基线执行更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号