首页> 外文会议>IEEE International Conference on Machine Learning and Applications >A Retrieval-Based Dialogue System Utilizing Utterance and Context Embeddings
【24h】

A Retrieval-Based Dialogue System Utilizing Utterance and Context Embeddings

机译:利用话语和上下文嵌入的基于检索的对话系统

获取原文

摘要

Finding semantically rich and computer-understandable representations for textual dialogues, utterances and words is crucial for dialogue systems (or conversational agents), as their performance mostly depends on understanding the context of conversations. In recent research approaches, responses have been generated utilizing a decoder architecture, given the distributed vector representation (embedding) of the current conversation. In this paper, the utilization of embeddings for answer retrieval is explored by using Locality-Sensitive Hashing Forest (LSH Forest), an Approximate Nearest Neighbor (ANN) model, to find similar conversations in a corpus and rank possible candidates. Experimental results on the well-known Ubuntu Corpus (in English) and a customer service chat dataset (in Dutch) show that, in combination with a candidate selection method, retrieval-based approaches outperform generative ones and reveal promising future research directions towards the usability of such a system.
机译:查找文本对话,话语和单词的语义丰富且计算机可理解的表示形式对于对话系统(或对话代理)至关重要,因为它们的性能主要取决于对对话上下文的理解。在最近的研究方法中,给定当前对话的分布式矢量表示(嵌入),已经利用解码器体系结构生成了响应。在本文中,通过使用局域敏感哈希林(LSH Forest)(一种近似最近邻居(ANN)模型)来探索将嵌入用于答案检索的方法,以在语料库中查找相似的对话并对可能的候选词进行排名。在著名的Ubuntu Corpus(英语)和客户服务聊天数据集(荷兰语)上的实验结果表明,与候选选择方法相结合,基于检索的方法优于生成方法,并揭示了关于可用性的未来研究方向这样的系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号