首页> 外文会议>Annual meeting of the Association for Computational Linguistics >DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents
【24h】

DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents

机译:DocChat:使用非结构化文档的聊天机器人引擎的信息检索方法

获取原文

摘要

Most current chatbot engines are designed to reply to user utterances based on existing utterance-response (or Q-R) pairs. In this paper, we present DocChat, a novel information retrieval approach for chatbot engines that can leverage unstructured documents, instead of Q-R pairs, to respond to utterances. A learning to rank model with features designed at different levels of granularity is proposed to measure the relevance between utterances and responses directly. We evaluate our proposed approach in both English and Chinese: (ⅰ) For English, we evaluate DocChat on WikiQA and QASent, two answer sentence selection tasks, and compare it with state-of-the-art methods. Reasonable improvements and good adaptability are observed. (ⅱ) For Chinese, we compare DocChat with XiaoIce, a famous chitchat engine in China, and side-by-side evaluation shows that DocChat is a perfect complement for chatbot engines using Q-R pairs as main source of responses.
机译:当前大多数聊天机器人引擎都设计为根据现有的话语响应(或Q-R)对来回复用户话语。在本文中,我们介绍了DocChat,这是一种用于聊天机器人引擎的新颖信息检索方法,可以利用非结构化文档而不是Q-R对来响应语音。提出了一种学习分级模型,该模型具有在不同粒度级别上设计的特征,以直接测量话语和响应之间的相关性。我们以英文和中文评估我们提出的方法:(ⅰ)对于英文,我们评估WikiQA和QASent上的DocChat(两个答案句子选择任务),并将其与最新方法进行比较。观察到合理的改进和良好的适应性。 (ⅱ)对于中文,我们将DocChat与中国著名的聊天引擎XiaoIce进行了比较,并排评估表明DocChat是使用Q-R对作为主要响应源的聊天机器人引擎的完美补充。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号