DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents


Abstract

Most current chatbot engines are designed to reply to user utterances based on existing utterance-response (or Q-R) pairs. In this paper, we present DocChat, a novel information retrieval approach for chatbot engines that can leverage unstructured documents, instead of Q-R pairs, to respond to utterances. A learning-to-rank model with features designed at different levels of granularity is proposed to measure the relevance between utterances and responses directly. We evaluate our proposed approach in both English and Chinese: (i) For English, we evaluate DocChat on WikiQA and QASent, two answer sentence selection tasks, and compare it with state-of-the-art methods. Reasonable improvements and good adaptability are observed. (ii) For Chinese, we compare DocChat with XiaoIce, a famous chitchat engine in China, and a side-by-side evaluation shows that DocChat is a perfect complement for chatbot engines that use Q-R pairs as the main source of responses.
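The ranking idea in the abstract can be illustrated with a minimal sketch: each candidate sentence from a document is scored against the utterance by several features computed at different granularities, and the scores are combined into one relevance value. The three features below (word overlap, bigram overlap, bag-of-words cosine), the function names, and the fixed weights are all assumptions chosen for illustration; they are not the paper's actual feature set, and a real learning-to-rank model would learn the weights from labeled data.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def word_overlap(utterance, sentence):
    """Word-level feature: share of utterance words found in the sentence."""
    u, s = set(tokenize(utterance)), set(tokenize(sentence))
    return len(u & s) / len(u) if u else 0.0


def bigram_overlap(utterance, sentence):
    """Phrase-level feature: share of utterance bigrams found in the sentence."""
    def bigrams(tokens):
        return set(zip(tokens, tokens[1:]))

    u, s = bigrams(tokenize(utterance)), bigrams(tokenize(sentence))
    return len(u & s) / len(u) if u else 0.0


def cosine(utterance, sentence):
    """Sentence-level feature: cosine similarity of word-count vectors."""
    u, s = Counter(tokenize(utterance)), Counter(tokenize(sentence))
    dot = sum(u[w] * s[w] for w in u)
    norm = math.sqrt(sum(v * v for v in u.values())) * math.sqrt(
        sum(v * v for v in s.values())
    )
    return dot / norm if norm else 0.0


# Hypothetical feature set and weights, for illustration only.
FEATURES = (word_overlap, bigram_overlap, cosine)
WEIGHTS = (0.4, 0.3, 0.3)


def score(utterance, sentence, weights=WEIGHTS):
    """Combine the feature values into a single relevance score."""
    return sum(w * f(utterance, sentence) for w, f in zip(weights, FEATURES))


def rank_sentences(utterance, document_sentences, weights=WEIGHTS):
    """Return the candidate sentences sorted from most to least relevant."""
    return sorted(
        document_sentences,
        key=lambda s: score(utterance, s, weights),
        reverse=True,
    )
```

Calling `rank_sentences("where is the eiffel tower", sentences)` on a handful of document sentences returns them ordered by the combined score, so the top item can serve as the response candidate. Keeping each feature as a separate function makes it straightforward to add further granularities or to swap the fixed weights for learned ones.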

Bibliographic details

Source
Annual Meeting of the Association for Computational Linguistics | 2016 | pp. 516-525 | 10 pages
Authors
Zhao Yan; Nan Duan; Junwei Bao; Peng Chen; Ming Zhou; Zhoujun Li; Jianshe Zhou
Original format: PDF

Similar literature

1. An Efficient Hybrid Information Retrieval Approach for Unstructured Document Classification [J]. Weikang Rui. Journal of Computational Intelligence in Bioinformatics, 2018, No. 2.
2. Semantic relation based personalized ranking approach for engineering document retrieval [J]. Gyeong June Hahm, Jae Hyun Lee, Hyo Won Suh. Advanced Engineering Informatics, 2015, No. 3.
3. A personalized query expansion approach for engineering document retrieval [J]. Gyeong June Hahm, Mun Yong Yi, Jae Hyun Lee. Advanced Engineering Informatics, 2014, No. 4.
4. DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents [C]. Zhao Yan, Nan Duan, Junwei Bao, et al. Annual Meeting of the Association for Computational Linguistics, 2016.
5. Parallel information retrieval and visualization on large, unstructured document collections using web link information [D]. Kenneth Lowell Alford. 2000.
6. MetaMap is a superior baseline to a standard document retrieval engine for the task of finding patient cohorts in clinical free text [O]. K. Bretonnel Cohen, Tom Christiansen, Lawrence E. Hunter.
7. DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents [O]. Zhao Yan, Nan Duan, Junwei Bao, et al. 2016.
