...
首页> 外文期刊>ACM Transactions on Internet Technology >Learning to Find Answers to Questions on the Web
【24h】

Learning to Find Answers to Questions on the Web

机译:学习在网上找到问题的答案

获取原文
获取原文并翻译 | 示例
           

摘要

We introduce a method for learning to find documents on the Web that contain answers to a given natural language question. In our approach, questions are transformed into new queries aimed at maximizing the probability of retrieving answers from existing information retrieval systems. The method involves automatically learning phrase features for classifying questions into different types, automatically generating candidate query transformations from a training set of question/answer pairs, and automatically evaluating the candidate transformations on target information retrieval systems such as real-world general purpose search engines. At run-time, questions are transformed into a set of queries, and reranking is performed on the documents retrieved. We present a prototype search engine, Tritus, that applies the method to Web search engines. Blind evaluation on a set of real queries from a Web search engine log shows that the method significantly outperforms the underlying search engines, and outperforms a commercial search engine specializing in question answering. Our methodology cleanly supports combining documents retrieved from different search engines, resulting in additional improvement with a system that combines search results from multiple Web search engines.
机译:我们介绍一种学习在Web上查找包含给定自然语言问题答案的文档的方法。在我们的方法中,问题被转换为新查询,旨在最大程度地从现有信息检索系统中检索答案。该方法包括自动学习短语特征,以将问题分类为不同类型,自动从一组问题/答案对的训练集中生成候选查询转换,以及在目标信息检索系统(如现实世界通用搜索引擎)上自动评估候选转换。在运行时,问题将转换为一组查询,并对检索到的文档进行重新排序。我们提供了一个原型搜索引擎Tritus,它将该方法应用于Web搜索引擎。对来自Web搜索引擎日志的一组实际查询的盲目评估显示,该方法明显优于基础搜索引擎,并且胜过了专门解决问题的商业搜索引擎。我们的方法干净利落地支持合并从不同搜索引擎中检索到的文档,并通过合并来自多个Web搜索引擎的搜索结果的系统进一步改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号