TREC2001 Question-Answer, Web and Cross Language Experiments Using PIRCS

Abstract

We applied our PIRCS system to the Question-Answer (QA) track, the ad-hoc Web retrieval track using the 10-GB collection, and the English-Arabic cross-language track. We also attempted to complete the adaptive filtering experiments with our upgraded programs but found that we did not have sufficient time to do so. The QA track requires obtaining 50-byte answer strings to 500 questions (later truncated to 492). The answers are to be retrieved from documents made up from the TREC collections: AP1-3, WSJ1-2, SJMN-3, FT-4, LA-5 and FBIS-5. Our QA system is constructed using methods of classical IR, enhanced with simple heuristics. It does not have natural language understanding capabilities, but employs simple pattern matching and statistics. We view QA as a three-step process: (1) retrieving a set of documents that are highly related to the topic of the question; (2) weighting the sentences in this document set by how likely they are to answer the question, according to the query type and its description; and (3) selecting words from the top-scoring sentences to form the answer string. This approach was quite successful for the 250-byte answer task at TREC-9. This year we added more heuristics, better pattern recognition and entity recognition.
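
The three-step process described in the abstract can be illustrated with a toy sketch. This is not the PIRCS implementation: the term-overlap scoring, the two regular-expression patterns, the stopword list, and every function name below are assumptions made only for illustration. The sketch retrieves documents by term overlap, weights sentences by overlap plus a bonus when a pattern for the question type matches, and then selects words up to a byte limit.

```python
import re

# Hypothetical stopword list; the abstract does not say what PIRCS uses.
STOPWORDS = {"a", "an", "the", "of", "in", "on", "at", "to", "is", "was",
             "did", "do", "does", "what", "who", "when", "where", "which", "how"}

# Hypothetical answer patterns keyed by question type (illustration only).
PATTERNS = {
    "DATE": re.compile(r"\b(?:1[0-9]{3}|20[0-9]{2})\b"),
    "PERSON": re.compile(r"\b[A-Z][a-z]+ [A-Z][a-z]+\b"),
}


def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())


def content_terms(text):
    return {t for t in tokenize(text) if t not in STOPWORDS}


def question_type(question):
    # Guess the expected answer type from the first word of the question.
    first = tokenize(question)[0]
    return {"when": "DATE", "who": "PERSON"}.get(first, "OTHER")


def retrieve(question, docs, k=2):
    # Step 1: keep the k documents sharing the most terms with the question.
    q = content_terms(question)
    return sorted(docs, key=lambda d: len(q & set(tokenize(d))), reverse=True)[:k]


def score_sentence(sentence, question):
    # Step 2: term overlap, plus a bonus if the type pattern matches.
    overlap = len(content_terms(question) & set(tokenize(sentence)))
    pattern = PATTERNS.get(question_type(question))
    return overlap + (1 if pattern and pattern.search(sentence) else 0)


def rank_sentences(question, docs):
    sentences = [s.strip() for d in docs
                 for s in re.split(r"(?<=[.!?])\s+", d) if s.strip()]
    return sorted(sentences, key=lambda s: score_sentence(s, question), reverse=True)


def truncate_bytes(text, max_bytes):
    return text.encode("utf-8")[:max_bytes].decode("utf-8", "ignore")


def select_answer(question, sentence, max_bytes=50):
    # Step 3: prefer a pattern match; otherwise keep the non-question words.
    pattern = PATTERNS.get(question_type(question))
    match = pattern.search(sentence) if pattern else None
    if match:
        return truncate_bytes(match.group(0), max_bytes)
    skip = content_terms(question) | STOPWORDS
    words = [w.strip(".,?!") for w in sentence.split()
             if w.lower().strip(".,?!") not in skip]
    return truncate_bytes(" ".join(words), max_bytes)


def answer(question, docs, max_bytes=50):
    best = rank_sentences(question, retrieve(question, docs))[0]
    return select_answer(question, best, max_bytes)


docs = [
    "The Berlin Wall fell in 1989. Crowds gathered at the border crossings.",
    "The stock market rose sharply on Tuesday. Analysts were surprised.",
]
print(answer("When did the Berlin Wall fall?", docs))
```

The `max_bytes` default of 50 mirrors the answer-string limit stated in the abstract; everything else in the sketch is a stand-in for heuristics the abstract does not detail.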
