【24h】

Toward a Document Model for Question Answering Systems

机译:建立问答系统的文档模型

获取原文
获取原文并翻译 | 示例

摘要

The problem of acquiring valuable information from the large amounts available today in electronic media requires automated mechanisms more natural and efficient than those already existing. The trend in the evolution of information retrieval systems goes toward systems capable of answering specific questions formulated by the user in her/his language. The expected answers from such systems are short and accurate sentences, instead of large document lists. On the other hand, the state of the art of these systems is focused -mainly- in the resolution of factual questions, whose answers are named entities (dates, quantities, proper nouns, etc). This paper proposes a model to represent source documents that are then used by question answering systems. The model is based on a representation of a document as a set of named entities (NEs) and their local lexical context. These NEs are extracted and classified automatically by an off-line process. The entities are then taken as instance concepts in an upper ontology and stored as a set of DAML+OIL resources which could be used later by question answering engines. The paper presents a case of study with a news collection in Spanish and some preliminary results.
机译:从当今电子媒体中从大量可用信息中获取有价值的信息的问题要求自动化机制比现有机制更加自然和高效。信息检索系统的发展趋势趋向于能够以用户的语言回答用户提出的特定问题的系统。这种系统的预期答案是简短准确的句子,而不是大量的文档列表。另一方面,这些系统的技术水平主要集中在事实问题的解决上,事实问题的答案称为实体(日期,数量,专有名词等)。本文提出了一个表示源文档的模型,然后由答疑系统使用。该模型基于作为一组命名实体(NE)及其本地词法上下文的文档表示。这些网元是通过离线过程自动提取和分类的。然后将这些实体作为上层本体中的实例概念,并存储为一组DAML + OIL资源,以后可以由问答引擎使用。本文以西班牙新闻集为例,介绍了一个研究案例和一些初步结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号