首页> 外文期刊>Journal of web semantics: >Semantically enhanced Information Retrieval: An ontology-based approach
【24h】

Semantically enhanced Information Retrieval: An ontology-based approach

机译:语义增强的信息检索:基于本体的方法

获取原文
获取原文并翻译 | 示例
           

摘要

Currently, techniques for content description and query processing in Information Retrieval (IR) are based on keywords, and therefore provide limited capabilities to capture the conceptualizations associated with user needs and contents. Aiming to solve the limitations of keyword-based models, the idea of conceptual search, understood as searching by meanings rather than literal strings, has been the focus of a wide body of research in the IR field. More recently, it has been used as a prototypical scenario (or even envisioned as a potential "killer app") in the Semantic Web (SW) vision, since its emergence in the late nineties. However, current approaches to semantic search developed in the SW area have not yet taken full advantage of the acquired knowledge, accumulated experience, and technological sophistication achieved through several decades of work in the IR field. Starting from this position, this work investigates the definition of an ontology-based IR model, oriented to the exploitation of domain Knowledge Bases to support semantic search capabilities in large document repositories, stressing on the one hand the use of fully fledged ontologies in the semantic-based perspective, and on the other hand the consideration of unstructured content as the target search space. The major contribution of this work is an innovative, comprehensive semantic search model, which extends the classic IR model, addresses the challenges of the massive and heterogeneous Web environment, and integrates the benefits of both keyword and semantic-based search. Additional contributions include: an innovative rank fusion technique that minimizes the undesired effects of knowledge sparseness on the yet juvenile SW, and the creation of a large-scale evaluation benchmark, based on TREC IR evaluation standards, which allows a rigorous comparison between IR and SW approaches. Conducted experiments show that our semantic search model obtained comparable and better performance results (in terms of MAP and P@10 values) than the best TREC automatic system.
机译:当前,信息检索(IR)中用于内容描述和查询处理的技术基于关键字,因此提供了捕获与用户需求和内容关联的概念化的有限功能。为了解决基于关键字的模型的局限性,概念搜索的概念(即按含义而不是按字面意义的字符串进行搜索)已成为IR领域广泛研究的重点。自从90年代末出现以来,它已在语义网(SW)愿景中用作原型场景(甚至被设想为潜在的“杀手级应用”)。但是,在西南地区开发的当前语义搜索方法尚未充分利用通过IR领域数十年的工作而获得的知识,积累的经验和技术的先进性。从这个位置开始,这项工作研究了基于本体的IR模型的定义,该模型旨在利用领域知识库来支持大型文档存储库中的语义搜索功能,一方面强调在语义中使用成熟的本体基于角度的观点,另一方面将非结构化内容视为目标搜索空间。这项工作的主要贡献是创新的,全面的语义搜索模型,该模型扩展了经典的IR模型,解决了庞大且异构的Web环境的挑战,并集成了关键字搜索和基于语义的搜索的好处。其他贡献包括:创新的等级融合技术,最大程度地减少了知识稀疏性对未成年人SW的不良影响;以及基于TREC IR评估标准创建了大规模评估基准,可以对IR和SW进行严格的比较方法。进行的实验表明,与最佳TREC自动系统相比,我们的语义搜索模型获得了可比且更好的性能结果(就MAP和P @ 10值而言)。

著录项

  • 来源
    《Journal of web semantics:》 |2011年第4期|p.434-452|共19页
  • 作者单位

    Knowledge Media Institute, The Open University, Milton Keynes MK7 6AA, United Kingdom;

    Departamento de Ingenieria Informatica, Universidad Autonoma de Madrid, Madrid, Spain;

    Knowledge Media Institute, The Open University, Milton Keynes MK7 6AA, United Kingdom;

    Departamento de Ingenieria Informatica, Universidad Autonoma de Madrid, Madrid, Spain;

    Departamento de Ingenieria Informatica, Universidad Autonoma de Madrid, Madrid, Spain;

    Knowledge Media Institute, The Open University, Milton Keynes MK7 6AA, United Kingdom;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    semantic web; information retrieval; semantic search;

    机译:语义网信息检索;语义搜索;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号