首页> 外文期刊>Fundamenta Informaticae >Interactive Method for Semantic Document Indexing Based on Explicit Semantic Analysis
【24h】

Interactive Method for Semantic Document Indexing Based on Explicit Semantic Analysis

机译:基于显式语义分析的交互式语义文档索引方法

获取原文
获取原文并翻译 | 示例

摘要

In this article we propose a general framework incorporating semantic indexing and search of texts within scientific document repositories. In our approach, a semantic interpreter, which can be seen as a tool for automatic tagging of textual data, is interactively updated based on feedback from the users, in order to improve quality of the tags that it produces. In our experiments, we index our document corpus using the Explicit Semantic Analysis (ESA) method. In this algorithm, an external knowledge base is used to measure relatedness between words and concepts, and those assessments are utilized to assign meaningful concepts to given texts. In the paper, we explain how the weights expressing relations between particular words and concepts can be improved by interaction with users or by employment of expert knowledge. We also present some results of experiments on a document corpus acquired from the PubMed Central repository to show feasibility of our approach.
机译:在本文中,我们提出了一个通用框架,该框架结合了语义索引和科学文档存储库中的文本搜索。在我们的方法中,可以将语义解释器(可以看作是自动标记文本数据的工具)根据用户的反馈进行交互式更新,以提高其生成的标签的质量。在我们的实验中,我们使用显式语义分析(ESA)方法索引文档语料库。在该算法中,外部知识库用于度量单词和概念之间的相关性,而这些评估则用于为给定的文本分配有意义的概念。在本文中,我们解释了如何通过与用户交互或通过使用专家知识来提高表示特定单词与概念之间关系的权重。我们还介绍了从PubMed Central存储库中获得的文档语料库的实验结果,以表明我们方法的可行性。

著录项

  • 来源
    《Fundamenta Informaticae》 |2014年第3期|423-438|共16页
  • 作者单位

    Faculty of Mathematics, Informatics and Mechanics, The University of Warsaw Banacha 2, 02-097, Warsaw Poland;

    Faculty of Mathematics, Informatics and Mechanics, The University of Warsaw Banacha 2, 02-097, Warsaw Poland;

    Computer Science, The Main School of Fire Service, Slowackiego 52/54, 01-629 Warsaw Poland;

    Faculty of Mathematics, Informatics and Mechanics, The University of Warsaw Banacha 2, 02-097, Warsaw Poland;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Semantic Search; Interactive Learning; Explicit Semantic Analysis; PubMed; MeSH;

    机译:语义搜索;互动学习;显式语义分析;PubMed;啮合;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号