首页> 外文会议>Federated Conference on Computer Science and Information Systems >A New Subject-based Document Retrieval from Digital Libraries Using Vector Space Model
【24h】

A New Subject-based Document Retrieval from Digital Libraries Using Vector Space Model

机译:使用矢量空间模型的数字图书馆的基于基于主题的文档检索

获取原文

摘要

Document retrieval from digital libraries based on user's query is highly affected by the terms appeared in the query. In many cases, there are some documents in the digital libraries that do not share exactly the same terms with the query, but they are related to the user's need. We address this challenge in this paper by introducing a new subject-based retrieval approach in which, apart from ranking documents based on the terms in the query, a new subject-based scoring scheme is defined between the query and a document. We define this score by introducing a new vector space model in which a vectorized subject-based representation is defined for each document and its keywords, and the terms in the query, as well. We have tested the new subject-based scoring scheme on a database of scientific papers obtained from Web of Science. Our Experimental results show that in 83% of times users prefer the proposed scoring scheme with respect to the classic scoring ones.
机译:根据用户查询的数字库从数字库检索的文档受到查询中出现的术语的高度影响。在许多情况下,数字库中有一些文档不会与查询分享完全相同的术语,但它们与用户的需要有关。我们通过引入新的基于主题的检索方法来解决这一挑战,其中,除了基于查询中的术语的排序文档之外,在查询和文档之间定义了新的基于主题的评分方案。我们通过引入新的向量空间模型来定义此分数,其中为每个文档及其关键字定义了一种基于主题的表示,以及查询中的术语。我们已经测试了从科学网获得的科学论文数据库的基于基于主题的评分计划。我们的实验结果表明,在83%的次数中,用户更喜欢拟议的评分方案相对于经典得分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号