...
首页> 外文期刊>International journal on digital libraries >Hidden features identification for designing an efficient research article recommendation system
【24h】

Hidden features identification for designing an efficient research article recommendation system

机译:用于设计高效研究文章推荐系统的隐藏功能识别

获取原文
获取原文并翻译 | 示例
           

摘要

The digital repository of research articles is increasing at a rapid rate and hence searching the right paper becoming a tedious task for researchers. A research paper recommendation system is advocated to help researchers in this context. In the process of designing such a system, proper representation of articles, more specifically, feature identification and extraction are two essential tasks. The existing approaches mainly consider direct features which are readily available from research articles. However, there are certain features which are not readily available from a paper, but may greatly influence the performance of recommendation systems. This paper proposes four indirect features: keyword diversification, text complexity, citation analysis over time, and scientific quality measurement to represent a research article. The keyword diversification measures the uniqueness of the keywords of a paper which helps variation in recommendation. The text complexity measurement helps to provide a paper by matching the user's understandability level. The citation analysis over time decides the relevancy of a paper. The scientific quality measurement helps to measure the scientific values of papers. Formal definitions of the proposed indirect features, schemes to extract the feature values given a research article, and metrics to measure them quantitatively are discussed in this paper. To substantiate the efficacy of the proposed features, a number of experiments have been carried out. The experimental results reveal that the proposed indirect features uniquely define a research article than the direct features. Given a research paper, extraction of feature vector is computationally fast and thus feasible to filter a large corpus of papers in real time. More significantly, indirect features are matchable with user's profile features, thus satisfying an important criterion in collaborative filtering.
机译:研究文章的数字资源库以极快的速度在增加,因此寻找合适的纸张,成为研究人员乏味的任务。一份研究报告推荐系统主张在这方面帮助研究人员。在设计这样一个系统,物品适当的代表的过程中,更具体地说,特征识别和提取是两个基本任务。现有的方法主要考虑直接的特点,其很容易从研究文章。然而,有某些特征不容易获得由纸,但也可以大大影响推荐系统的性能。本文提出了4个间接功能:关键字多样化,复杂性的文字,随着时间的推移引文分析,科学的质量测量来表示的研究文章。关键字多样化措施的纸,这有助于推荐变化的关键字的唯一性。文本复杂的测量有助于通过匹配用户的可理解性水平提供了一份文件。随着时间的推移引文分析确定一篇论文的相关性。科学的质量测量有助于衡量论文的科学价值。所提出的间接功能,方案的正式定义提取赋予了一篇研究文章中的特征值和标准来衡量他们定量在本文中讨论。为了证实所提出的功能功效,一些实验已经进行了。实验结果表明,所提出的间接功能唯一确定比直接特征的研究文章。给定一个研究报告,特征向量的提取是计算速度快,因此,可行的筛选大量语料的实时文件。更显著的,间接的功能是否匹配与用户的配置文件的功能,于是满足协同过滤的一个重要标准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号