【2h】

Predicting clicks of PubMed articles

机译:预测PubMed文章的点击

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Predicting the popularity or access usage of an article has the potential to improve the quality of PubMed searches. We can model the click trend of each article as its access changes over time by mining the PubMed query logs, which contain the previous access history for all articles. In this article, we examine the access patterns produced by PubMed users in two years (July 2009 to July 2011). We explore the time series of accesses for each article in the query logs, model the trends with regression approaches, and subsequently use the models for prediction. We show that the click trends of PubMed articles are best fitted with a log-normal regression model. This model allows the number of accesses an article receives and the time since it first becomes available in PubMed to be related via quadratic and logistic functions, with the model parameters to be estimated via maximum likelihood. Our experiments predicting the number of accesses for an article based on its past usage demonstrate that the mean absolute error and mean absolute percentage error of our model are 4.0% and 8.1% lower than the power-law regression model, respectively. The log-normal distribution is also shown to perform significantly better than a previous prediction method based on a human memory theory in cognitive science. This work warrants further investigation on the utility of such a log-normal regression approach towards improving information access in PubMed.
机译:预测文章的受欢迎程度或访问使用率可能会提高PubMed搜索的质量。通过挖掘PubMed查询日志(其中包含所有文章的先前访问历史记录),我们可以将每篇文章的点击趋势建模为随着访问时间的变化。在本文中,我们研究了PubMed用户在两年内(2009年7月至2011年7月)产生的访问模式。我们探索查询日志中每篇文章的访问时间序列,使用回归方法对趋势进行建模,然后使用这些模型进行预测。我们显示PubMed文章的点击趋势最适合对数正态回归模型。该模型允许通过二次函数和逻辑函数将文章接收的访问次数以及自该文章首次在PubMed中可用以来的时间与模型参数相关联,并通过最大似然估计模型参数。我们根据过去使用情况预测文章访问次数的实验表明,我们的模型的平均绝对误差和平均绝对百分比误差分别比幂律回归模型低4.0%和8.1%。对数正态分布还显示出比基于认知科学中人类记忆理论的先前预测方法明显更好的性能。这项工作值得进一步研究这种对数正态回归方法对改善PubMed中的信息访问的效用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号