...
首页> 外文期刊>Knowledge-Based Systems >Using time topic modeling for semantics-based dynamic research interest finding
【24h】

Using time topic modeling for semantics-based dynamic research interest finding

机译:使用时间主题建模进行基于语义的动态研究兴趣发现

获取原文
获取原文并翻译 | 示例

摘要

Researchers interests finding has been an active area of investigation for different recommendation tasks. Previous approaches for finding researchers interests exploit writing styles and links connectivity by considering time of documents, while semantics-based intrinsic structure of words is ignored. Consequently, a topic model named Author-Topic model is proposed, which exploits semantics-based intrinsic structure of words present between the authors of research papers. It ignores simultaneous modeling of time factor which results in exchangeability of topics problem, which is important factor to deal with when finding dynamic research interests. For example, in many real world applications, like finding reviewers for papers and finding taggers in the social tagging systems one need to consider different time periods. In this paper, we present time topic modeling approach named Temporal-Author-Topic (TAT) which can simultaneously model text, researchers and time of research papers to overcome the exchangeability of topics problem. The mixture distribution over topics is influenced by both co-occurrences of words and timestamps of the research papers. Consequently, topics occurrence and their related researchers change over time, while the meaning of particular topic almost remains unchanged. Proposed approach is used to discover topically related researchers for different time periods. We also show how their interests and relationships change over a time period. Empirical results on large research papers corpus show the effectiveness of our proposed approach and dominance over Author-Topic (AT) model, by handling the exchangeability of topics problem, which enables it to obtain similar meaning of particular topic overtime.
机译:研究人员的兴趣发现一直是针对不同推荐任务进行调查的活跃领域。寻找研究人员兴趣的先前方法是通过考虑文档时间来利用写作风格和链接连通性,而忽略了基于语义的单词内在结构。因此,提出了一个名为作者-主题模型的主题模型,该模型利用了研究论文作者之间基于语义的单词内在结构。它忽略了时间因素的同时建模,这导致了主题问题的可交换性,而这是发现动态研究兴趣时要处理的重要因素。例如,在许多现实世界的应用程序中,例如查找论文的审稿人和在社会标签系统中查找标签者,都需要考虑不同的时间段。在本文中,我们提出了一种称为“时间-作者-主题”(TAT)的时间主题建模方法,该方法可以同时对文本,研究人员和研究论文的时间进行建模,以克服主题问题的可交换性。主题的混合分布受单词的同时出现和研究论文的时间戳的影响。因此,主题的发生及其相关的研究人员会随时间而变化,而特定主题的含义几乎保持不变。建议的方法用于发现不同时间段内与主题相关的研究人员。我们还将展示他们的兴趣和关系在一段时间内的变化。大型研究论文语料库上的实证结果表明,通过处理主题问题的可交换性,我们提出的方法的有效性和对作者主题(AT)模型的支配性,使其能够获得特定主题的相似含义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号