首页> 外文期刊>Computer Science & Information Technology >A Text Mining Research Based on LDA Topic Modelling
【24h】

A Text Mining Research Based on LDA Topic Modelling

机译:基于LDA主题建模的文本挖掘研究

获取原文
       

摘要

A Large number of digital text information is generated every day. Effectively searching,managing and exploring the text data has become a main task. In this paper, we first representan introduction to text mining and a probabilistic topic model Latent Dirichlet allocation. Thentwo experiments are proposed - Wikipedia articles and users’ tweets topic modelling. Theformer one builds up a document topic model, aiming to a topic perspective solution onsearching, exploring and recommending articles. The latter one sets up a user topic model,providing a full research and analysis over Twitter users’ interest. The experiment processincluding data collecting, data pre-processing and model training is fully documented andcommented. Further more, the conclusion and application of this paper could be a usefulcomputation tool for social and business research.
机译:每天都会产生大量的数字文本信息。有效地搜索,管理和探索文本数据已成为一项主要任务。在本文中,我们首先对文本挖掘和概率主题模型Latent Dirichlet分配进行介绍。然后提出了两个实验-维基百科文章和用户的推文主题建模。前者建立了一个文档主题模型,旨在针对搜索,探索和推荐文章的主题角度解决方案。后者建立了一个用户主题模型,可以对Twitter用户的兴趣进行全面的研究和分析。实验过程包括数据收集,数据预处理和模型训练已得到充分记录和评论。此外,本文的结论和应用可能是社会和商业研究的有用的计算工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号