首页> 中文期刊>计算机辅助设计与图形学学报 >基于概率主题建模的新闻文本可视化综述

基于概率主题建模的新闻文本可视化综述

     

摘要

With the development of information technology, traditional media is making transformation to the“New Media”, which is based on the Internet. Meanwhile, data mining and natural language processing have been developed greatly these years. These technologies are utilized to uncover the semantics and topics in news articles. In addition, the overflow of information motivates the development of new visualization methods. Overall, these methods have great potential of making people better understand news. LDA is a promising topic modeling method, which is adopted by lots of researchers. In this paper, we first introduced the text probabilistic topic modeling technologies based on LDA and discussed the unique characteristics of topic modeling for news. Then, we summarized different visualization methods of topic modeling for news. We also analyzed and compared advantages and shortcomings of these methods. At last, we discussed some future directions of topic modeling for news.%伴随着信息技术的发展,传统纸质新闻逐渐向新媒体新闻转变。与此同时,近年来数据挖掘和自然语言处理等技术得到了极大的发展,使得对新闻所蕴含丰富语义和主题进行深度挖掘成为可能。然而,信息的超载使得主题可视化成为一个新的挑战,即如何以更好的方式来呈现海量互联网文本所蕴含的主题。隐形语义分析(LDA)是近年来兴起的主题建模方法,被当前学术界认为是主流的主题建模技术。文中首先介绍以LDA为主的文本概率主题建模技术及其发展,讨论了新闻主题建模特点;随后概括对比新闻主题可视化的若干方法,并对其进行分类,分析不同方法的适用性和局限性;最后对新闻主题可视化进行总结和展望。

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号