首页> 外文会议>International Conference on Information Modelling and Knowledge Bases >Topic Extraction from Millions of Tweets Based on Community Detection in Bipartite Networks
【24h】

Topic Extraction from Millions of Tweets Based on Community Detection in Bipartite Networks

机译:主题提取从数百万次推文基于群落检测的二分网络中的推文

获取原文

摘要

Social media offers a wealth of insight into how significant topics such as the Great East Japan Earthquake, the Arab Spring, and the Boston Bombing affect individuals. The scale of available data, however, can be intimidating: during the Great East Japan Earthquake, over 8 million tweets were sent each day from Japan alone. Conventional word vector-based topic-detection techniques for social media that use Latent Semantic Analysis, Latent Dirichlet Allocation, or graph community detection often cannot extract appropriate topics from such a large volume of data with accuracy due to their space and time complexity. To alleviate this problem, we propose an effective topic extraction from millions of tweets based on community detection in bipartite networks. Our method is based on the bipartite community detection technique developed by Okamoto, one of the authors of this paper. The paper demonstrates our method effectiveness on social media analysis and identifies topics from millions of tweets after the Great East Japan Earthquake. To show our method's effectiveness, we compute the coherence measure that can evaluate the semantic accuracy and the running time, and compare the method with LDA that is the major topic model.
机译:社交媒体提供了丰富的洞察力,探讨了大东日本地震,阿拉伯春天和波士顿轰炸的重要主题如何影响个人。然而,可用数据的规模可能是恐吓:在大东日本地震期间,每天单独从日本每天发过超过800万推文。用于使用潜在语义分析,潜在的Dirichlet分配或图形社区检测的社交媒体的传统信息的主题检测技​​术通常不能通过它们的空间和时间复杂度来从这些大量数据中提取适当的主题。为了减轻这个问题,我们提出了一种基于二分网络中的群落检测数百万推文的有效提取。我们的方法是基于由冈萨托开发的二分区社区检测技术,本文的作者之一。本文展示了我们对社交媒体分析的有效性,并在大东日本地震之后识别数百万推文的主题。为了展示我们的方法的效率,我们计算了可以评估语义准确性和运行时间的相干度量,并将方法与LDA进行比较,即主要主题模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号