首页> 外文会议>Asia Pacific Web and Web-Age Information Management >A Local-Global LDA Model for Discovering Geographical Topics from Social Media
【24h】

A Local-Global LDA Model for Discovering Geographical Topics from Social Media

机译:用于从社交媒体发现地理主题的本地全球LDA模型

获取原文

摘要

Micro-blogging services can track users' geo-locations when users check-in their places or use geo-tagging which implicitly reveals locations. This "geo tracking" can help to find topics triggered by certain events in certain regions. However, discovering such topics is very challenging because of the large amount of noisy messages (e.g. daily conversations). This paper proposes a method to model geographical topics, which can filter out irrelevant words by different weights in the local and global contexts. Our method is based on the Latent Dirichlet Allocation (LDA) model but each word is generated from either a local or a global topic distribution by its generation probabilities. We evaluated our model with data collected from Weibo, which is currently the most popular micro-blogging service for Chinese. The evaluation results demonstrate that our method outperforms other baseline methods in several metrics such as model perplexity, two kinds of entropies and KL-divergence of discovered topics.
机译:微博客服务可以在用户办理登机位置或使用隐含地显示位置的地理标记时跟踪用户的地理位置。这种“Geo跟踪”可以帮助找到某些区域中某些事件触发的主题。然而,由于大量嘈杂的消息(例如,日常对话),发现这种主题是非常具有挑战性的。本文提出了一种模拟地理主题的方法,可以在本地和全局背景下通过不同权重滤除无关的单词。我们的方法基于潜在的Dirichlet分配(LDA)模型,但是每个单词由其生成概率从本地或全局主题分布生成。我们使用从微博收集的数据进行了评估,这是目前最受欢迎的Micro-Blogging服务。评估结果表明,我们的方法在若干度量中占据了诸如模型困惑,两种熵和发现主题的熵和kl分歧的其他基线方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号