首页> 外文会议>ISPRS Geospatial Week >A TOPIC MODELING BASED REPRESENTATION TO DETECT TWEET LOCATIONS. EXAMPLE OF THE EVENT 'JE SUIS CHARLIE'
【24h】

A TOPIC MODELING BASED REPRESENTATION TO DETECT TWEET LOCATIONS. EXAMPLE OF THE EVENT 'JE SUIS CHARLIE'

机译:基于主题建模的表示来检测推文位置。事件的示例“Je Suis Charlie”

获取原文

摘要

Social Networks became a major actor in information propagation. Using the Twitter popular platform, mobile users post or relay messages from different locations. The tweet content, meaning and location, show how an event-such as the bursty one "JeSuisCharlie", happened in France in January 2015, is comprehended in different countries. This research aims at clustering the tweets according to the co-occurrence of their terms, including the country, and forecasting the probable country of a non-located tweet, knowing its content. First, we present the process of collecting a large quantity of data from the Twitter website. We finally have a set of 2,189 located tweets about "Charlie", from the 7th to the 14th of January. We describe an original method adapted from the Author-Topic (AT) model based on the Latent Dirichlet Allocation (LDA) method. We define an homogeneous space containing both lexical content (words) and spatial information (country). During a training process on a part of the sample, we provide a set of clusters (topics) based on statistical relations between lexical and spatial terms. During a clustering task, we evaluate the method effectiveness on the rest of the sample that reaches up to 95% of good assignment. It shows that our model is pertinent to foresee tweet location after a learning process.
机译:社交网络成为信息传播中的主要演员。使用Twitter流行平台,移动用户从不同位置发布或中继消息。推文内容,意义和位置,展示了一个事件 - 例如突发一个“jesuischarlie”,在2015年1月在法国发生在不同国家。本研究旨在根据其术语的共同发生,包括该国的共同发生,并预测未位于未位推文的可能国家,了解其内容。首先,我们介绍了从Twitter网站收集大量数据的过程。我们终于从1月7日到14日到了一段关于“查理”的推文。我们描述了一种原始方法,根据潜在的Dirichlet分配(LDA)方法,从作者 - 主题(AT)模型调整。我们定义了包含词汇内容(单词)和空间信息(国家)的同质空间。在样本的培训过程中,我们提供了一组基于词汇和空间术语之间的统计关系的集群(主题)。在群集任务期间,我们评估了其余的样本中的方法效果,达到了高达95%的好分配。它表明,我们的模型与学习过程之后的预见到预见位置有关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号