...
首页> 外文期刊>International Journal of Web Based Communities >Streaming trend detection in Twitter
【24h】

Streaming trend detection in Twitter

机译:Twitter中的流趋势检测

获取原文
获取原文并翻译 | 示例

摘要

As social media continue to grow, the Zeitgeist of society is increasingly found not in the headlines of traditional media institutions, but in the activity of ordinary individuals. The identification of trending topics utilises social media (such as Twitter) to provide an overview of the topics and issues that are currently popular within the online community. In this paper, we outline methodologies of detecting and identifying trending topics from streaming data. Data from Twitter's streaming API was collected and put into documents of equal duration using data collection procedures that allow for analysis over multiple timespans, including those not currently associated with Twitter-identified trending topics. Term frequency-inverse document frequency analysis and relative normalised term frequency analysis were performed on the documents to identify the trending topics. Relative normalised term frequency analysis identified unigrams, bigrams, and trigrams as trending topics, while term frequency-inverse document frequency analysis identified unigrams as trending topics. Application of these methodologies to streaming data resulted in F-measures ranging from 0.1468 to 0.7508.
机译:随着社交媒体的不断发展,越来越多的社会时代精神不再出现在传统媒体机构的头条新闻中,而是出现在普通人的活动中。趋势主题的标识利用社交媒体(例如Twitter)提供当前在线社区中流行的主题和问题的概述。在本文中,我们概述了从流数据中检测和识别趋势主题的方法。使用数据收集程序收集来自Twitter流API的数据,并将其放入等长的文档中,该过程可进行多个时间范围内的分析,包括当前与Twitter识别的趋势主题不相关的时间范围。对文档执行了词频逆文档频率分析和相对归一化的词频分析,以识别趋势主题。相对归一化的词频分析将字母组合,双字母组和三字母组合识别为趋势主题,而词频逆文档频率分析将字母组合作为趋势主题。这些方法在流数据上的应用导致F度量范围从0.1468到0.7508。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号