【24h】

Credibility in Context: An Analysis of Feature Distributions in Twitter

机译:语境中的可信度:Twitter中的特征分布分析

获取原文

摘要

Twitter is a major forum for rapid dissemination of user-provided content in real time. As such, a large proportion of the information it contains is not particularly relevant to many users and in fact is perceived as unwanted 'noise' by many. There has been increased research interest in predicting whether tweets are relevant, newsworthy or credible, using a variety of models and methods. In this paper, we focus on an analysis that highlights the utility of the individual features in Twitter such as hash tags, retweets and mentions for predicting credibility. We first describe a context-based evaluation of the utility of a set of features for predicting manually provided credibility assessments on a corpus of microblog tweets. This is followed by an evaluation of the distribution/presence of each feature across 8 diverse crawls of tweet data. Last, an analysis of feature distribution across dyadic pairs of tweets and retweet chains of various lengths is described. Our results show that the best indicators of credibility include URLs, mentions, retweets and tweet length and that features occur more prominently in data describing emergency and unrest situations.
机译:Twitter是一个主要的论坛,实时快速传播用户提供的内容。因此,它包含的大部分信息与许多用户没有特别相关,事实上被认为是许多人的不必要的“噪音”。使用各种模型和方法预测推文是否具有相关性,新闻价值或可信的研究兴趣。在本文中,我们专注于分析,突出显示Twitter中的个别功能的效用,例如哈希标签,转发和提及以预测可信度。我们首先描述了一种基于语境的评估,对一组特征的实用性进行了预测,用于预测手动为微博推文的语料库提供可信度评估。然后,这是评估跨越8个推文数据爬网的每个功能的分布/存在。最后,描述了各种长度的二次发布和转发链的特征分布的分析。我们的研究结果表明,最佳可信度指标包括URL,提到,转发和推文长度,并且在描述紧急情况和骚乱情况的数据中更加突出地发生。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号