【24h】

Credibility in Context: An Analysis of Feature Distributions in Twitter

机译:上下文中的可信度:Twitter中的功能分布分析

获取原文
获取原文并翻译 | 示例

摘要

Twitter is a major forum for rapid dissemination of user-provided content in real time. As such, a large proportion of the information it contains is not particularly relevant to many users and in fact is perceived as unwanted 'noise' by many. There has been increased research interest in predicting whether tweets are relevant, newsworthy or credible, using a variety of models and methods. In this paper, we focus on an analysis that highlights the utility of the individual features in Twitter such as hash tags, retweets and mentions for predicting credibility. We first describe a context-based evaluation of the utility of a set of features for predicting manually provided credibility assessments on a corpus of microblog tweets. This is followed by an evaluation of the distribution/presence of each feature across 8 diverse crawls of tweet data. Last, an analysis of feature distribution across dyadic pairs of tweets and retweet chains of various lengths is described. Our results show that the best indicators of credibility include URLs, mentions, retweets and tweet length and that features occur more prominently in data describing emergency and unrest situations.
机译:Twitter是一个主要论坛,可实时快速分发用户提供的内容。因此,其中包含的大部分信息与许多用户并没有特别的关系,实际上,许多用户将其视为有害的“噪音”。使用各种模型和方法来预测推文是否相关,具有新闻价值或可信度的研究兴趣越来越高。在本文中,我们将重点放在分析上,该分析突出显示了Twitter中各个功能的效用,例如哈希标签,转发和提及可信度的提及。我们首先描述了一组功能的实用性的基于上下文的评估,该功能用于预测在微博推文集上手动提供的信誉评估。接下来是对8种不同的tweet数据爬网中每个功能的分布/状态进行评估。最后,描述了跨各种长度的推文和转推链的双对特征分布。我们的结果表明,最好的可信度指标包括URL,提及,转发和推文长度,并且这些特征在描述紧急情况和动乱情况的数据中更加突出。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号