【24h】

Prediction of Twitter Message Deletion

机译:Twitter邮件删除的预测

获取原文

摘要

Social media are a way for people to build their reputation or to promote an idea. Twitter, in contrast with other social media sources, is a generator of real-time textual information, and it is mainly used to share ideas, opinions and breaking news. It is meant for short, quick, compelling statements that reach out millions of users around the world. Posting something inappropriate may affect the public image, privacy of celebrities, politicians as well as ordinary Twitter users. If we could in advance alarm the user of the potential vulnerability in the message to be posted we could protect his/her identity from being compromised. So, automatic identification of the message with the content causing it to be deleted in the future is a promising area of research. In this paper, we are analyzing Twitter messages in English language with the objective to build a classifier to predict whether a particular post will be deleted by the user or not. We apply the Recurrent Neural Networks (RNN) model that relies on the context-based information of tweets while doing the classification. An additional contribution of the work is the construction of a rich set of features including twitter metadata, user information and tweets’ text to train classical machine learning algorithms on Twitter data.
机译:社交媒体是人们建立声誉或推广想法的一种方式。与其他社交媒体资源相比,Twitter是实时文本信息的生成器,它主要用于共享想法,观点和突发新闻。它的意思是简短,快速,引人注目的陈述,覆盖全球数百万用户。发布不当内容可能会影响公众形象,名人,政客以及普通Twitter用户的隐私。如果我们可以提前警告用户要发布的消息中的潜在漏洞,我们可以保护其身份免遭泄露。因此,自动识别带有内容的消息会导致将来删除它,这是一个有前途的研究领域。在本文中,我们正在分析英语的Twitter消息,目的是建立一个分类器,以预测特定帖子是否会被用户删除。我们在进行分类时应用了依赖于推文基于上下文的信息的递归神经网络(RNN)模型。这项工作的另一个贡献是构建了一系列丰富的功能,包括推特元数据,用户信息和推文文本,以在推特数据上训练经典的机器学习算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号