Conference: IEEE Annual Ubiquitous Computing, Electronics and Mobile Communication Conference

Using BERT to Extract Topic-Independent Sentiment Features for Social Media Bot Detection



Abstract

Millions of online posts about different topics and products are shared on popular social media platforms. One use of this content is to provide crowd-sourced information about a specific topic, event, or product. However, this use raises an important question: what percentage of the information available through these services is trustworthy? In particular, might some of this information be generated by a machine, i.e., a "bot", instead of a human? Bots can be, and often are, purposely designed to generate enough volume to skew an apparent trend or position on a topic, yet the consumer of such content cannot easily distinguish a bot post from a human post. This paper introduces a new model that uses Bidirectional Encoder Representations from Transformers (Google BERT) for sentiment classification of tweets to identify topic-independent features for a social media bot detection model. Deriving topic-independent features with a Natural Language Processing approach distinguishes this work from previous bot detection models. We achieve 94% accuracy in classifying the contents of the Cresci data set [1] as generated by a bot or a human, whereas the most accurate prior work achieved an accuracy of 92%.
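The abstract does not state which features are derived from the per-tweet sentiment labels, so the following is only a minimal sketch of one plausible reading: summarising each account by statistics of its sentiment labels, which do not depend on what the tweets are about. The function names, the three-label scheme, and the entropy feature are all assumptions made for illustration, and `toy_classify` is a keyword-based placeholder standing in for a fine-tuned BERT sentiment classifier.

```python
from collections import Counter
from math import log2
from typing import Callable, Dict, Sequence

# Assumed label set; the paper's actual sentiment classes are not given in the abstract.
LABELS = ("negative", "neutral", "positive")


def toy_classify(tweet: str) -> str:
    """Placeholder for a BERT sentiment classifier (hypothetical keyword rule)."""
    text = tweet.lower()
    if "good" in text:
        return "positive"
    if "bad" in text:
        return "negative"
    return "neutral"


def account_sentiment_features(
    tweets: Sequence[str], classify: Callable[[str], str]
) -> Dict[str, float]:
    """Summarise one account's tweets as topic-independent sentiment statistics.

    Returns the fraction of tweets per sentiment label and the Shannon
    entropy (in bits) of the label distribution. These features are an
    illustrative assumption, not the feature set used in the paper.
    """
    labels = [classify(t) for t in tweets]
    n = len(labels)
    features = {f"frac_{label}": 0.0 for label in LABELS}
    features["sentiment_entropy"] = 0.0
    if n == 0:
        return features  # no tweets: all-zero feature vector

    counts = Counter(labels)
    for label in LABELS:
        features[f"frac_{label}"] = counts[label] / n

    # Entropy over the labels that actually occur (p > 0).
    features["sentiment_entropy"] = sum(
        (c / n) * log2(n / c) for c in counts.values()
    )
    return features


if __name__ == "__main__":
    sample = ["good phone", "good service", "bad battery", "arrived today"]
    print(account_sentiment_features(sample, toy_classify))
```

A feature vector of this kind could then be passed to any standard classifier that labels the account as bot or human; the choice of downstream classifier is likewise not specified in the abstract.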
