Language-Agnostic Twitter Bot Detection


Abstract

In this paper we address the problem of detecting Twitter bots. We analyze a dataset of 8385 Twitter accounts and their tweets, covering both humans and different kinds of bots. We use this data to train machine learning classifiers that distinguish between real and bot accounts. We identify features that are easy to extract while still providing good results. We analyze different feature groups based on account-specific, tweet-specific and behavior-specific features and measure their performance compared to other state-of-the-art bot detection methods. For easy future portability of our work we focus on language-agnostic features. With AdaBoost, the best-performing classifier, we achieve an accuracy of 0.988 and an AUC of 0.995. As the creation of good training data in machine learning is often difficult - especially in the domain of Twitter bot detection - we additionally analyze to what extent smaller amounts of training data lead to useful results by reviewing cross-validated learning curves. Our results indicate that using few but expressive features already has a good practical benefit for bot detection, especially if only a small amount of training data is available.
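
The pipeline the abstract describes (a boosted classifier over a few numeric account features, scored by accuracy and AUC at several training-set sizes) can be illustrated with a minimal, standard-library-only sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the data is synthetic, the two features (tweets per day, follower/friend ratio) are hypothetical, the AdaBoost variant is a plain decision-stump implementation, and a single hold-out set stands in for the cross-validated learning curves the paper uses.

```python
import math
import random

def make_accounts(n, seed=0):
    """Synthetic stand-in data: ([features], label), label +1 = bot, -1 = human.
    Both features are hypothetical, not the paper's feature set."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        is_bot = rng.random() < 0.5
        tweets_per_day = rng.gauss(40.0 if is_bot else 8.0, 10.0)
        follower_ratio = rng.gauss(0.3 if is_bot else 1.2, 0.4)
        data.append(([tweets_per_day, follower_ratio], 1 if is_bot else -1))
    return data

def fit_stump(data, weights):
    """Return (weighted_error, feature, threshold, sign) of the best threshold rule."""
    best = None
    for f in range(len(data[0][0])):
        for thr in sorted({x[f] for x, _ in data}):
            for sign in (1, -1):
                err = sum(w for (x, y), w in zip(data, weights)
                          if (sign if x[f] > thr else -sign) != y)
                if best is None or err < best[0]:
                    best = (err, f, thr, sign)
    return best

def fit_adaboost(data, rounds=10):
    """Discrete AdaBoost over decision stumps; returns a list of weighted stumps."""
    n = len(data)
    weights = [1.0 / n] * n
    model = []
    for _ in range(rounds):
        err, f, thr, sign = fit_stump(data, weights)
        err = min(max(err, 1e-10), 1 - 1e-10)  # avoid log(0) on separable data
        alpha = 0.5 * math.log((1 - err) / err)
        model.append((alpha, f, thr, sign))
        # Up-weight misclassified accounts, down-weight correct ones, renormalize.
        weights = [w * math.exp(-alpha * y * (sign if x[f] > thr else -sign))
                   for (x, y), w in zip(data, weights)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return model

def score(model, x):
    """Signed ensemble score; positive means 'bot'."""
    return sum(a * (s if x[f] > t else -s) for a, f, t, s in model)

def accuracy(model, data):
    return sum((1 if score(model, x) > 0 else -1) == y for x, y in data) / len(data)

def auc(model, data):
    """Chance that a random bot scores above a random human (ties count 0.5)."""
    pos = [score(model, x) for x, y in data if y == 1]
    neg = [score(model, x) for x, y in data if y == -1]
    wins = sum(1.0 if p > q else (0.5 if p == q else 0.0) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))

train, test = make_accounts(200, seed=1), make_accounts(200, seed=2)
curve = {}
for size in (25, 50, 100, 200):  # hold-out learning curve over training sizes
    final_model = fit_adaboost(train[:size])
    curve[size] = (accuracy(final_model, test), auc(final_model, test))
    print(size, round(curve[size][0], 3), round(curve[size][1], 3))
```

Printing the scores per training size gives a rough view of how quickly the metrics stabilize as more labeled accounts become available. The numbers it prints describe only this synthetic data and say nothing about the results reported in the paper.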

Bibliographic details

Source: International Conference on Recent Advances in Natural Language Processing, 2019, pp. 550-558 (9 pages)
Author: Juergen Knauth
Original format: PDF

