CoLi at UdS at SemEval-2020 Task 12: Offensive Tweet Detection with Ensembling

Abstract

With today's proliferation of maliciously intended communication across all social media platforms, finding ways of effectively combating these messages grows increasingly important. We present our submission and results for SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020) where we participated in offensive tweet classification tasks in English, Arabic, Greek, Turkish and Danish. Our approach included classical machine learning architectures such as support vector machines and logistic regression combined in an ensemble with a multilingual transformer-based model (XLM-R). The transformer model is trained on all languages combined in order to create a fully multilingual model which can leverage knowledge between languages. The machine learning model hyperparameters are fine-tuned and the statistically best performing ones included in the final ensemble. We further discuss the results of our model and see that our broad approach provides competitive but not task-winning performance. We also include an error analysis and potential improvements for future work.
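
The abstract does not say how the outputs of the classical models and XLM-R are combined in the ensemble. As one plausible reading, the sketch below averages each model's probability that a tweet is offensive (soft voting) and thresholds the result. The function name `soft_vote`, the model keys, the scores, the 0.5 threshold, and the `OFF`/`NOT` label strings are all illustrative assumptions, not details taken from the paper.

```python
from typing import Dict, List, Optional


def soft_vote(
    probs: Dict[str, List[float]],
    weights: Optional[Dict[str, float]] = None,
    threshold: float = 0.5,
) -> List[str]:
    """Combine per-model P(offensive) scores by weighted averaging.

    Assumption: every model emits one probability per tweet, in the
    same tweet order. This is a generic soft-voting scheme, not
    necessarily the combination rule used by the authors.
    """
    if not probs:
        raise ValueError("need at least one model")
    names = list(probs)
    n_tweets = len(probs[names[0]])
    if any(len(probs[name]) != n_tweets for name in names):
        raise ValueError("all models must score the same tweets")

    # Default to equal weights when none are supplied.
    w = {name: 1.0 for name in names} if weights is None else weights
    total = sum(w[name] for name in names)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")

    labels = []
    for i in range(n_tweets):
        avg = sum(w[name] * probs[name][i] for name in names) / total
        labels.append("OFF" if avg >= threshold else "NOT")
    return labels


# Hypothetical scores for three tweets from three models.
scores = {
    "svm": [0.9, 0.2, 0.4],
    "logreg": [0.8, 0.3, 0.7],
    "xlmr": [0.7, 0.1, 0.6],
}
predictions = soft_vote(scores)
```

Passing a `weights` dictionary lets one model count more than the others, for example to favor the transformer over the classical models. Whether the paper weights its ensemble members is not stated in the abstract.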

Bibliographic details

Source
International Workshop on Semantic Evaluation | 2020 | pp. 1916-1924 | 9 pages
Authors
Kathryn Chapman; Johannes Bernhard; Dietrich Klakow
Original format: PDF
