Team Rouges at SemEval-2020 Task 12: Cross-lingual Inductive Transfer to Detect Offensive Language

Abstract

With the growing use of social media and its availability, many instances of the use of offensive language have been observed across multiple languages and domains. This phenomenon has given rise to the growing need to detect the offensive language used in social media cross-lingually. In OffensEval 2020, the organizers have released the multilingual Offensive Language Identification Dataset (mOLID), which contains tweets in five different languages, to detect offensive language. In this work, we introduce a cross-lingual inductive approach to identify the offensive language in tweets using the contextual word embedding XLM-RoBERTa (XLM-R). We show that our model performs competitively on all five languages, obtaining the fourth position in the English task with an F1-score of 0.919 and eighth position in the Turkish task with an F1-score of 0.781. Further experimentation proves that our model works competitively in a zero-shot learning environment, and is extensible to other languages.

Bibliographic details

Source
International Workshop on Semantic Evaluation | 2020 | pp. 2183-2189 | 7 pages
Authors
Tanvi Dadu; Kartikey Pant
Original format: PDF