首页> 外文会议>International Workshop on Semantic Evaluation >Galileo at SemEval-2020 Task 12: Multi-lingual Learning for Offensive Language Identification using Pre-trained Language Models

【24h】

Galileo at SemEval-2020 Task 12: Multi-lingual Learning for Offensive Language Identification using Pre-trained Language Models

机译：伽利略在Semeval-2020任务12：多语言学习使用预先训练的语言模型进行攻击性语言识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes Galileo's performance in SemEval-2020 Task 12 on detecting and categorizing offensive language in social media. For Offensive Language Identification, we proposed a multi-lingual method using Pre-trained Language Models, ERNIE and XLM-R. For offensive language categorization, we proposed a knowledge distillation method trained on soft labels generated by several supervised models. Our team participated in all three sub-tasks. In Sub-task A -Offensive Language Identification, we ranked first in terms of average Fl scores in all languages. We are also the only team which ranked among the top three across all languages. We also took the first place in Sub-task B - Automatic Categorization of Offense Types and Sub-task C - Offence Target Identification.

机译：本文介绍了伽利略在Semeval-2020任务12中的性能，在社交媒体中检测和分类攻击性语言。对于令人反感的语言识别，我们提出了一种使用预先训练的语言模型，ernie和xlm-r的多语言方法。对于令人反感的语言分类，我们提出了一种知识蒸馏方法，这些方法培训了由多个监督模型产生的软标签。我们的团队参加了所有三个子任务。在子任务中，我们在所有语言中的平均流行评分中排名第一。我们也是唯一一个在所有语言中排名前三个的团队。我们还介绍了子任务B的第一名 - 自动分类进攻类型和子任务C - 冒犯目标识别。

著录项

来源
《International Workshop on Semantic Evaluation 》|2020年|1448-1455|共8页
会议地点
作者
Shuohuan Wang; Jiaxiang Liu; Xuan Ouyang; Yu Sun;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Learning to Predict U.S. Policy Change Using New York Times Corpus with Pre-Trained Language Model [J] . Guoshuai Zhang, Jiaji Wu, Mingzhou Tan, Multimedia Tools and Applications . 2020 ,第45a46期

机译：使用纽约时报语料库进行预先培训的语言模型，学习预测美国政策更改
2. Deep Learning for predicting neutralities in Offensive Language Identification Dataset [J] . Sharma Mayukh, Kandasamy Ilanthenral, Kandasamy Vasantha Expert systems with applications . 2021 ,第Deca期

机译：深度学习，用于预测攻击语言识别数据集中的中和
3. The affordances theory in teaching and learning African first additional languages: A case for task-based language teaching [J] . Minas Edith Christina Ecological restoration . 2020 ,第1期

机译：教学和学习非洲第一语言的带来理论：基于任务的语言教学案例
4. SINAI at SemEval-2020 Task 12: Offensive language identification exploring transfer learning models [C] . Flor Miriam Plaza-del-Arco, M. Dolores Molina-Gonzalez, L. Alfonso Urena-Lopez, International Workshop on Semantic Evaluation . 2020

机译：西奈Semeval-2020任务12：令人反感的语言识别探索转移学习模型
5. Logic, formal languages, and formal language identification. Some logical properties of the languages in the Chomsky hierarchy, and an interrogative model of formal language identification. [D] . Pylkko, Pauli Olavi. 1988

机译：逻辑，形式语言和形式语言标识。乔姆斯基层次结构中语言的某些逻辑属性，以及形式语言标识的疑问模型。
6. Relation Extraction from Clinical Narratives Using Pre-trained Language Models [O] . Qiang Wei, Zongcheng Ji, Yuqi Si, 2019

机译：使用预训练的语言模型从临床叙事中提取关系
7. HAD-Tübingen at SemEval-2019 Task 6: Deep Learning Analysis of Offensive Language on Twitter: Identification and Categorization [O] . Himanshu Bansal, Daniel Nagel, Anita Soloveva 2019

机译：Had-Tübingen在Semeval-2019任务6：Twitter上的攻击性语言的深度学习分析：识别和分类
8. Information Processing Models and Computer Aids for Human Performance. Task I: Second-Language Learning. [R] . Kalikow, D. N. 1972

机译：人类绩效的信息处理模型和计算机辅助工具。任务I：第二语言学习。

Galileo at SemEval-2020 Task 12: Multi-lingual Learning for Offensive Language Identification using Pre-trained Language Models

摘要

著录项

相似文献

相关主题

期刊订阅