indicnlp@kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Languages

机译：Dravidianlangtech-eacl2021的indicnlp @ kgp：Dravidian语言中的令人反感语言识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The paper presents the submission of the team indicnlp@kgp to the EACL 2021 shared task "Offensive Language Identification in Dravidian Languages". The task aimed to classify different offensive content types in 3 code-mixed Dravidian language datasets. The work leverages existing state of the art approaches in text classification by incorporating additional data and transfer learning on pre-trained models. Our final submission is an ensemble of an AWD-LSTM based model along with 2 different transformer model architectures based on BERT and RoBERTa. We achieved weighted-average F1 scores of 0.97, 0.77. and 0.72 in the Malayalam-English, Tamil-English, and Kannada-English datasets ranking 1st, 2nd, and 3rd on the respective tasks.

机译：本文介绍了将TeamNingnlp @ KGP提交给EACL 2021共享任务“在Dravidian语言中的冒犯性语言识别”。该任务旨在在3个代码混合的Dravidian语言数据集中对不同的冒犯内容类型进行分类。该工作通过在预先训练的模型上结合额外的数据并转移学习，利用文本分类中的现有技术方法。我们的最终提交是基于AWD-LSTM的模型以及基于BERT和Roberta的2种不同的变压器模型架构。我们达到了0.97,0.77的加权平均F1分数。在Malayalam-English，Tamil-English和Kannada-English数据集中排名第1，第2和第3个，在相应的任务中排名第1。

著录项

来源
《Workshop on Speech and Language Technologies for Dravidian Languages》|2021年|330-335|共6页
会议地点
作者
Kushal Kedia; Abhilash Nandy;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Comparison of Different Orthographies for Machine Translation of Under-Resourced Dravidian Languages [J] . Bharathi Raja Chakravarthi, Mihael Arcan, John P. McCrae OASIcs : OpenAccess Series in Informatics . 2019,第1期

机译：资源不足的德拉维语机器翻译中不同拼字法的比较
2. Automatic continuous speech recogniser for Dravidian languages using the auto associative neural network [J] . J. Sangeetha, S. Jothilakshmi International journal of computational vision and robotics . 2016,第1a2期

机译：使用自动联想神经网络的Dravidian语言自动连续语音识别器
3. Parts of Speech Taggers for Dravidian Languages [J] . Anjali M K, BabuAnto P International Journal of Engineering Trends and Technology . 2015,第7期

机译：德拉威语的语音标注器
4. IRNLP_DAIICT@DravidianLangTech-EACL2021: Offensive Language identification in Dravidian Languages using TF-IDF Char N-grams and MuRIL [C] . Bhargav Dave, Shripad Bhat, Prasenjit Majumder Workshop on Speech and Language Technologies for Dravidian Languages . 2021

机译：Irnlp_daiict @dravidianlangtech-eacl2021：使用TF-IDF Char N-Grams和Muril的Dravidian语言中的攻击性语言识别
5. Logic, formal languages, and formal language identification. Some logical properties of the languages in the Chomsky hierarchy, and an interrogative model of formal language identification. [D] . Pylkko, Pauli Olavi. 1988

机译：逻辑，形式语言和形式语言标识。乔姆斯基层次结构中语言的某些逻辑属性，以及形式语言标识的疑问模型。
6. A Bayesian phylogenetic study of the Dravidian language family [O] . Vishnupriya Kolipakam, Fiona M. Jordan, Michael Dunn, 2018

机译：贝拉维语族的贝叶斯系统发育研究
7. On State-of-the-art of POS Tagger, ‘Sandhi’ Splitter, ‘Alankaar’ Finder and ‘Samaas’ Finder for Indo-Aryan and Dravidian Languages [O] . Hema Gaikwad, Jatinderkumar R. 2021

机译：关于POS标签，'Sandhi'Splitter，'Alankaar'Finder和'Samaas'Finder的indo-aryan和Dravidian语言

indicnlp@kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Languages

摘要

著录项

相似文献

相关主题

期刊订阅