JUNLP@DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Langauges

Abstract

Offensive language identification has been an active area of research in natural language processing. With the emergence of multiple social media platforms, offensive language identification has become a pressing need. Traditional offensive language identification models fail to deliver acceptable results because social media content is largely multilingual and code-mixed in nature. This paper tries to resolve this problem by using IndicBERT and BERT architectures to facilitate identification of offensive language for Kannada-English, Malayalam-English, and Tamil-English code-mixed language pairs extracted from social media. When evaluated on the test corpus, the presented approach achieved precision, recall, and F1 scores of 0.62, 0.71, and 0.66 for Kannada-English; 0.77, 0.43, and 0.53 for Malayalam-English; and 0.71, 0.74, and 0.72 for Tamil-English.

Bibliographic details

Source
Workshop on Speech and Language Technologies for Dravidian Languages | 2021 | pp. 319-322 | 4 pages
Authors
Avishek Garain; Atanu Mandal; Sudip Kumar Naskar
Original format
PDF

Similar literature

1. Comparison of Different Orthographies for Machine Translation of Under-Resourced Dravidian Languages [J]. Bharathi Raja Chakravarthi, Mihael Arcan, John P. McCrae. OASIcs: OpenAccess Series in Informatics, 2019, No. 1.
2. Automatic continuous speech recogniser for Dravidian languages using the auto associative neural network [J]. J. Sangeetha, S. Jothilakshmi. International Journal of Computational Vision and Robotics, 2016, No. 1a2.
3. Parts of Speech Taggers for Dravidian Languages [J]. Anjali M K, BabuAnto P. International Journal of Engineering Trends and Technology, 2015, No. 7.
4. IRNLP_DAIICT@DravidianLangTech-EACL2021: Offensive Language identification in Dravidian Languages using TF-IDF Char N-grams and MuRIL [C]. Bhargav Dave, Shripad Bhat, Prasenjit Majumder. Workshop on Speech and Language Technologies for Dravidian Languages, 2021.
5. Logic, formal languages, and formal language identification. Some logical properties of the languages in the Chomsky hierarchy, and an interrogative model of formal language identification. [D]. Pylkko, Pauli Olavi. 1988.
6. A Bayesian phylogenetic study of the Dravidian language family [O]. Vishnupriya Kolipakam, Fiona M. Jordan, Michael Dunn. 2018.
7. Multilingual Offensive Language Identification with Cross-lingual Embeddings [O]. Tharindu Ranasinghe, Marcos Zampieri. 2020.
