首页> 外文会议>Workshop on Speech and Language Technologies for Dravidian Languages >JudithJeyafreedaAndrew@DravidianLangTech-EACL2021: Offensive language detection for Dravidian Code-mixed YouTube comments
【24h】

JudithJeyafreedaAndrew@DravidianLangTech-EACL2021: Offensive language detection for Dravidian Code-mixed YouTube comments

机译:judithjeyafreedaandrew @ dravidianlangtech-eacl2021:Dravidian代码混合YouTube评论的攻击性语言检测

获取原文

摘要

Messaging online has become one of the major ways of communication. At this level, there are cases of online/digital bullying. These include rants, taunts, and offensive phrases. Thus the identification of offensive language on the internet is a very essential task. In this paper, the task of offensive language detection on YouTube comments from the Dravidian languages of Tamil, Malayalam and Kannada are seen upon as a mutliclass classification problem. After being subjected to language specific pre-processing, several Machine Learning algorithms have been trained for the task at hand. The paper presents the accuracy results on the development datasets for all Machine Learning models that have been used and finally presents the weighted average scores for the test set when using the best performing Machine Learning model.
机译:Messaging Online已成为沟通的主要方式之一。 在这个级别,有在线/数字欺凌的情况。 这些包括咆哮,嘲讽和令人反感的短语。 因此,在互联网上识别令人反感的语言是一个非常重要的任务。 在本文中,作为泰米尔,马拉雅拉姆和克南人的Dravidian语言对YouTube评论的攻击性语言检测的任务被视为笨拙的分类问题。 经过语言特定的预处理后,已经为手头的任务培训了几种机器学习算法。 本文介绍了已使用的所有机器学习模型的开发数据集的准确性结果,最终显示使用最佳执行机器学习模型时为测试集的加权平均分数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号