首页> 外文会议>Workshop on Language Technology for Equality, Diversity and Inclusion >ssn_diBERTsity@LT-EDI-EACL2021: Hope Speech Detection on multilingual YouTube comments via transformer based approach
【24h】

ssn_diBERTsity@LT-EDI-EACL2021: Hope Speech Detection on multilingual YouTube comments via transformer based approach

机译:SSN_DIBERTSITY @ LT-EDI-EACL2021:希望通过基于变压器的方法对多语种YouTube评论的言语检测

获取原文

摘要

In recent times, there exists an abundance of research to classify abusive and offensive texts focusing on negative comments but only minimal research using the positive reinforcement approach. The task was aimed at classifying texts into 'Hope_speech', 'Non_hope_speech', and 'Not in language'. The datasets were provided by the LT-EDI organisers in English, Tamil, and Malayalam languages with texts sourced from YouTube comments. We trained our data using transformer models, specifically mBERT for Tamil and Malayalam and BERT for English, and achieved weighted average F1-scores of 0.46, 0.81, 0.92 for Tamil, Malayalam, and English respectively. In the task results, our approach ranked second, fourth and fourteenth place in English, Malayalam and Tamil respectively.
机译:最近,存在丰富的研究,分类滥用和冒犯性的文本,专注于负面评论,而是使用积极的强化方法的最小研究。 该任务旨在将文本分类为“hope_speech”,'non_hope_speech'和'不是语言'。 数据集由LT-EDI组织者提供英文,泰米尔和马拉雅拉姆语言,与来自YouTube评论的文本。 我们使用变压器模型,特别是Mbert for Tamil和Malayalam以及英语伯特的培训,并分别获得了0.46,0.81,0.92的加权平均f1分别用于泰米尔,Malayalam和英文。 在任务结果中,我们的方法分别排名第二,第四,第四和第十四位,分别用英语,马来拉兰州和泰米尔排名第二。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号