CiTIUS-COLE at SemEval-2019 Task 5: Combining Linguistic Features to Identify Hate Speech Against Immigrants and Women on Multilingual Tweets

机译：CiTIUS-COLE在SemEval-2019任务5：结合语言特征来识别针对多语言推文针对移民和妇女的仇恨言论

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This article describes the strategy submitted by the CiTIUS-COLE team to SemEval 2019 Task 5, a task which consists of binary classification where the system predicts whether a tweet in English or in Spanish is hateful against women or immigrants or not. The proposed strategy relies on combining linguistic features to improve the classifier's performance. More precisely, the method combines textual and lexical features, embedding words with the bag of words in Term Frequency-Inverse Document Frequency (TF-IDF) representation. The system performance reaches about 81% F1 when it is applied to the training dataset. but its F1 drops to 36% on the official test dataset for the English and 64% for the Spanish language concerning the hate speech class.

机译：本文介绍了CiTIUS-COLE团队向SemEval 2019任务5提交的策略，该任务由二进制分类组成，系统会预测英语或西班牙语的推文是否讨厌女性或移民。所提出的策略依赖于结合语言特征来提高分类器的性能。更准确地说，该方法结合了文本和词汇特征，将单词与词包以术语频率-逆文档频率（TF-IDF）表示形式嵌入。当将其应用于训练数据集时，系统性能将达到约81％F1。但在针对仇恨言语类的英语官方测试数据集上，其F1下降至36％，在西班牙语语言测试中，其F1下降至64％。

著录项

来源
《Annual conference of the North American Chapter of the Association for Computational Linguistics: human language technologies;International workshop on semantic evaluation》|2019年|387-390|共4页
会议地点 Minneapolis(US)
作者
Sattam Almatarneh; Pablo Gamallo; Francisco J. Ribadas Pena;
展开▼
作者单位

(CiTIUS) Universidade de Santiago de Compostela Spain University of Vigo Spain;

(CiTIUS) Universidade de Santiago de Compostela Spain;

Department of Computer Science University of Vigo Spain;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
入库时间 2022-08-26 14:42:18

相似文献

外文文献
中文文献
专利

1. Novel Approach for Generating Hybrid Features Set to Effectively Identify Hate Speech [J] . Inteligencia Artificial : Ibero-American Journal of Artificial Intelligence . 2020,第66期

机译：生成混合特征的新方法集合以有效识别仇恨语音
2. Identifying second language speech tasks and ability levels for successful nurse oral interaction with patients in a linguistic minority setting: an instrument development project. [J] . Isaacs T, Laurier MD, Turner CE, Health communication . 2011,第6期

机译：识别在少数语言环境中与患者成功进行护士口头互动的第二语言语音任务和能力水平：一项仪器开发项目。
3. Identifying Second Language Speech Tasks and Ability Levels for Successful Nurse Oral Interaction with Patients in a Linguistic Minority Setting: An Instrument Development Project [J] . Talia Isaacsa* Michel D. Laurierb Carolyn E. Turnerc Norman Segalowitzd Health Communication . 2011,第6期

机译：确定少数语言环境中成功进行护士与患者的口头互动的第二语言语音任务和能力水平：一项仪器开发项目
4. CiTIUS-COLE at SemEval-2019 Task 5: Combining Linguistic Features to Identify Hate Speech Against Immigrants and Women on Multilingual Tweets [C] . Sattam Almatarneh, Pablo Gamallo, Francisco J. Ribadas Pena Annual conference of the North American Chapter of the Association for Computational Linguistics: human language technologies . 2019

机译：CITIUS-COLE在Semeval-2019任务5：结合语言特征，以识别对多语言推文的移民和妇女的仇恨言论
5. Linguistic and Sociolinguistic Factors of Integration within a Multilingual Context: The case of immigrants in Montreal. [D] . Calinon, Anne-Sophie. 2009

机译：多语言环境下融合的语言和社会语言因素：蒙特利尔的移民案例。
6. Dramatic effects of speech task on motor and linguistic planning in severely dysfluent parkinsonian speech [O] . Diana Van Lancker Sidtis, Krista Cameron, John J. Sidtis -1

机译：言语任务对运动与语言规划的戏剧效果严重困扰帕金森汉语演讲
7. SemEval-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter [O] . Valerio Basile, Cristina Bosco, Elisabetta Fersini, 2019

机译：Semeval-2019任务5：对Twitter中移民和女性的仇恨言论的多语言检测

CiTIUS-COLE at SemEval-2019 Task 5: Combining Linguistic Features to Identify Hate Speech Against Immigrants and Women on Multilingual Tweets

摘要

著录项

相似文献

相关主题

期刊订阅