Detecting Offensive Tweets in Hindi-English Code-Switched Language

Abstract

The exponential rise of social media websites such as Twitter, Facebook and Reddit in linguistically diverse geographical regions has led to the hybridization of popular native languages with English in an effort to ease communication. The paper focuses on the classification of offensive tweets written in Hinglish, a portmanteau of Hindi and English in which the Indic language Hindi is written in the Roman script. The paper introduces a novel tweet dataset, the Hindi-English Offensive Tweet (HEOT) dataset, consisting of tweets in Hindi-English code-switched language split into three classes: non-offensive, abusive and hate-speech. Further, we approach the classification of tweets in the HEOT dataset using transfer learning, wherein the proposed model, which employs Convolutional Neural Networks, is pre-trained on tweets in English and then retrained on Hinglish tweets.
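
The two-stage schedule described in the abstract (pre-train on English tweets, then retrain on Hinglish tweets) can be illustrated with a minimal, dependency-free sketch. This is not the authors' implementation: a bag-of-words softmax classifier is swapped in for the paper's Convolutional Neural Network so that the example runs with the Python standard library alone. The names `SoftmaxBow`, `featurize` and `pretrain_then_finetune`, the toy tweets, and the epoch counts and learning rates are all assumptions made for illustration. Only the three class labels come from the abstract.

```python
import math
import random
from typing import Dict, List, Sequence, Tuple

# The three HEOT classes named in the abstract.
LABELS = ["non-offensive", "abusive", "hate-speech"]
Example = Tuple[str, int]  # (tweet text, index into LABELS)


def featurize(text: str) -> List[str]:
    """Lower-cased whitespace tokens (hypothetical stand-in for real preprocessing)."""
    return text.lower().split()


class SoftmaxBow:
    """Bag-of-words softmax classifier, swapped in here for the paper's CNN."""

    def __init__(self, n_classes: int) -> None:
        self.n_classes = n_classes
        self.weights: Dict[str, List[float]] = {}  # token -> one weight per class
        self.bias = [0.0] * n_classes

    def predict_proba(self, text: str) -> List[float]:
        scores = list(self.bias)
        for tok in featurize(text):
            # Tokens never seen in training contribute nothing.
            for k, w in enumerate(self.weights.get(tok, ())):
                scores[k] += w
        top = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        return [e / total for e in exps]

    def predict(self, text: str) -> int:
        probs = self.predict_proba(text)
        return max(range(self.n_classes), key=probs.__getitem__)

    def fit(self, data: Sequence[Example], epochs: int, lr: float, seed: int = 0) -> None:
        """Plain SGD on cross-entropy; continues from the current weights."""
        rng = random.Random(seed)
        rows = list(data)
        for _ in range(epochs):
            rng.shuffle(rows)
            for text, label in rows:
                probs = self.predict_proba(text)
                tokens = featurize(text)
                for k in range(self.n_classes):
                    grad = probs[k] - (1.0 if k == label else 0.0)
                    self.bias[k] -= lr * grad
                    for tok in tokens:
                        row = self.weights.setdefault(tok, [0.0] * self.n_classes)
                        row[k] -= lr * grad


def pretrain_then_finetune(source: Sequence[Example], target: Sequence[Example]) -> SoftmaxBow:
    model = SoftmaxBow(len(LABELS))
    model.fit(source, epochs=50, lr=0.3)  # stage 1: pre-train on English tweets
    model.fit(target, epochs=40, lr=0.1)  # stage 2: retrain on Hinglish tweets
    return model


# Toy, made-up tweets; "groupx" is a placeholder for a targeted group.
ENGLISH_TOY: List[Example] = [
    ("have a nice day", 0), ("thank you friend", 0),
    ("you are an idiot", 1), ("shut up you fool", 1),
    ("groupx people should be banned", 2), ("throw out all groupx", 2),
]
HINGLISH_TOY: List[Example] = [
    ("aapka din nice ho", 0), ("shukriya dost", 0),
    ("tum idiot ho", 1), ("chup kar fool", 1),
    ("groupx log ko ban karo", 2), ("sab groupx ko nikalo", 2),
]

if __name__ == "__main__":
    model = pretrain_then_finetune(ENGLISH_TOY, HINGLISH_TOY)
    for tweet in ("tum idiot ho", "shukriya dost"):
        print(tweet, "->", LABELS[model.predict(tweet)])
```

The point of the sketch is that the second `fit` call starts from the weights learned in the first, so tokens shared between the two corpora (such as "idiot" or "nice" in the toy data) carry their English-learned weights into the Hinglish stage. A CNN-based version would follow the same schedule with a different model class.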

Bibliographic details

Source
Annual Meeting of the Association for Computational Linguistics | 2018 | xiii, 85 p. | 9 pages
Authors
Puneet Mathur; Rajiv Ratn Shah; Ramit Sawhney; Debanjan Mahata
Original format: PDF
Chinese Library Classification: Programming, software engineering

Similar literature

1. Evaluating Machine Learning Techniques for Detecting Offensive and Hate Speech in South African Tweets [J]. Oriola Oluwafemi, Kotze Eduan Quality Control, Transactions. 2020
2. Detecting Misogyny and Xenophobia in Spanish Tweets Using Language Technologies [J]. Flor-Miriam Plaza-del-Arco, M. Dolores Molina-Gonzalez, L. Alfonso Urena-Lopez. ACM Transactions on Internet Technology. 2020, no. 2
3. Detecting malicious tweets in trending topics using a statistical analysis of language [J]. Juan Martinez-Romo, Lourdes Araujo. Expert Systems with Applications. 2013, no. 8
4. Detecting Offensive Tweets in Hindi-English Code-Switched Language [C]. Puneet Mathur, Rajiv Ratn Shah, Ramit Sawhney. Sixth International Workshop on Natural Language Processing for Social Media. 2018
5. Detecting Offensive Social Media Text in Nepali Language [D]. Timilsina, Sandesh. 2020
6. Examining the relationship between comprehension and production processes in code-switched language [O]. Rosa E. Guzzardo Tamargo, Jorge R. Valdés Kroff, Paola E. Dussias
7. Automatic Language Identification in Code-Switched Hindi-English Social Media Text [O]. Li Nguyen, Christopher Bryant, Sana Kidwai. 2021
8. Detecting Malicious Tweets in Twitter Using Runtime Monitoring With Hidden Information [R]. Yilmaz, A. 2016
