首页> 外文会议>International Conference on speech and computer >Evaluating Novel Features for Aggressive Language Detection
【24h】

Evaluating Novel Features for Aggressive Language Detection

机译:评估侵略性语言检测的新颖功能

获取原文

摘要

The widespread use and abuse of social media and other platforms to voice opinions online has necessitated the development of tools to regulate this exchange of opinions in light of ethical and legal considerations. In this work, we aim to detect patterns of aggressive language to gain insight into what differentiates it from non-inflammatory language. Of particular interest are features of comments that, taken together, allow this distinction to be made automatically. To that end, we employ feature selection techniques to find optimal feature subsets. We apply the feature selection and model evaluation process to two independent datasets. Depending on the dataset and model type, between 3 and 19 features are enough to outperform the full set of 68 features. Overall, the best F_1 scores per dataset are 89.4%, using 35 features with a Gaussian SVM and 82.7%, using 17 features with a linear SVM.
机译:社交媒体和其他平台广泛使用和滥用在线发表意见,因此需要开发工具来根据道德和法律考量来规范意见交流。在这项工作中,我们旨在检测攻击性语言的模式,以洞悉其与非炎症性语言的区别。特别引起关注的是评论的功能,这些功能加在一起可以自动进行区分。为此,我们采用特征选择技术来找到最佳特征子集。我们将特征选择和模型评估过程应用于两个独立的数据集。根据数据集和模型类型,3个到19个特​​征就足以胜过全部68个特征。总体而言,使用高斯SVM的35个特征和使用线性SVM的17个特征时,每个数据集的最佳F_1得分是89.4%,而使用数据集的17个特征则是82.7%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号