首页> 外文期刊>Expert Systems with Application >Behind the cues: A benchmarking study for fake news detection
【24h】

Behind the cues: A benchmarking study for fake news detection

机译:线索背后:假新闻检测的基准研究

获取原文
获取原文并翻译 | 示例
           

摘要

Fake news has become a problem of great impact in our information driven society because of the continuous and intense fakesters content distribution. Information quality in news feeds is under questionable veracity calling for automated tools to detect fake news articles. Due to many faces of fakesters, creating such tool is a challenging problem. In this work, we propose a model for fake news detection using content based features and Machine Learning (ML) algorithms. To conclude in most accurate model we evaluate several feature sets proposed for deception detection and word embeddings as well. Moreover, we test the most popular ML classifiers and investigate the possible improvement reached under ensemble ML methods such as AdaBoost and Bagging. An extensive set of earlier data sources has been used for experimentation and evaluation of both feature sets and ML classifiers. Moreover, we introduce a new text corpus, the "UNBiased" (UNB) dataset, which integrates various news sources and fulfills several standards and rules to avoid biased results in classification task. Our experimental results show that the use of an enhanced linguistic feature set with word embeddings along with ensemble algorithms and Support Vector Machines (SVMs) is capable to classify fake news with high accuracy. (C) 2019 Elsevier Ltd. All rights reserved.
机译:由于伪造者内容的持续和密集分布,假新闻已成为对我们的信息驱动社会产生重大影响的问题。新闻提要中的信息质量准确性受到质疑,要求使用自动工具来检测假新闻。由于伪造者的面孔众多,因此创建这样的工具是一个具有挑战性的问题。在这项工作中,我们提出了一个基于内容的特征和机器学习(ML)算法的假新闻检测模型。总而言之,在最准确的模型中,我们评估了建议用于欺骗检测和词嵌入的几个特征集。此外,我们测试了最流行的ML分类器,并研究了诸如AdaBoost和Bagging之类的整体ML方法可能实现的改进。大量的早期数据源已用于特征集和ML分类器的实验和评估。此外,我们引入了一个新的文本语料库,即“ UNBiased ”(UNB)数据集,该数据集集成了各种新闻源并满足了一些标准和规则,以避免分类任务中的结果出现偏差。我们的实验结果表明,使用带有词嵌入功能的增强语言功能集以及集成算法和支持向量机(SVM)可以对伪造新闻进行高精度分类。 (C)2019 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号