A Tool for Fake News Detection


Abstract

The word post-truth was named Oxford Dictionaries' Word of the Year 2016. It is an adjective relating to or denoting circumstances in which objective facts are less influential in shaping public opinion than appeals to emotion and personal belief. This leads to misinformation and problems in society. Hence, it is important to make an effort to detect such misinformation and prevent it from spreading. In this paper we propose machine learning techniques, in particular supervised learning, for fake news detection. More precisely, we used a dataset of fake and real news to train a machine learning model using the Scikit-learn library in Python. We extracted features from the dataset using text representation models such as Bag-of-Words, Term Frequency-Inverse Document Frequency (TF-IDF) and Bi-gram frequency. We tested two classification approaches, namely probabilistic classification and linear classification, on the title and on the content, checking whether the title is clickbait or non-clickbait and whether the content is fake or real. The outcome of our experiments was that linear classification works best with the TF-IDF model for content classification. The Bi-gram frequency model gave the lowest accuracy for title classification in comparison with Bag-of-Words and TF-IDF.
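The three text representations named in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation: the whitespace tokenizer, the unsmoothed `tf * log(N / df)` weighting, the function names, and the two headlines are all assumptions made for illustration. Scikit-learn's `CountVectorizer` and `TfidfVectorizer` offer comparable features, with a differently smoothed IDF by default.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def bag_of_words(text):
    """Bag-of-Words: unigram counts for one document."""
    return Counter(tokenize(text))


def bigram_counts(text):
    """Bi-gram frequency: counts of adjacent word pairs in one document."""
    tokens = tokenize(text)
    return Counter(zip(tokens, tokens[1:]))


def tf_idf(docs):
    """TF-IDF per document, assuming the plain tf * log(N / df) form."""
    counts = [bag_of_words(d) for d in docs]
    n_docs = len(docs)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for c in counts for term in c)
    return [
        {
            term: (freq / sum(c.values())) * math.log(n_docs / df[term])
            for term, freq in c.items()
        }
        for c in counts
    ]


# Hypothetical headlines, invented for illustration only.
headlines = [
    "you will not believe this trick",
    "parliament approves this budget",
]
weights = tf_idf(headlines)
```

In this sketch a term that occurs in every document, such as "this", gets weight zero, while terms unique to one headline get positive weight. Either kind of feature vector could then be passed to a probabilistic or a linear classifier.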

Bibliographic record

Source
International Symposium on Symbolic and Numeric Algorithms for Scientific Computing, 2018, pp. 379-386 (8 pages)
Authors
Bashar Al Asaad; Madalina Erascu
Original format
PDF
Keywords
content management; feature extraction; Internet; learning (artificial intelligence); pattern classification; probability; Python; text analysis

Similar literature

1. FakeNewsTracker: a tool for fake news collection, detection, and visualization [J]. Shu Kai, Mahudeswaran Deepak, Liu Huan. Computational & Mathematical Organization Theory. 2019, No. 1
2. Fake news detection tool (FNDT): Shield against sentimental deception [J]. Suri Bhawna, Taneja Shweta, Aggarwal Soumya. Journal of Information and Optimization Sciences. 2020, No. 6
3. Fake Detect: A Deep Learning Ensemble Model for Fake News Detection [J]. Nida Aslam, Irfan Ullah Khan, Farah Salem Alotaibi. Complexity. 2021, No. a
4. Evaluation of Tools and Extension for Fake News Detection [C]. Dilip Kumar Sharma, Sonal Garg, Priya Shrivastava. International Conference on Innovative Practices in Technology and Management. 2021
5. Hierarchical Attention Networks for Fake News Detection [D]. Jeong, Haeseung. 2021
6. Determinants of individuals’ belief in fake news: A scoping review determinants of belief in fake news [O]. Kirill Bryanov, Victoria Vziatysheva, Stefano Triberti. 2021
7. Combating Fake News in “Low-Resource” Languages: Amharic Fake News Detection Accompanied by Resource Crafting [O]. Fantahun Gereme, William Zhu, Tewodros Ayall. 2021
