Information Processing & Management

Deep contextualized text representation and learning for fake news detection



Abstract

In recent years, the widespread use of social media and broadcasting agencies around the world has left people highly exposed to false information and fake news, which negatively affect both collective opinion and government policy. At the same time, the great success of pre-trained models for embedding contextual information from text has motivated researchers to use these embeddings in a variety of natural language processing tasks. However, in a complex task such as fake news detection, it is not clear which contextualized embedding provides the classifier with the most valuable features. Given the lack of a comparative study of different contextualized pre-trained models combined with distinct neural classifiers, we carry out such a comparison across classifiers and embedding models. In this paper, we propose three classifiers with different pre-trained models for embedding the input news articles. We connect a Single-Layer Perceptron (SLP), a Multi-Layer Perceptron (MLP), and a Convolutional Neural Network (CNN) after an embedding layer built from recent pre-trained models such as BERT, RoBERTa, GPT2, and Funnel Transformer, in order to benefit from the deep contextualized representations these models provide as well as from deep neural classification. We evaluate our proposed models on three well-known fake news datasets: LIAR (Wang, 2017), ISOT (Ahmed et al., 2017), and COVID-19 (Patwa et al., 2020). The results on these three datasets show the superiority of our proposed models for fake news detection compared to state-of-the-art models. They show improvements of 7% and 0.1% in classification accuracy over the model proposed by Goldani et al. (2021) on LIAR and ISOT, respectively, and a 1% improvement over the model proposed by Shifath et al. (2021) on the COVID-19 dataset.
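
The abstract describes a two-stage architecture: a pre-trained transformer encoder produces contextualized token embeddings for a news article, and a neural classifier (SLP, MLP, or CNN) sits on top. The sketch below is not the authors' released code; it is a minimal illustration of that pattern using HuggingFace Transformers and PyTorch, with BERT as the encoder and a CNN head. The model name, filter counts, kernel sizes, and example input are illustrative assumptions, not values from the paper.

```python
# Hypothetical sketch of an "embedding layer + CNN classifier" fake news model,
# assuming a BERT-style encoder; RoBERTa or Funnel Transformer would be similar swaps.
import torch
import torch.nn as nn
from transformers import AutoTokenizer, AutoModel

class EmbeddingCNNClassifier(nn.Module):
    def __init__(self, encoder_name="bert-base-uncased", num_classes=2,
                 num_filters=100, kernel_sizes=(3, 4, 5)):
        super().__init__()
        # Pre-trained encoder providing deep contextualized token representations.
        self.encoder = AutoModel.from_pretrained(encoder_name)
        hidden = self.encoder.config.hidden_size
        # 1-D convolutions over the sequence of contextualized token vectors.
        self.convs = nn.ModuleList(
            nn.Conv1d(hidden, num_filters, k) for k in kernel_sizes
        )
        self.classifier = nn.Linear(num_filters * len(kernel_sizes), num_classes)

    def forward(self, input_ids, attention_mask):
        # (batch, seq_len, hidden): contextual embeddings from the encoder.
        hidden_states = self.encoder(
            input_ids=input_ids, attention_mask=attention_mask
        ).last_hidden_state
        x = hidden_states.transpose(1, 2)  # (batch, hidden, seq_len) for Conv1d
        # Convolve, apply ReLU, and max-pool over time for each kernel size.
        pooled = [torch.relu(conv(x)).max(dim=2).values for conv in self.convs]
        return self.classifier(torch.cat(pooled, dim=1))

# Illustrative usage on a toy input (not a sample from LIAR, ISOT, or COVID-19).
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = EmbeddingCNNClassifier()
batch = tokenizer(["Example news article text."], return_tensors="pt",
                  padding=True, truncation=True)
logits = model(batch["input_ids"], batch["attention_mask"])
```

Replacing the CNN head with a single linear layer (SLP) or a stack of linear layers with nonlinearities (MLP) recovers the other two classifier variants the abstract mentions.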
