Fake News Detection with the New German Dataset 'GermanFakeNC'

机译：用新的德国DataSet'demberFakenc'假新闻检测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The spread of misleading information and "alternative facts" on the internet gained in the last decade considerable importance worldwide. In recent years, several attempts have been made to counteract fake news based on automatic classification via machine; learning models. These, however, require labeled data. The scarcity of available corpora for predictive modeling is a major stumbling block in this field, especially in other languages than English. Our contribution is twofold. First, we introduce a new publicly available German dataset "German Fake News Corpus" (GermanFakeNC) for the task of fake news detection which consists of 490 manually fact-checked articles. Every false statement in the text is verified claim-by-claim by authoritative sources. Our ground truth for trustworthy news consists of 4,500 news articles from well-known mainstream news publishers. With regard to the second contribution, we choose a Convolutional Neural Network (CNN) (k = 0.89) and the widely used SVM (k = 0.72) technique to detect fake news. Thus we hope that our approach will stimulate the progress in fake news detection and claim verification across languages.

机译：误导信息和“替代事实”在互联网上的差异在过去十年中获得了全球的重要性。近年来，已经采取了几次尝试根据通过机器自动分类抵消假新闻;学习模型。但是，这些需要标记数据。可测量的预测建模的稀缺性是该领域的主要绊脚石，尤其是其他语言而不是英语。我们的贡献是双重的。首先，我们为假新闻检测的任务介绍了一个新的公开可用的德国数据集“德国假新闻语料库”（GermanFakenc），该任务由490个手动检查的文章组成。案文中的每一个虚假陈述都由权威来源索赔索赔。我们为值得信赖的新闻的基础事实由知名主流新闻出版商的4,500条新闻文章组成。关于第二种贡献，我们选择卷积神经网络（CNN）（K = 0.89）和广泛使用的SVM（k = 0.72）技术来检测假新闻。因此，我们希望我们的方法能够刺激假新闻检测和跨语言的索赔核查的进展。

著录项

来源
《International conference on theory and practice of digital libraries》|2019年|xv 422 p.|共8页
会议地点
作者
Inna Vogel; Peter Jiang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类电子图书馆、数字图书馆;
关键词
Fake news detection; Fake statement; Deception detection; German corpus;

机译：假新闻检测;假声明;欺骗检测;德国语料库;
入库时间 2022-08-20 20:19:25

相似文献

外文文献
中文文献
专利

1. Fake news detection: a survey of evaluation datasets [J] . Arianna D’Ulizia, Maria Chiara Caschera, Fernando Ferri, PeerJ Computer Science . 2021,第a期

机译：假新闻检测：评估数据集的调查
2. AraCOVID19-MFH: Arabic COVID-19 Multi-label Fake News & Hate Speech Detection Dataset [J] . Mohamed Seghir Hadj Ameur, Hassina Aliane Procedia Computer Science . 2021,第a期

机译：Aracovid19-MFH：阿拉伯Covid-19多标签假新闻＆amp; 讨厌语音检测数据集
3. "Bend the truth": Benchmark dataset for fake news detection in Urdu language and its evaluation [J] . Amjad Maaz, Sidorov Grigori, Zhila Alisa, Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2020,第2Pta2期

机译：“弯曲真相”：乌尔都语语言中假新闻检测的基准数据集及其评估
4. Fake News Detection with the New German Dataset 'GermanFakeNC' [C] . Inna Vogel, Peter Jiang International conference on theory and practice of digital libraries . 2019

机译：使用新的德国数据集“ GermanFakeNC”进行假新闻检测
5. Hierarchical Attention Networks for Fake News Detection [D] . Jeong, Haeseung. 2021

机译：用于假新闻检测的分层关注网络
6. Fake news detection: a survey of evaluation datasets [O] . Arianna D’Ulizia, Maria Chiara Caschera, Fernando Ferri, 2021

机译：假新闻检测：评估数据集的调查
7. "Liar, Liar Pants on Fire": A New Benchmark Dataset for Fake News Detection [O] . Wang, William Yang 2017

机译：“Liar，Liar pants on Fire”：假新闻的新基准数据集发现

Fake News Detection with the New German Dataset 'GermanFakeNC'

摘要

著录项

相似文献

相关主题

期刊订阅