首页> 外文会议>International conference on theory and practice of digital libraries >Fake News Detection with the New German Dataset 'GermanFakeNC'
【24h】

Fake News Detection with the New German Dataset 'GermanFakeNC'

机译:用新的德国DataSet'demberFakenc'假新闻检测

获取原文

摘要

The spread of misleading information and "alternative facts" on the internet gained in the last decade considerable importance worldwide. In recent years, several attempts have been made to counteract fake news based on automatic classification via machine; learning models. These, however, require labeled data. The scarcity of available corpora for predictive modeling is a major stumbling block in this field, especially in other languages than English. Our contribution is twofold. First, we introduce a new publicly available German dataset "German Fake News Corpus" (GermanFakeNC) for the task of fake news detection which consists of 490 manually fact-checked articles. Every false statement in the text is verified claim-by-claim by authoritative sources. Our ground truth for trustworthy news consists of 4,500 news articles from well-known mainstream news publishers. With regard to the second contribution, we choose a Convolutional Neural Network (CNN) (k = 0.89) and the widely used SVM (k = 0.72) technique to detect fake news. Thus we hope that our approach will stimulate the progress in fake news detection and claim verification across languages.
机译:误导信息和“替代事实”在互联网上的差异在过去十年中获得了全球的重要性。近年来,已经采取了几次尝试根据通过机器自动分类抵消假新闻;学习模型。但是,这些需要标记数据。可测量的预测建模的稀缺性是该领域的主要绊脚石,尤其是其他语言而不是英语。我们的贡献是双重的。首先,我们为假新闻检测的任务介绍了一个新的公开可用的德国数据集“德国假新闻语料库”(GermanFakenc),该任务由490个手动检查的文章组成。案文中的每一个虚假陈述都由权威来源索赔索赔。我们为值得信赖的新闻的基础事实由知名主流新闻出版商的4,500条新闻文章组成。关于第二种贡献,我们选择卷积神经网络(CNN)(K = 0.89)和广泛使用的SVM(k = 0.72)技术来检测假新闻。因此,我们希望我们的方法能够刺激假新闻检测和跨语言的索赔核查的进展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号