首页> 外文会议>IEEE International Conference on Semantic Computing >Classifying Fake News Articles Using Natural Language Processing to Identify In-Article Attribution as a Supervised Learning Estimator
【24h】

Classifying Fake News Articles Using Natural Language Processing to Identify In-Article Attribution as a Supervised Learning Estimator

机译:使用自然语言处理对虚假新闻文章进行分类,以将文章内归因识别为监督学习估计者

获取原文

摘要

Intentionally deceptive content presented under the guise of legitimate journalism is a worldwide information accuracy and integrity problem that affects opinion forming, decision making, and voting patterns. Most so-called `fake news' is initially distributed over social media conduits like Facebook and Twitter and later finds its way onto mainstream media platforms such as traditional television and radio news. The fake news stories that are initially seeded over social media platforms share key linguistic characteristics such as making excessive use of unsubstantiated hyperbole and non-attributed quoted content. In this paper, the results of a fake news identification study that documents the performance of a fake news classifier are presented. The Textblob, Natural Language, and SciPy Toolkits were used to develop a novel fake news detector that uses quoted attribution in a Bayesian machine learning system as a key feature to estimate the likelihood that a news article is fake. The resultant process precision is 63.333% effective at assessing the likelihood that an article with quotes is fake. This process is called influence mining and this novel technique is presented as a method that can be used to enable fake news and even propaganda detection. In this paper, the research process, technical analysis, technical linguistics work, and classifier performance and results are presented. The paper concludes with a discussion of how the current system will evolve into an influence mining system.
机译:以合法新闻为幌子提供的故意欺骗性内容是一个全球性的信息准确性和完整性问题,会影响意见的形成,决策和投票方式。多数所谓的“假新闻”最初是通过社交媒体渠道(如Facebook和Twitter)分发的,后来又进入了传统电视和广播新闻等主流媒体平台。最初在社交媒体平台上播撒的虚假新闻故事具有关键的语言特征,例如过度使用未经证实的夸张和未经引用的引用内容。本文提出了伪造新闻识别研究的结果,该研究记录了伪造新闻分类器的性能。 Textblob,Natural Language和SciPy工具包用于开发新颖的假新闻检测器,该检测器使用贝叶斯机器学习系统中的引用属性作为主要功能来估计新闻是假的可能性。在评估带有引号的文章是假的可能性时,所得的处理精度为63.333%。此过程称为影响力挖掘,此新技术作为一种可用于启用假新闻甚至宣传检测的方法而提出。本文介绍了研究过程,技术分析,技术语言学工作以及分类器的性能和结果。本文最后讨论了当前系统将如何演变成影响挖掘系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号