Journal: Information Processing & Management

Proppy: Organizing the news based on their propagandistic content

Abstract

Propaganda is a mechanism to influence public opinion, and it is inherently present in extremely biased and fake news. Here, we propose a model to automatically assess the level of propagandistic content in an article based on different representations, from writing style and readability level to the presence of certain keywords. We experiment thoroughly with different variations of such a model on a new publicly available corpus, and we show that character n-grams and other style features outperform existing alternatives for identifying propaganda that are based on word n-grams. Unlike previous work, we make sure that the test data comes from news sources that were unseen during training, thus penalizing learning algorithms that model the news sources used at training time as opposed to solving the actual task. We integrate our supervised model into a public website, which organizes recent articles covering the same event on the basis of their propagandistic content. This allows users to quickly explore different perspectives on the same story, and it also enables investigative journalists to dig further into how different media use stories and propaganda to pursue their agendas.
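
As a rough illustration of two ideas mentioned in the abstract, character n-gram features and a test set drawn only from unseen news sources, here is a minimal sketch. It is not the authors' implementation: the n-gram range, the helper names `char_ngrams` and `split_by_source`, the dictionary fields, and the toy articles are all assumptions made for illustration.

```python
from collections import Counter


def char_ngrams(text, n_min=2, n_max=4):
    """Count character n-grams of lengths n_min..n_max in lowercased text.

    The 2..4 range is an assumed default, not a value from the paper.
    """
    text = text.lower()
    counts = Counter()
    for n in range(n_min, n_max + 1):
        for i in range(len(text) - n + 1):
            counts[text[i:i + n]] += 1
    return counts


def split_by_source(articles, test_sources):
    """Split articles so that no news source appears in both train and test."""
    train = [a for a in articles if a["source"] not in test_sources]
    test = [a for a in articles if a["source"] in test_sources]
    return train, test


# Toy, made-up articles; source names and labels are placeholders.
articles = [
    {"source": "site-a", "text": "They will DESTROY everything!", "label": 1},
    {"source": "site-a", "text": "Wake up before it is too late!", "label": 1},
    {"source": "site-b", "text": "The council approved the budget.", "label": 0},
    {"source": "site-c", "text": "Officials met on Tuesday.", "label": 0},
    {"source": "site-d", "text": "Shocking betrayal of the people!", "label": 1},
]

train, test = split_by_source(articles, test_sources={"site-c", "site-d"})

# Sources seen on each side of the split; they should not overlap.
train_sources = {a["source"] for a in train}
held_out_sources = {a["source"] for a in test}

# One sparse feature vector (n-gram -> count) per training article.
train_features = [char_ngrams(a["text"]) for a in train]
```

Holding out whole sources, rather than random articles, means a classifier that only memorizes how a particular outlet writes gets no credit at test time, which is the concern the abstract raises.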
