Propaganda Identification Using Topic Modelling

Yakunin Kirill; Ionescu George Mihail; Murzakhmetov Sanzhar; Mussabayev Rustam; Filatova Olga; Mukhamediev Ravil

首页> 外文期刊>Procedia Computer Science >Propaganda Identification Using Topic Modelling

【24h】

Propaganda Identification Using Topic Modelling

机译：使用主题建模的宣传识别

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a method based on topic modelling for identifying texts with propagandistic content. The method is an attempt to incorporate transfer learning idea of obtaining effective vector representation from a large unlabeled or (semi-) automatically labelled dataset, while also attempting to minimize the amount of necessary manual expert labelling by introducing high level labelling (either manual or automatic) on some explicit document property. The proposed method includes four key stages: formation of corpus partitioning, computing a topic model of a united corpus, calculation of corpora imbalance estimates of each topic; extrapolating the results of the imbalance estimation on all documents. The method was cross-validated on a labelled subsample of 1000 news, and achieves high predictive power – ROC AUC 0.73.

机译：本文介绍了一种基于主题建模的方法，用于识别具有宣传内容的文本。该方法是一种尝试结合从大型未标记的或（半）自动标记的数据集获得有效矢量表示的传递学习理念，同时还试图通过引入高级标签（手动或自动）最小化必要的手动专家标签的数量）在一些明确的文件属性上。该方法包括四个关键阶段：组成语料库分区，计算联合组的主题模型，计算每个主题的基层不平衡估计;推断所有文件的不平衡估计结果。该方法在1000个新闻的标记子相位上交叉验证，实现了高预测功率 - ROC AUC 0.73。

著录项

来源
《Procedia Computer Science 》 |2020年第5期| 共8页
作者
Yakunin Kirill; Ionescu George Mihail; Murzakhmetov Sanzhar; Mussabayev Rustam; Filatova Olga; Mukhamediev Ravil;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词
propagandanatural language processingtopic modellingtext classificationmass media analysis;

机译：宣传语言正在处理特性模式ModellingText分类媒体分析;

相似文献

外文文献
中文文献
专利

1. Identification of Topics from Scientific Papers through Topic Modeling [J] . Denis Luiz Marcello Owa Open Journal of Applied Sciences . 2021 ,第4期

机译：通过主题建模识别科学论文的主题
2. Identification of Topics from Scientific Papers through Topic Modeling [J] . Denis Luiz Marcello Owa 应用科学（英文） . 2021 ,第004期

机译：通过主题建模识别科学论文的主题
3. Topics and emotions in Russian Twitter propaganda [J] . Daniel Taninecz Miller First Monday . 2019 ,第5期

机译：俄语Twitter宣传中的主题和情感
4. A Model of Propaganda Battle with Individuals’ Opinions on Topics Saliency [C] . Olga Proncheva International Conference on Management of large-scale system development . 2020

机译：个人关于话题显着性的宣传战模型
5. Topics in identification and inference in duration models. [D] . Szydlowski, Arkadiusz Marcin. 2014

机译：持续时间模型中的识别和推断主题。
6. Discovering Health Topics in Social Media Using Topic Models [O] . Michael J. Paul, Mark Dredze -1

机译：使用主题模型在社交媒体中发现健康主题
7. Identification of Topics from Scientific Papers through Topic Modeling [O] . Denis Luiz Marcello Owa 2021

机译：通过主题建模识别科学论文的主题
8. Grey-Box Modelling and Identification Topics [R] . Franciscus, H. J. A. 1992

机译：灰盒建模与识别主题

Propaganda Identification Using Topic Modelling

摘要

著录项

相似文献

相关主题

期刊订阅