首页> 外文会议>International Conference on Emerging eLearning Technologies and Applications >Annotated dataset for the fake news classification in Slovak language
【24h】

Annotated dataset for the fake news classification in Slovak language

机译:斯洛伐克语中假新闻分类的注释数据集

获取原文

摘要

Fake news detection currently presents an active field of research. Detection methods based on natural language processing and machine learning are being developed to automatically identify the possible misinformation contained within the news articles. To successfully train these models, annotated data are needed. In English language, multiple human-annotated datasets already are available and are being widely used in the research. The main objective of the work presented in this paper, was to create similar dataset consisting of articles in Slovak language. We collected the data from the various local news portals including reputable publishers as well as suspicious conspiratory portals. To obtain the annotations, we used crowdsourcing approach. Annotated dataset was used in preliminary experiments, in which neural network classifier was trained and evaluated.
机译:假新闻检测目前提出了一个活跃的研究领域。正在开发基于自然语言处理和机器学习的检测方法以自动识别新闻文章中包含的可能的错误信息。要成功培训这些模型,需要注释数据。在英语中,已经提供了多个人类注释的数据集,并广泛用于研究。本文提出的工作的主要目标是创建类似的数据集,由斯洛伐克语中的文章组成。我们从各种本地新闻门户网站中收集了数据,包括信誉良好的出版商以及可疑的引入门户网站。为了获得注释,我们使用了众包方法。注释数据集用于初步实验,其中培训和评估神经网络分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号