Conference: Multidisciplinary International Symposium on Disinformation in Open Online Media

Checkworthiness in Automatic Claim Detection Models: Definitions and Analysis of Datasets


Abstract

Public, professional and academic interest in automated fact-checking has drastically increased over the past decade, with many aiming to automate one of the first steps in a fact-check procedure: the selection of so-called checkworthy claims. However, there is little agreement on the definition and characteristics of checkworthiness among fact-checkers, which is consequently reflected in the datasets used for training and testing checkworthy claim detection models. After elaborate analysis of checkworthy claim selection procedures in fact-check organisations and analysis of state-of-the-art claim detection datasets, checkworthiness is defined as the concept of having a spatiotemporal and context-dependent worth and need to have the correctness of the objectivity it conveys verified. This is irrespective of the claim's perceived veracity judgement by an individual based on prior knowledge and beliefs. Concerning the characteristics of current datasets, it is argued that the data is not only highly imbalanced and noisy, but also too limited in scope and language. Furthermore, we believe that the subjective concept of checkworthiness might not be a suitable filter for claim detection.
