Royal Society Open Science

Data availability, reusability, and analytic reproducibility: evaluating the impact of a mandatory open data policy at the journal Cognition


Abstract

Access to data is a critical feature of an efficient, progressive and ultimately self-correcting scientific ecosystem. But the extent to which in-principle benefits of data sharing are realized in practice is unclear. Crucially, it is largely unknown whether published findings can be reproduced by repeating reported analyses upon shared data (‘analytic reproducibility’). To investigate this, we conducted an observational evaluation of a mandatory open data policy introduced at the journal Cognition. Interrupted time-series analyses indicated a substantial post-policy increase in data available statements (104/417, 25% pre-policy to 136/174, 78% post-policy), although not all data appeared reusable (23/104, 22% pre-policy to 85/136, 62% post-policy). For 35 of the articles determined to have reusable data, we attempted to reproduce 1324 target values. Ultimately, 64 values could not be reproduced within a 10% margin of error. For 22 articles all target values were reproduced, but 11 of these required author assistance. For 13 articles at least one value could not be reproduced despite author assistance. Importantly, there were no clear indications that original conclusions were seriously impacted. Mandatory open data policies can increase the frequency and quality of data sharing. However, suboptimal data curation, unclear analysis specification and reporting errors can impede analytic reproducibility, undermining the utility of data sharing and the credibility of scientific findings.
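
The "10% margin of error" criterion can be illustrated with a minimal sketch. The abstract does not give the exact formula, so the relative-error definition, the inclusive boundary, and the names `percent_error` and `is_reproduced` are assumptions made for illustration. The sample value pairs are hypothetical and not taken from the study. Only the counts 64 and 1324 come from the abstract.

```python
def percent_error(reported: float, reproduced: float) -> float:
    """Relative difference between a reproduced and a reported value, in percent.

    Assumed definition: |reproduced - reported| / |reported| * 100.
    """
    if reported == 0:
        # Relative error is undefined for a zero reference value;
        # treat an exact match as 0% and anything else as unbounded.
        return 0.0 if reproduced == 0 else float("inf")
    return abs(reproduced - reported) / abs(reported) * 100.0


def is_reproduced(reported: float, reproduced: float, margin_pct: float = 10.0) -> bool:
    """True if the reproduced value lies within the margin (inclusive, by assumption)."""
    return percent_error(reported, reproduced) <= margin_pct


# Hypothetical (reported, reproduced) pairs, for illustration only.
checks = [(0.50, 0.52), (12.0, 14.0), (3.10, 3.10)]
results = [is_reproduced(reported, got) for reported, got in checks]

# Share of target values not reproduced, using the counts stated in the abstract.
not_reproduced_share = 64 / 1324 * 100.0
print(results, round(not_reproduced_share, 1))
```

Under these assumptions the second pair fails the check, because a difference of 2 on a reported value of 12 is roughly 16.7%. The 64 values that could not be reproduced correspond to about 4.8% of the 1324 target values.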
