首页> 外文期刊>BMC Medical Research Methodology >Facilitating harmonized data quality assessments. A data quality framework for observational health research data collections with software implementations in R
【24h】

Facilitating harmonized data quality assessments. A data quality framework for observational health research data collections with software implementations in R

机译:促进统一的数据质量评估。 具有r的软件实现的观测健康研究数据收集的数据质量框架

获取原文
       

摘要

No standards exist for the handling and reporting of data quality in health research. This work introduces a data quality framework for observational health research data collections with supporting software implementations to facilitate harmonized data quality assessments. Developments were guided by the evaluation of an existing data quality framework and literature reviews. Functions for the computation of data quality indicators were written in R. The concept and implementations are illustrated based on data from the population-based Study of Health in Pomerania (SHIP). The data quality framework comprises 34 data quality indicators. These target four aspects of data quality: compliance with pre-specified structural and technical requirements (integrity); presence of data values (completeness); inadmissible or uncertain data values and contradictions (consistency); unexpected distributions and associations (accuracy). R functions calculate data quality metrics based on the provided study data and metadata and R Markdown reports are generated. Guidance on the concept and tools is available through a dedicated website. The presented data quality framework is the first of its kind for observational health research data collections that links a formal concept to implementations in R. The framework and tools facilitate harmonized data quality assessments in pursue of transparent and reproducible research. Application scenarios comprise data quality monitoring while a study is carried out as well as performing an initial data analysis before starting substantive scientific analyses but the developments are also of relevance beyond research.
机译:卫生研究中数据质量的处理和报告没有任何标准。这项工作引入了用于观察健康研究数据收集的数据质量框架,支持软件实现,以促进统一的数据质量评估。通过评估现有数据质量框架和文学评论的评估指导的发展。数据质量指标计算的功能是以R编写的。概念和实施是基于来自Pomerania(船舶)的人口健康研究的数据。数据质量框架包括34个数据质量指示符。这些目标的数据质量四个方面:遵守预先规定的结构和技术要求(完整性);数据值(完整性);不可受理或不确定的数据值和矛盾(一致性);意外的分布和关联(准确性)。 R功能根据提供的研究数据和元数据计算数据质量指标,并生成RACKDOWS报告。有关概念和工具的指导可通过专用网站获得。呈现的数据质量框架是它的第一个观察健康研究数据收集,这些研究数据收集将正式概念与R的实施联系起来。框架和工具促进了透明和可重复研究的统一数据质量评估。应用场景包括数据质量监测,同时进行研究以及在开始实质性科学分析之前进行初始数据分析,但发展也与研究相比也是相关性的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号