首页> 美国卫生研究院文献>BMC Medical Research Methodology >The utility of multivariate outlier detection techniques for data quality evaluation in large studies: an application within the ONDRI project
【2h】

The utility of multivariate outlier detection techniques for data quality evaluation in large studies: an application within the ONDRI project

机译:多变量离群值检测技术在大型研究中的数据质量评估:在ONDRI项目中的应用

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

BackgroundLarge and complex studies are now routine, and quality assurance and quality control (QC) procedures ensure reliable results and conclusions. Standard procedures may comprise manual verification and double entry, but these labour-intensive methods often leave errors undetected. Outlier detection uses a data-driven approach to identify patterns exhibited by the majority of the data and highlights data points that deviate from these patterns. Univariate methods consider each variable independently, so observations that appear odd only when two or more variables are considered simultaneously remain undetected. We propose a data quality evaluation process that emphasizes the use of multivariate outlier detection for identifying errors, and show that univariate approaches alone are insufficient. Further, we establish an iterative process that uses multiple multivariate approaches, communication between teams, and visualization for other large-scale projects to follow.
机译:背景技术现在,大型和复杂的研究已成为常规,质量保证和质量控制(QC)程序可确保可靠的结果和结论。标准程序可能包括手动验证和重复输入,但是这些劳动密集型方法通常会使错误无法发现。离群值检测使用数据驱动的方法来识别大多数数据显示的模式,并突出显示与这些模式不同的数据点。单变量方法独立地考虑每个变量,因此仅当同时考虑两个或多个变量时才出现奇怪的观测值,但仍未被发现。我们提出了一种数据质量评估过程,该过程强调使用多变量离群值检测来识别错误,并表明仅单变量方法是不够的。此外,我们建立了一个迭代过程,该过程使用多个多元方法,团队之间的沟通以及可视化的其他大型项目。

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号