首页> 外文期刊>Computers & geosciences >A machine learning approach to quality control oceanographic data
【24h】

A machine learning approach to quality control oceanographic data

机译:机器学习方法质量控制海洋数据

获取原文
获取原文并翻译 | 示例
       

摘要

Sensor errors are inevitable when measuring the ocean; thus, a reliable dataset of observations requires a quality control (QC) procedure capable of detecting spurious measurements. While manual QC by human experts minimizes errors, it is inefficient to handle large datasets and vulnerable to inconsistencies between different experts. Although automatic QC addresses some of these issues, the traditional methods result in high rates of false positives. Here, I propose a machine learning approach to automatically QC oceanographic data based on the Anomaly Detection technique. Multiple tests are combined into a single multidimensional criterion that learns the behavior of the valid measurements and identifies bad samples as outliers. When applied to 13 years of hydrographic profiles, the Anomaly Detection resulted in the best classification performance, reducing the error by at least 50%. The Anomaly Detection approach introduced here was implemented in the Python package CoTeDe, an open-source framework to quality control oceanographic data.
机译:在测量海洋时,传感器错误是不可避免的;因此,观察的可靠数据集需要一种能够检测杂散测量的质量控制(QC)过程。虽然人类专家的手动QC最小化了错误,但是处理大型数据集并易受不同专家之间不一致的效率低下。虽然自动QC地址一些这些问题,但传统方法导致误报的高速度。在这里,我提出了一种基于异常检测技术自动QC海洋数据的机器学习方法。将多个测试组合成单个多维标准,用于了解有效测量的行为,并将错误的样本标识为异常值。当应用到13年的水文谱时,异常检测导致最佳分类性能,将误差减少至少50%。这里介绍的异常检测方法是在Python包Cotede中实现的,是质量控制海洋学数据的开放源框架。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号