...
首页> 外文期刊>Limnology and oceanography, methods >A data mining approach to evaluate suitability of dissolved oxygen sensor observations for lake metabolism analysis
【24h】

A data mining approach to evaluate suitability of dissolved oxygen sensor observations for lake metabolism analysis

机译:一种评价溶解氧传感器观测湖泊代谢分析的适用性的数据挖掘方法

获取原文
获取原文并翻译 | 示例
           

摘要

Despite rapid growth in continuous monitoring of dissolved oxygen for lake metabolism studies, the current best practice still relies on visual assessment and manual data filtering of sensor observations by experienced scientists in order to achieve meaningful results. This time consuming approach is fraught with potential for inconsistency and individual subjectivity. An automated method to assure the quality of data for the purpose of metabolism modeling is clearly needed to obtain consistent results representative of collective expertise. We used a hybrid approach of expert panel and data mining for data filtration. Symbolic Aggregate approXimation (SAX) treats discretized numerical timeseries segments as symbolic indications, creating a series of strings which are literally comparable to human words and sentences. This conversion allows established text mining techniques, such as classification methods to be applied to timeseries data. Half-hourly frequency surface dissolved oxygen data from 18 global lakes were used to create day-long segments of the original time series data. Three hundred sets of 1-d measurements were provided to a group of seven anonymous experts, experienced in manual filtering of oxygen data for metabolism modeling studies. The collective results were treated as expert panel decisions, and were used to rank the data by confidence level for use in metabolism calculations. While considerable variation occurred in the way the experts perceived the quality of the data, the model provides an objective and quantitative assessment method. The program output will assist the decision making process in determining whether data should be used for metabolism calculations. An R version of the program is available for download.
机译:尽管对湖泊代谢研究的溶解氧持续监测不断增长,但目前的最佳实践仍然依赖于经验丰富的科学家对传感器观测的视觉评估和手动数据过滤,以实现有意义的结果。这种耗时的方法充满了不一致和个人主体性的潜力。为了确保用于代谢建模目的的数据质量的自动化方法,以获得一致的结果代表集体专业知识。我们使用了一个专家面板的混合方法和数据挖掘进行数据过滤。符号聚合近似(SAX)将离散的数值段视为符号指示的离散数值,创建一系列字符串,这些字符串实际上与人类单词和句子相当。此转换允许建立的文本挖掘技术,例如要应用于TimeSeries数据的分类方法。从18个全球湖泊的半小时频率表面溶解氧数据用于创造原始时间序列数据的日期段。为一组七个匿名专家提供了三百套的1-D测量,在手动过滤氧气数据中进行代谢模拟研究。集体结果被视为专家面板决策,并用于通过置信水平对数据进行排名,以便在新陈代谢计算中使用。虽然专家察觉数据质量的方式发生了相当大的变化,但该模型提供了一种客观和定量评估方法。程序输出将帮助决策过程确定是否应用于代谢计算。 R版本可用于下载。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号