...
首页> 外文期刊>Environmental Modelling & Software >A typology of different development and testing options for symbolic regression modelling of measured and calculated datasets
【24h】

A typology of different development and testing options for symbolic regression modelling of measured and calculated datasets

机译:用于测量和计算的数据集的符号回归建模的不同开发和测试选项的类型

获取原文
获取原文并翻译 | 示例
           

摘要

Data-driven modelling is used to develop two alternative types of predictive environmental model: a simulator, a model of a real-world process developed from either a conceptual understanding of physical relations and/or using measured records, and an emulator, an imitator of some other model developed on predicted outputs calculated by that source model. A simple four-way typology called Emulation Simulation Typology (EST) is proposed that distinguishes between (ⅰ) model type and (ⅱ) different uses of model development period and model test period datasets. To address the question of to what extent simulator and emulator solutions might be considered interchangeable i.e. provide similar levels of output accuracy when tested on data different from that used in their development, a pair of counterpart pan evaporation models was created using symbolic regression. Each model type delivered similar levels of predictive skill to that other of published solutions. Input-output sensitivity analysis of the two different model types likewise confirmed two very similar underlying response functions. This study demonstrates that the type and quality of data on which a model is tested, has a greater influence on model accuracy assessment, than the type and quality of data on which a model is developed, providing that the development record is sufficiently representative of the conceptual underpinnings of the system being examined. Thus, previously reported substantial disparities occurring in goodness-of-fit statistics for pan evaporation models are most likely explained by the use of either measured or calculated data to test particular models, where lower scores do not necessarily represent major deficiencies in the solution itself.
机译:数据驱动的建模用于开发两种替代类型的预测性环境模型:模拟器,通过对物理关系的概念性理解和/或使用测量的记录而开发的现实世界过程的模型,以及模拟器,在由该源模型计算的预测输出上开发的其他模型。提出了一种简单的四向分类法,称为仿真模拟分类法(EST),该方法可以区分(ⅰ)模型类型和(ⅱ)模型开发期间和模型测试期间数据集的不同用途。为了解决模拟器和仿真器解决方案在多大程度上可以互换的问题,即在使用与开发中使用的数据不同的数据进行测试时提供相似级别的输出精度,因此使用符号回归创建了一对对应的锅蒸发模型。每种模型类型都提供与其他已发布解决方案相似的预测技能水平。两种不同模型类型的输入输出灵敏度分析同样证实了两个非常相似的潜在响应函数。这项研究表明,测试模型的数据类型和质量比开发模型的数据类型和质量对模型准确性评估的影响更大,前提是开发记录足以代表模型的准确性。被检查系统的概念基础。因此,以前报告的锅蒸发模型拟合优度统计中出现的重大差异很可能是通过使用测量数据或计算数据来测试特定模型来解释的,其中较低的分数不一定代表解决方案本身的主要缺陷。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号