A typology of different development and testing options for symbolic regression modelling of measured and calculated datasets

Darren J. Beriro; Robert J. Abrahart; C. Paul Nathanail; Jimmy Moreno; A. Salim Bawazir

Abstract

Data-driven modelling is used to develop two alternative types of predictive environmental model: a simulator, which is a model of a real-world process developed from a conceptual understanding of physical relations and/or from measured records, and an emulator, which imitates some other model and is developed on predicted outputs calculated by that source model. A simple four-way typology, the Emulation Simulation Typology (EST), is proposed that distinguishes between (i) model type and (ii) different uses of model development period and model test period datasets. To address the extent to which simulator and emulator solutions might be considered interchangeable, i.e. provide similar levels of output accuracy when tested on data different from that used in their development, a pair of counterpart pan evaporation models was created using symbolic regression. Each model type delivered levels of predictive skill similar to those of other published solutions. Input-output sensitivity analysis of the two model types likewise confirmed two very similar underlying response functions. This study demonstrates that the type and quality of data on which a model is tested have a greater influence on model accuracy assessment than the type and quality of data on which a model is developed, provided that the development record is sufficiently representative of the conceptual underpinnings of the system being examined. Thus, previously reported substantial disparities in goodness-of-fit statistics for pan evaporation models are most likely explained by the use of either measured or calculated data to test particular models, where lower scores do not necessarily represent major deficiencies in the solution itself.
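
The four development and testing combinations can be illustrated with a toy sketch. It is not the authors' method: ordinary least squares stands in for symbolic regression, `source_model` is a hypothetical equation, the data are synthetic, and reading the typology as model type crossed with test data type is an assumption based on the abstract alone. Under those assumptions, the score depends far more on whether the test target is measured or calculated than on which model type is scored.

```python
import numpy as np

rng = np.random.default_rng(0)

def source_model(x):
    # Hypothetical stand-in for a physically based equation (assumption).
    return 2.0 + 0.5 * x[:, 0] + 0.3 * x[:, 1]

def make_period(n):
    # Synthetic inputs; "measured" values are calculated values plus noise.
    x = rng.uniform(0.0, 10.0, size=(n, 2))
    calculated = source_model(x)
    measured = calculated + rng.normal(0.0, 1.0, size=n)
    return x, measured, calculated

def fit_linear(x, y):
    # Ordinary least squares, used here in place of symbolic regression.
    design = np.column_stack([np.ones(len(x)), x])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef

def predict(coef, x):
    return np.column_stack([np.ones(len(x)), x]) @ coef

def rmse(y_true, y_pred):
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

# Separate development and test periods.
x_dev, meas_dev, calc_dev = make_period(500)
x_test, meas_test, calc_test = make_period(500)

models = {
    "simulator": fit_linear(x_dev, meas_dev),  # developed on measured records
    "emulator": fit_linear(x_dev, calc_dev),   # developed on source-model output
}
targets = {"measured": meas_test, "calculated": calc_test}

# One score per cell of the four-way table.
scores = {
    (model_name, target_name): rmse(y, predict(coef, x_test))
    for model_name, coef in models.items()
    for target_name, y in targets.items()
}

for (model_name, target_name), score in scores.items():
    print(f"{model_name:9s} tested on {target_name:10s} data: RMSE = {score:.3f}")
```

In this sketch both models score poorly against noisy measured data and well against calculated data, which mirrors the abstract's point that a lower score need not indicate a deficient model.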

Bibliographic details

Source: Environmental Modelling & Software, 2013, Issue 9, pp. 29-41 (13 pages)

Authors: Darren J. Beriro; Robert J. Abrahart; C. Paul Nathanail; Jimmy Moreno; A. Salim Bawazir

Author affiliations:

School of Geography, University of Nottingham, Nottingham NG7 2RD, UK;

School of Geography, University of Nottingham, Nottingham NG7 2RD, UK;

School of Geography, University of Nottingham, Nottingham NG7 2RD, UK;

Department of Civil Engineering, New Mexico State University, Box 30001, MSC 3CE, Las Cruces, NM 88003-0001, USA;

Department of Civil Engineering, New Mexico State University, Box 30001, MSC 3CE, Las Cruces, NM 88003-0001, USA;

Indexed in: Science Citation Index (SCI); Engineering Index (EI)

Original format: PDF

Language: English

Keywords: Simulation; Emulation; Pan evaporation; Gene expression programming; Data-driven modelling; Emulation simulation typology; Symbolic regression

