首页> 外文期刊>Journal of Hydrology >A comparison of methods to avoid overfitting in neural networks training in the case of catchment runoff modelling
【24h】

A comparison of methods to avoid overfitting in neural networks training in the case of catchment runoff modelling

机译:流域径流建模中避免神经网络训练过拟合的方法比较

获取原文
获取原文并翻译 | 示例
           

摘要

Artificial neural networks (ANNs) becomes very popular tool in hydrology, especially in rainfall-runoff modelling. However, a number of issues should be addressed to apply this technique to a particular problem in an efficient way, including selection of network type, its architecture, proper optimization algorithm and a method to deal with overfitting of the data. The present paper addresses the last, rarely considered issue, namely comparison of methods to prevent multi-layer perceptron neural networks from overfitting of the training data in the case of daily catchment runoff modelling. Among a number of methods to avoid overfitting the early stopping, the noise injection and the weight decay have been known for about two decades, however only the first one is frequently applied in practice. Recently a new methodology called optimized approximation algorithm has been proposed in the literature.Overfitting of the training data leads to deterioration of generalization properties of the model and results in its untrustworthy performance when applied to novel measurements. Hence the purpose of the methods to avoid overfitting is somehow contradictory to the goal of optimization algorithms, which aims at finding the best possible solution in parameter space according to pre-defined objective function and available data. Moreover, different optimization algorithms may perform better for simpler or larger ANN architectures. This suggest the importance of proper coupling of different optimization algorithms, ANN architectures and methods to avoid overfitting of real-world data - an issue that is also studied in details in the present paper.The study is performed for Annapolis River catchment, characterized by significant seasonal changes in runoff, rapid floods during winter and spring, moderately dry summers, severe winters with snowfall, snow melting, frequent freeze and thaw, and presence of river ice. The present paper shows that the elaborated noise injection method may prevent overfitting slightly better than the most popular early stopping approach. However, the implementation of noise injection to real-world problems is difficult and the final model performance depends significantly on a number of very technical details, what somehow limits its practical applicability. It is shown that optimized approximation algorithm does not improve the results obtained by older methods, possibly due to over-simplified criterion of stopping the algorithm. Extensive calculations reveal that Evolutionary Computation-based algorithm performs better for simpler ANN architectures, whereas classical gradient-based Levenberg-Marquardt algorithm is able to benefit from additional input variables, representing precipitation and snow cover from one more previous day, and from more complicated ANN architectures. This confirms that the curse of dimensionality has severe impact on the performance of Evolutionary Computing methods.
机译:人工神经网络(ANN)成为水文学中非常流行的工具,尤其是在降雨径流模型中。但是,应解决许多问题,以便以有效方式将此技术应用于特定问题,包括网络类型的选择,其体系结构,适当的优化算法以及处理数据过度拟合的方法。本文讨论了最后一个很少考虑的问题,即在日常集水径流建模的情况下,防止多层感知器神经网络过度拟合训练数据的方法的比较。在许多避免过早拟合的方法中,已知噪声注入和重量衰减大约已有二十年了,但实际上只有第一种方法经常使用。最近,文献中提出了一种称为优化逼近算法的新方法,训练数据的过度拟合会导致模型的泛化性能下降,并且在应用于新型测量时会导致其不可信赖的性能。因此,避免过度拟合的方法的目的在某种程度上与优化算法的目标相矛盾,后者旨在根据预定义的目标函数和可用数据在参数空间中找到最佳的解决方案。此外,对于更简单或更大的ANN架构,不同的优化算法可能会表现更好。这表明正确耦合不同的优化算法,ANN架构和方法以避免真实数据过度拟合的重要性-这个问题也在本文中进行了详细研究。径流的季节性变化,冬季和春季的洪水泛滥,夏季中度干燥,严峻的冬季降雪,融雪,频繁的冻融,以及河冰的存在。本文表明,精心设计的噪声注入方法可能比最流行的早期停止方法更好地防止过拟合。但是,对现实问题进行噪声注入是困难的,最终模型的性能很大程度上取决于许多非常技术的细节,这在某种程度上限制了它的实际适用性。结果表明,优化的逼近算法并不能改善通过较旧方法获得的结果,这可能是由于停止算法的准则过于简化所致。大量的计算表明,基于进化计算的算法在较简单的人工神经网络体系结构中表现更好,而基于经典梯度的Levenberg-Marquardt算法能够受益于其他输入变量,这些变量代表前一天的降水和积雪以及更复杂的人工神经网络建筑。这证实了维数的诅咒对进化计算方法的性能产生了严重影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号