...
首页> 外文期刊>Hydrology and Earth System Sciences >Identifying rainfall-runoff events in discharge time series: a data-driven method based on information theory
【24h】

Identifying rainfall-runoff events in discharge time series: a data-driven method based on information theory

机译:识别放电时间序列中的降雨径流事件:基于信息理论的数据驱动方法

获取原文
获取原文并翻译 | 示例

摘要

In this study, we propose a data-driven approach for automatically identifying rainfall-runoff events in discharge time series. The core of the concept is to construct and apply discrete multivariate probability distributions to obtain probabilistic predictions of each time step that is part of an event. The approach permits any data to serve as predictors, and it is non-parametric in the sense that it can handle any kind of relation between the predictor(s) and the target. Each choice of a particular predictor data set is equivalent to formulating a model hypothesis. Among competing models, the best is found by comparing their predictive power in a training data set with user-classified events. For evaluation, we use measures from information theory such as Shannon entropy and conditional entropy to select the best predictors and models and, additionally, measure the risk of overfitting via cross entropy and Kullback-Leibler divergence. As all these measures are expressed in "bit", we can combine them to identify models with the best tradeoff between predictive power and robustness given the available data.
机译:在本研究中,我们提出了一种数据驱动方法,用于自动识别放电时间序列中的降雨径流事件。该概念的核心是构造和应用离散多变量概率分布,以获得作为事件的一部分的每个时间步骤的概率预测。该方法允许任何数据用作预测器,并且它是非参数,即它可以处理预测器和目标之间的任何类型的关系。每个特定预测器数据集的每个选择相当于制定模型假设。在竞争模型中,通过将它们的预测电力与用户分类事件的训练数据集进行比较来找到最好的。为了评估,我们使用来自Shannon Entopy和条件熵等信息理论的措施,以选择最佳的预测因子和模型,并且另外,通过交叉熵和Kullback-Leibler发散来衡量过度装备的风险。由于所有这些措施都以“位”表示,我们可以将它们组合以识别具有在可用数据的预测功率和鲁棒性之间具有最佳权衡的模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号