首页> 外文期刊>Risk analysis >Improved Transferability of Data-Driven Damage Models Through Sample Selection Bias Correction
【24h】

Improved Transferability of Data-Driven Damage Models Through Sample Selection Bias Correction

机译:通过样本选择偏压校正提高数据驱动损坏模型的可转换性

获取原文
获取原文并翻译 | 示例
           

摘要

Damage models for natural hazards are used for decision making on reducing and transferring risk. The damage estimates from these models depend on many variables and their complex sometimes nonlinear relationships with the damage. In recent years, data-driven modeling techniques have been used to capture those relationships. The available data to build such models are often limited. Therefore, in practice it is usually necessary to transfer models to a different context. In this article, we show that this implies the samples used to build the model are often not fully representative for the situation where they need to be applied on, which leads to a "sample selection bias." In this article, we enhance data-driven damage models by applying methods, not previously applied to damage modeling, to correct for this bias before the machine learning (ML) models are trained. We demonstrate this with case studies on flooding in Europe, and typhoon wind damage in the Philippines. Two sample selection bias correction methods from the ML literature are applied and one of these methods is also adjusted to our problem. These three methods are combined with stochastic generation of synthetic damage data. We demonstrate that for both case studies, the sample selection bias correction techniques reduce model errors, especially for the mean bias error this reduction can be larger than 30%. The novel combination with stochastic data generation seems to enhance these techniques. This shows that sample selection bias correction methods are beneficial for damage model transfer.
机译:用于自然危害的损坏模型用于减少和转移风险的决策。这些模型的损害估计依赖于许多变量及其复杂的有时与损坏的非线性关系。近年来,已经使用了数据驱动的建模技术来捕获这些关系。构建此类模型的可用数据通常有限。因此,实际上通常需要将模型传输到不同的上下文。在本文中,我们表明这意味着用于构建模型的样本通常不会完全代表他们需要应用的情况,这导致“样本选择偏差”。在本文中,我们通过应用方法来增强数据驱动损坏模型,未以前应用于损坏建模,在机器学习(ML)培训之前纠正此偏差。我们用欧洲洪水案例展示了这一点,菲律宾中台风风损坏。应用来自ML文献的两个样品选择偏压校正方法,其中一种方法也调整到我们的问题。这三种方法与随机生成的合成损伤数据相结合。我们证明,对于这两种情况,样本选择偏压校正技术会降低模型误差,特别是对于平均偏置误差,该减少可以大于30%。与随机数据生成的新组合似乎增强了这些技术。这表明采样选择偏压校正方法有利于损坏模型转移。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号