【24h】

Feature Clustering for Extreme Events Analysis, with Application to Extreme Stream-Flow Data

机译:用于极端事件分析的特征聚类,并应用于极端流数据

获取原文

摘要

The dependence structure of extreme events of multivariate nature plays a special role for risk management applications, in particular in hydrology (flood risk). In a high dimensional context (d > 50), a natural first step is dimension reduction. Analyzing the tails of a dataset requires specific approaches: earlier works have proposed a definition of sparsity adapted for extremes, together with an algorithm detecting such a pattern under strong sparsity assumptions. Given a dataset that exhibits no clear sparsity pattern we propose a clustering algorithm allowing to group together the features that are 'dependent at extreme level', i.e., that are likely to take extreme values simultaneously. To bypass the computational issues that arise when it comes to dealing with possibly O(2~d) subsets of features, our algorithm exploits the graphical structure stemming from the definition of the clusters, similarly to the Apriori algorithm, which reduces drastically the number of subsets to be screened. Results on simulated and real data show that our method allows a fast recovery of a meaningful summary of the dependence structure of extremes.
机译:多元性极端事件的依存结构在风险管理应用中,特别是在水文学(洪水风险)中起着特殊的作用。在高尺寸环境中(d> 50),自然的第一步是尺寸减小。分析数据集的尾部需要特定的方法:早期的工作提出了适用于极端情况的稀疏性的定义,以及在强稀疏性假设下检测这种模式的算法。给定一个没有清晰稀疏模式的数据集,我们提出了一种聚类算法,可以将``在极端级别上依赖''(即可能同时采用极端值)的特征组合在一起。为了绕开可能涉及特征的O(2〜d)子集时出现的计算问题,我们的算法采用了基于聚类定义的图形结构,类似于Apriori算法,该算法大大减少了Apriori算法的数量。要筛选的子集。模拟和真实数据的结果表明,我们的方法可以快速恢复极端依赖关系的有意义的摘要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号