Feature Clustering for Extreme Events Analysis, with Application to Extreme Stream-Flow Data

机译：用于极端事件分析的特征聚类，并应用于极端流数据

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The dependence structure of extreme events of multivariate nature plays a special role for risk management applications, in particular in hydrology (flood risk). In a high dimensional context (d > 50), a natural first step is dimension reduction. Analyzing the tails of a dataset requires specific approaches: earlier works have proposed a definition of sparsity adapted for extremes, together with an algorithm detecting such a pattern under strong sparsity assumptions. Given a dataset that exhibits no clear sparsity pattern we propose a clustering algorithm allowing to group together the features that are 'dependent at extreme level', i.e., that are likely to take extreme values simultaneously. To bypass the computational issues that arise when it comes to dealing with possibly O(2~d) subsets of features, our algorithm exploits the graphical structure stemming from the definition of the clusters, similarly to the Apriori algorithm, which reduces drastically the number of subsets to be screened. Results on simulated and real data show that our method allows a fast recovery of a meaningful summary of the dependence structure of extremes.

机译：多元性极端事件的依存结构在风险管理应用中，特别是在水文学（洪水风险）中起着特殊的作用。在高尺寸环境中（d> 50），自然的第一步是尺寸减小。分析数据集的尾部需要特定的方法：早期的工作提出了适用于极端情况的稀疏性的定义，以及在强稀疏性假设下检测这种模式的算法。给定一个没有清晰稀疏模式的数据集，我们提出了一种聚类算法，可以将``在极端级别上依赖''（即可能同时采用极端值）的特征组合在一起。为了绕开可能涉及特征的O（2〜d）子集时出现的计算问题，我们的算法采用了基于聚类定义的图形结构，类似于Apriori算法，该算法大大减少了Apriori算法的数量。要筛选的子集。模拟和真实数据的结果表明，我们的方法可以快速恢复极端依赖关系的有意义的摘要。

著录项

来源
《International workshop on new frontiers in mining complex patterns;European conference on machine learning and principles and practice of knowledge discovery in databases》|2017年|132-147|共16页
会议地点
作者
Maeel Chiapino; Anne Sabourin;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Extreme values; Dimension reduction; Pattern mining; Subspace clustering; Subgroup discovery;

机译：极端值;尺寸缩小;模式挖掘;子空间聚类;小组发现;

相似文献

外文文献
中文文献
专利

1. Statistical Modeling of Spatial Extremes through Max-Stable Process Models: Application to Extreme Rainfall Events in South Africa [J] . Tadele Akeba Diriba, Legesse Kassa Debusho Journal of hydrologic engineering . 2021,第10期

机译：最大稳定过程模型的空间极端统计建模：南非极端降雨事件的应用
2. Efficient Simulation for Dependent Rare Events with Applications to Extremes [J] . Andersen Lars Norvang, Laub Patrick J., Rojas-Nandayapa Leonardo Methodology and computing in applied probability . 2018,第1期

机译：在极端应用程序到极端的依赖稀有事件的高效模拟
3. Estimating generalized state density of near-extreme events and its applications in analyzing stock data [J] . Jin-Guan Lin, Chao Huang, Qing-Yun Zhuang, Insurance . 2010,第1期

机译：估计极端事件的广义状态密度及其在分析股票数据中的应用
4. Feature Clustering for Extreme Events Analysis, with Application to Extreme Stream-Flow Data [C] . Mael Chiapino, Anne Sabourin International Workshop on New Frontiers in Mining Complex Patterns . 2017

机译：具有极端事件分析的功能聚类，应用于极端流流数据
5. Modeling Extreme Heat Events Using Global Reanalysis Data: An Assessment of Current Climate Data and the Need for Improved Weather Station Data in Tennessee [D] . Reasons, John. 2019

机译：使用全球再分析数据建模极端热事件：对当前气候数据的评估以及对田纳西州改进的气象站数据的需求
6. Nonparametric Spatial Models for Extremes: Application to Extreme Temperature Data [O] . Montserrat Fuentes, John Henry, Brian Reich -1

机译：极端的非参数空间模型：应用于极端温度数据
7. Normalization Strategies for Enhancing Spatio-Temporal Analysis of Social Media Responses during Extreme Events: A Case Study based on Analysis of Four Extreme Events using Socio-Environmental Data Explorer (SEDE) [O] . J. Ajayakumar, E. Shook, V. K. Turner 2017

机译：在极端事件期间加强社交媒体响应的时空分析的规范化策略：基于使用社会环境数据资源管理器（sEDE）分析四个极端事件的案例研究

Feature Clustering for Extreme Events Analysis, with Application to Extreme Stream-Flow Data

摘要

著录项

相似文献

相关主题

期刊订阅