International Journal of Climatology: A Journal of the Royal Meteorological Society

Development and comparison of circulation type classifications using the COST 733 dataset and software


Abstract

To examine the correspondence between different methods of circulation type classification, a dataset of classification catalogs for 12 different European regions has been created using a specially developed software package. Twenty-seven basic automatic classification methods have been applied in several variants to different input datasets describing atmospheric circulation. Together with six manual classifications, a total of 33 methods are available for inter-comparison. Pattern correlation, frequency time-series correlation and the adjusted Rand index have been used for comparison. Highly significant correspondence has been detected for only two clustering techniques, while the remaining classification methods show surprisingly low similarity. A Monte-Carlo test with 1000 classifications of randomly defined types even shows that most of the methods are no more similar to one another than arbitrarily chosen types are. The predominant dissimilarity between the methods is interpreted as a result of a lack of inherent structure in the input data. Only simulated annealing clustering and self-organizing maps produce nearly identical results, because they can optimally fit the partitioning to the outer shape of the data cloud in the phase space. Methods based on pre-defined types also yield very different results, because small changes in the definition of thresholds may lead to large differences in the partitioning. It is concluded that, because of the missing inner structure of the data, there is no clear statistical reason to prefer any of the examined methods. For practice in synoptic climatology this means that finding a suitable classification for a certain purpose may require a broad comparison of methods. The software package cost733class for development, comparison and evaluation of classifications, which was developed and used in this study, is available to facilitate this task.
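
As a rough illustration of one of the comparison measures named in the abstract, the following sketch computes the adjusted Rand index between two classification catalogs, each given as one type label per day. It is not taken from cost733class. The function names `adjusted_rand_index` and `random_baseline` are hypothetical. The baseline assigns type labels purely at random, which is a simplification and may well differ from how the study's Monte-Carlo test defines its random types.

```python
import random
from collections import Counter
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Adjusted Rand index between two catalogs (one type label per day)."""
    if len(labels_a) != len(labels_b):
        raise ValueError("catalogs must cover the same days")
    total = comb(len(labels_a), 2)  # number of day pairs
    if total == 0:
        return 1.0
    # Pairs of days grouped together in both catalogs, in A, and in B.
    sum_cells = sum(comb(c, 2) for c in Counter(zip(labels_a, labels_b)).values())
    sum_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    sum_b = sum(comb(c, 2) for c in Counter(labels_b).values())
    expected = sum_a * sum_b / total  # agreement expected by chance
    max_index = 0.5 * (sum_a + sum_b)
    if max_index == expected:
        # Degenerate case, e.g. both catalogs put every day into one type.
        return 1.0
    return (sum_cells - expected) / (max_index - expected)


def random_baseline(n_days, n_types, n_trials=1000, seed=0):
    """ARI values between pairs of catalogs with randomly assigned labels.

    Assumption: random label assignment stands in for the study's
    "randomly defined types"; it is only a simple reference distribution.
    """
    rng = random.Random(seed)
    values = []
    for _ in range(n_trials):
        a = [rng.randrange(n_types) for _ in range(n_days)]
        b = [rng.randrange(n_types) for _ in range(n_days)]
        values.append(adjusted_rand_index(a, b))
    return values
```

One possible use is to compute the index for two real catalogs and check where it falls relative to the values returned by `random_baseline` for the same number of days and types. An index inside the bulk of that distribution would suggest the two catalogs agree no better than chance.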
