首页> 外文期刊>Expert systems with applications >Quantile cross-spectral density: A novel and effective tool for clustering multivariate time series
【24h】

Quantile cross-spectral density: A novel and effective tool for clustering multivariate time series

机译:定量串联光谱密度:用于聚类多变量时间序列的新颖有效工具

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Clustering of multivariate time series is a central problem in data mining with applications in many fields. Frequently, the clustering target is to identify groups of series generated by the same multivariate stochastic process. Most of the approaches to address this problem include a prior step of dimensionality reduction which may result in a loss of information or consider dissimilarity measures based on correlations and cross correlations but ignoring the serial dependence structure. We propose a novel approach to measure dissimilarity between multivariate time series aimed at jointly capturing both cross dependence and serial dependence. Specifically, each series is characterized by a set of matrices of estimated quantile cross-spectral densities, where each matrix corresponds to a pair of quantile levels. Then the dissimilarity between every couple of series is evaluated by comparing their estimated quantile cross-spectral densities, and the pairwise dissimilarity matrix is taken as starting point to develop a partitioning around medoids algorithm. Since the quantilebased cross-spectra capture dependence in quantiles of the joint distribution, the proposed metric has a high capability to discriminate between high-level dependence structures. An extensive simulation study shows that our clustering procedure outperforms a wide range of alternative methods and exhibits robustness to noise distribution besides being computationally efficient. A real data application involving bivariate financial time series illustrates the usefulness of the proposed approach. The procedure is also applied to cluster nonstationary series from the UEA multivariate time series classification archive.
机译:多变量时间序列的聚类是数据挖掘的核心问题,其中包含许多字段中的应用程序。通常,聚类目标是识别由相同多变量随机过程产生的系列组。解决这个问题的大多数方法包括预期减少的现有步骤,其可能导致信息丢失或基于相关性和交叉相关性,但忽略串行依赖结构。我们提出了一种新颖的方法来测量多变量时间序列之间的异化,旨在共同捕获交叉依赖性和串行依赖性。具体地,每个系列的特征在于一组估计量子交叉光谱密度的矩阵,其中每个矩阵对应于一对分量水平。然后通过比较其估计的量化横谱密度来评估每对族系列之间的相似性,并且将成对异化矩阵作为出发点,以在麦细算法周围开发分区。由于定量基于跨光谱捕获在关节分布的量级中的依赖性,所提出的度量具有高能力来区分高级依赖性结构。一个广泛的仿真研究表明,我们的聚类程序优于广泛的替代方法,并且除了计算效率之外,还表现出噪声分布的鲁棒性。涉及Bifariate金融时间序列的真实数据应用说明了所提出的方法的有用性。该过程还应用于来自UEA多变量时间序列分类档案的群集非间断系列。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号