...
首页> 外文期刊>Knowledge and Information Systems >Pattern-based time-series subsequence clustering using radial distribution functions
【24h】

Pattern-based time-series subsequence clustering using radial distribution functions

机译:使用径向分布函数的基于模式的时间序列子序列聚类

获取原文
获取原文并翻译 | 示例
           

摘要

Clustering of time series subsequence data commonly produces results that are unspecific to the data set. This paper introduces a clustering algorithm, that creates clusters exclusively from those subsequences that occur more frequently in a data set than would be expected by random chance. As such, it partially adopts a pattern mining perspective into clustering. When subsequences are being labeled based on such clusters, they may remain without label. In fact, if the clustering was done on an unrelated time series it is expected that the subsequences should not receive a label. We show that pattern-based clusters are indeed specific to the data set for 7 out of 10 real-world sets we tested, and for window-lengths up to 128 time points. While kernel-density-based clustering can be used to find clusters with similar properties for window sizes of 8–16 time points, its performance degrades fast for increasing window sizes.
机译:时间序列子序列数据的聚类通常会产生非特定于数据集的结果。本文介绍了一种聚类算法,该算法专门从那些在数据集中出现的子序列比随机机会所期望的频率更高的子序列专门创建聚类。因此,它在集群中部分采用了模式挖掘的观点。当基于此类簇对子序列进行标记时,它们可能保持不带标记的状态。实际上,如果聚类是在不相关的时间序列上完成的,则预期子序列不应收到标签。我们显示,基于模式的聚类确实是特定于我们测试的10个真实世界中的7个数据集的数据集,并且其窗口长度最多为128个时间点。虽然基于内核密度的聚类可用于查找具有8-16个时间点的窗口大小的相似属性的集群,但其性能会随着窗口大小的增加而迅速降低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号