首页> 外文会议>IEEE international conference on data engineering >Memory-efficient centroid decomposition for long time series
【24h】

Memory-efficient centroid decomposition for long time series

机译:长时间序列的记忆有效质心分解

获取原文

摘要

Real world applications that deal with time series data often rely on matrix decomposition techniques, such as the Singular Value Decomposition (SVD). The Centroid Decomposition (CD) approximates the Singular Value Decomposition, but does not scale to long time series because of the quadratic space complexity of the sign vector computation. In this paper, we propose a greedy algorithm, termed Scalable Sign Vector (SSV), to efficiently determine sign vectors for CD applications with long time series, i.e., where the number of rows (observations) is much larger than the number of columns (time series). The SSV algorithm starts with a sign vector consisting of only 1s and iteratively changes the sign of the element that maximizes the benefit. The space complexity of the SSV algorithm is linear in the length of the time series. We provide proofs for the scalability, the termination and the correctness of the SSV algorithm. Experiments with real world hydrological time series and data sets from the UCR repository validate the analytical results and show the scalability of SSV.
机译:处理时间序列数据的实际应用程序通常依赖于矩阵分解技术,例如奇异值分解(SVD)。质心分解(CD)近似于奇异值分解,但由于符号矢量计算的二次空间复杂性,因此无法缩放到长时间序列。在本文中,我们提出了一种贪婪算法,称为可伸缩符号向量(SSV),以有效地确定具有较长时间序列的CD应用程序的符号向量,即行数(观察数)远大于列数(时间序列)。 SSV算法从仅由1组成的符号向量开始,然后迭代地更改元素的符号以最大程度地提高收益。 SSV算法的空间复杂度在时间序列的长度上是线性的。我们提供了SSV算法的可扩展性,终止性和正确性的证明。使用UCR资料库中的现实世界水文时间序列和数据集进行的实验验证了分析结果,并显示了SSV的可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号