首页> 外文会议>Asian conference on remote sensing;ACRS >Identifying the Most Suitable Representation Method for Heterogeneous Time Series Data
【24h】

Identifying the Most Suitable Representation Method for Heterogeneous Time Series Data

机译:确定异构时间序列数据的最合适表示方法

获取原文

摘要

A time series data is a collection of measurements obtained sequentially, which is common in many application domains, e.g.. fluctuations of stock market, observations from sensor networks, medical and biological signals. Since time series data usually contains large number of data points, i.e., high-dimensionality, directly dealing with such data in its raw format is very expensive in terms of processing and storage loading. To effectively and efficiently manage time series data, several representation methods were discussed. Representation methods can reduce the dimensionality of a time series data while preserving its fundamental characteristics. However, each method has its own drawbacks and is most suitable for certain time series data types, which means no single method is efficient enough for all possible types. To address this issue, this study aims at proposing a system that can identify the most suitable representation method for different types of time series. To be specific, this study first proposes a time series clustering approach to cluster sample time series datasets to identify different types of time series. We then conduct an extensive performance evaluation by testing the performance of different representation methods on the clustered time series types. Based on the evaluation, the most suitable representation methods for certain clusters can be identified. With a new time series input, the system can first classify this time series by computing its similarities with clustered time series types, which indirectly helps us identify the representation method that is the most suitable for this new time series data. Finally, evaluation result shows that there are three types of representations are the most suitable representation for different time series types respectively.
机译:时间序列数据是按顺序获得的测量值的集合,这在许多应用领域都很常见,例如,股票市场的波动,来自传感器网络的观察,医学和生物信号。由于时间序列数据通常包含大量数据点,即高维,因此以原始格式直接处理此类数据在处理和存储负载方面非常昂贵。为了有效和高效地管理时间序列数据,讨论了几种表示方法。表示方法可以减少时间序列数据的维数,同时保留其基本特征。但是,每种方法都有其自身的缺点,并且最适用于某些时间序列数据类型,这意味着没有一种方法对所有可能的类型都足够有效。为了解决这个问题,本研究旨在提出一种系统,该系统可以为不同类型的时间序列确定最合适的表示方法。具体而言,本研究首先提出了一种时间序列聚类方法,以对样本时间序列数据集进行聚类,以识别不同类型的时间序列。然后,我们通过测试聚类时间序列类型上不同表示方法的性能来进行广泛的性能评估。基于评估,可以确定某些群集的最合适的表示方法。使用新的时间序列输入,系统可以首先通过计算其与聚类时间序列类型的相似性来对该时间序列进行分类,这间接地帮助我们确定最适合此新时间序列数据的表示方法。最后,评估结果表明,三种表示形式分别是针对不同时间序列类型的最合适的表示形式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号