首页> 外文会议>Annual KES conference >Using Differential Evolution to Set Weights to Segments with Different Information Content in the Piecewise Aggregate Approximation
【24h】

Using Differential Evolution to Set Weights to Segments with Different Information Content in the Piecewise Aggregate Approximation

机译:使用微分演化为分段聚合逼近中具有不同信息内容的段设置权重

获取原文

摘要

Time series mining handles several tasks such as classification, clustering and similarity search. These data are high-dimensional in nature so time series representation methods are widely used to reduce the dimensionality of these data so that they can be handled efficiently and effectively. One of the side effects of using representation methods is the loss of information which results from the dimensionality reduction implied in the representation methods. Several representation methods have pointed out that some regions in the times series may contain more information than others so a faithful representation method should be able to reflect the different information contents in different regions of a time series. One of the techniques that can be utilized for this purpose is to set different weights to different regions according to the information they contain, but the challenge is to find an objective scheme to set the weights. Differential evolution is an efficient optimizer that has been successfully used to solve many optimization problems, mainly continuous ones. In this paper we show how differential evolution can be used to set weights to different segments of time series according to their information content. Although our scheme establishes a fully functional time series representation method, with lower bounding distance and a dimensionality reduction technique, we consider this as a by-product of our work and our main aim is to show how the information contents of different time series segments can be reflected using unconventional methods such as the differential evolution. We compare the new scheme with the piecewise aggregate approximation as a method that completely lacks the ability to distinguish regions with high information from others with low information. We show how the new scheme can recover the loss of information caused by dimensionality reduction. We validate our scheme by experiments on different datasets.
机译:时间序列挖掘处理多项任务,例如分类,聚类和相似性搜索。这些数据本质上是高维的,因此广泛使用时间序列表示方法来降低这些数据的维数,以便可以高效地对其进行处理。使用表示方法的副作用之一是信息丢失,这是由表示方法中隐含的降维导致的。几种表示方法已经指出,时间序列中的某些区域可能比其他区域包含更多的信息,因此,忠实的表示方法应该能够反映时间序列的不同区域中的不同信息内容。可以用于此目的的一种技术是根据它们包含的信息为不同区域设置不同的权重,但是挑战是找到一种设置权重的客观方案。差异进化是一种有效的优化器,已成功用于解决许多优化问题,主要是连续的问题。在本文中,我们展示了如何使用差分进化根据权重的信息内容为时间序列的不同部分设置权重。尽管我们的方案建立了功能齐全的时间序列表示方法,但具有较低的边界距离和降维技术,但我们认为这是我们工作的副产品,我们的主要目的是展示如何区分不同时间序列段的信息内容使用非常规方法(例如差异演化)反映出来。我们将新方案与分段聚集近似法进行比较,因为该方法完全缺乏将信息量高的区域与信息量低的其他区域区分开的能力。我们展示了新方案如何弥补因降维而导致的信息丢失。我们通过对不同数据集进行实验来验证我们的方案。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号