首页> 外文会议>European Conference on Machine Learning and Knowledge Discovery in Databases;ECML PKDD 2008 >Two Heads Better Than One: Pattern Discovery in Time-Evolving Multi-aspect Data
【24h】

Two Heads Better Than One: Pattern Discovery in Time-Evolving Multi-aspect Data

机译:两个头比一个头好:随时间变化的多方面数据中的模式发现

获取原文

摘要

Data stream values are often associated with multiple aspects. For example, each value observed at a given time-stamp from environmental sensors may have an associated type (e.g., temperature, humidity, etc) as well as location. Time-stamp, type and location are the three aspects, which can be modeled using a tensor (high-order array). However, the time aspect is special, with a natural ordering, and with successive time-ticks having usually correlated values. Standard mul-tiway analysis ignores this structure. To capture it, we propose 2 Heads Tensor Analysis (2-heads), which provides a qualitatively different treatment on time. Unlike most existing approaches that use a PCA-like summarization scheme for all aspects, 2-heads treats the time aspect carefully. 2-heads combines the power of classic multilinear analysis (PARAFAC [1], Tucker [5], DTA/STA [3], WTA [2]) with wavelets, leading to a powerful mining tool. Furthermore, 2-heads has several other advantages as well: (a) it can be computed incrementally in a streaming fashion, (b) it has a provable error guarantee and, (c) it achieves significant compression ratio against competitors. Finally, we show experiments on real datasets, and we illustrate how 2-heads reveals interesting trends in the data. This is an extended abstract of an article published in the Data Mining and Knowledge Discovery journal [4].
机译:数据流值通常与多个方面相关联。例如,在给定时间戳下从环境传感器观察到的每个值可以具有相关的类型(例如,温度,湿度等)以及位置。时间戳,类型和位置是三个方面,可以使用张量(高阶数组)进行建模。但是,时间方面是特殊的,具有自然顺序,并且连续的时间间隔通常具有相关的值。标准多路分析忽略了这种结构。为了捕获它,我们建议使用2头张量分析(2头),它在时间上提供了质上不同的处理。与大多数在所有方面都使用类似PCA的摘要方案的现有方法不同,2头仔细地对待时间方面。 2头将经典的多线性分析(PARAFAC [1],Tucker [5],DTA / STA [3],WTA [2])的强大功能与小波相结合,从而形成了功能强大的挖掘工具。此外,2头还具有其他几个优点:(a)可以以流方式递增计算;(b)可以证明的错误保证;(c)相对于竞争对手获得显着的压缩率。最后,我们展示了在真实数据集上的实验,并展示了2头如何揭示数据中有趣的趋势。这是在《数据挖掘和知识发现》杂志[4]上发表的文章的扩展摘要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号