首页> 外文会议>European Conference on Machine Learning and Knowledge Discovery in Databases >Two Heads Better Than One: Pattern Discovery in Time-Evolving Multi-aspect Data
【24h】

Two Heads Better Than One: Pattern Discovery in Time-Evolving Multi-aspect Data

机译:两个头优于一个:模式发现在时间不断发展的多方面数据

获取原文
获取外文期刊封面目录资料

摘要

Data stream values are often associated with multiple aspects. For example, each value observed at a given time-stamp from environmental sensors may have an associated type (e.g., temperature, humidity, etc) as well as location. Time-stamp, type and location are the three aspects, which can be modeled using a tensor (high-order array). However, the time aspect is special, with a natural ordering, and with successive time-ticks having usually correlated values. Standard mul-tiway analysis ignores this structure. To capture it, we propose 2 Heads Tensor Analysis (2-heads), which provides a qualitatively different treatment on time. Unlike most existing approaches that use a PCA-like summarization scheme for all aspects, 2-heads treats the time aspect carefully. 2-heads combines the power of classic multilinear analysis (PARAFAC [1], Tucker [5], DTA/STA [3], WTA [2]) with wavelets, leading to a powerful mining tool. Furthermore, 2-heads has several other advantages as well: (a) it can be computed incrementally in a streaming fashion, (b) it has a provable error guarantee and, (c) it achieves significant compression ratio against competitors. Finally, we show experiments on real datasets, and we illustrate how 2-heads reveals interesting trends in the data. This is an extended abstract of an article published in the Data Mining and Knowledge Discovery journal [4].
机译:数据流值通常与多个方面相关联。例如,在来自环境传感器的给定时间戳处观察到的每个值可以具有相关的类型(例如,温度,湿度等)以及位置。时间戳,类型和位置是三个方面,可以使用张量(高阶阵列)进行建模。然而,时间方面是特殊的,具有自然排序,并且具有通常相关值的连续时间刻度。标准MUL-TIWAY分析忽略了这种结构。为了捕获它,我们提出了2个张力分析(2头),其按时提供定性不同的治疗。与所有使用PCA的摘要方案的最现有方法不同,2头仔细处理时间方面。 2头相结合了经典多线性分析的功率(ParaFac [1],Tucker [5],DTA / STA [3],WTA [2]),与小波,导致强大的采矿工具。此外,2头具有若干其他优点:(a)它可以以流式方式逐步计算,(b)它具有可提供的错误保证,并且(c)它实现了对竞争对手的显着压缩比。最后,我们在真实数据集上显示实验,我们说明了2头标题如何揭示数据中有趣的趋势。这是在数据挖掘和知识发现期刊中发表的文章的扩展摘要[4]。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号