Computational intelligence methods for processing misaligned, unevenly sampled time series containing missing data

Abstract

One consequence of the increasing amount of data stored during acquisition processes is that sampled time series are more likely to be collected in a misaligned, uneven fashion and/or to be partly lost or unavailable (missing data). Because these problems severely affect data mining techniques, this work proposes methods to (a) align misaligned, unevenly sampled data, (b) distinguish absent values caused by low sampling frequencies from those resulting from missingness mechanisms, and (c) classify recoverable and non-recoverable segments of missing data using statistical and fuzzy modeling approaches. These methods were evaluated on randomly simulated test datasets containing different amounts of missing data. Results show that: (1) using the most frequently sampled variable as a template, combined with cubic interpolation, made it possible to realign misaligned, unevenly sampled data without significant errors; (2) absent values due to low sampling frequencies can be successfully distinguished from truly missing values using 95% confidence intervals relative to the mean sampling time; (3) fuzzy modeling returned better classification results for recoverable segments, while the statistical approach performed better in classifying non-recoverable segments. The performance of all three methods decreased as the amount of missing data in the test datasets increased.
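The abstract gives no implementation details, so the following is only a rough sketch of results (1) and (2), not the authors' code. All function names are hypothetical. Two assumptions are made: "cubic interpolation" is approximated by a local four-point Lagrange cubic, and the "95% confidence interval relative to the mean sampling time" is taken to be mean + 1.96 standard deviations of the observed sampling intervals.

```python
import bisect
from statistics import mean, stdev


def pick_template(series):
    """Return the name of the variable with the most samples.

    `series` maps a variable name to a (times, values) pair.
    """
    return max(series, key=lambda name: len(series[name][0]))


def lagrange_cubic(xs, ys, x):
    """Evaluate at x the cubic polynomial through four points (xs, ys)."""
    total = 0.0
    for i in range(4):
        term = ys[i]
        for j in range(4):
            if j != i:
                term *= (x - xs[j]) / (xs[i] - xs[j])
        total += term
    return total


def align_to_template(template_times, times, values):
    """Resample (times, values) onto template_times.

    Assumption: a local four-point cubic stands in for the paper's
    unspecified cubic interpolation. Template times outside the sampled
    range yield None instead of being extrapolated.
    """
    n = len(times)
    if n < 4:
        raise ValueError("need at least four samples for a cubic")
    aligned = []
    for t in template_times:
        if t < times[0] or t > times[-1]:
            aligned.append(None)
            continue
        k = bisect.bisect_left(times, t)      # first sample at or after t
        start = min(max(k - 2, 0), n - 4)     # window of four neighbours
        aligned.append(
            lagrange_cubic(times[start:start + 4], values[start:start + 4], t)
        )
    return aligned


def flag_missing_gaps(times, z=1.96):
    """Return (start, end) pairs whose gap exceeds mean + z * stdev.

    Assumption: gaps above this bound are treated as truly missing data;
    shorter gaps are attributed to a low sampling frequency.
    """
    gaps = [b - a for a, b in zip(times, times[1:])]
    if len(gaps) < 2:
        raise ValueError("need at least three samples")
    upper = mean(gaps) + z * stdev(gaps)
    return [(a, b) for a, b in zip(times, times[1:]) if b - a > upper]
```

In this sketch, a less frequently sampled variable would be passed to `align_to_template` together with the time stamps of the variable chosen by `pick_template`, and `flag_missing_gaps` would then be run on each variable's own time stamps. Note that a very long gap inflates both the mean and the standard deviation, so the threshold is only indicative when a series contains many missing segments.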

Bibliographic record

Source: 2011 IEEE Symposium on Computational Intelligence and Data Mining | 2011 | pp. 224-231 | 8 pages
Authors: Cismondi Federico; Fialho Andre S.; Vieira Susana M.; Sousa Joao M.C.; Reti Shane R.; Howell Michael D.; Finkelstein Stan N.
Original format: PDF
Chinese Library Classification: Artificial intelligence theory

Similar literature

1. Computational intelligence methods for data mining of causality extent in the time series [J]. Lukáš Pichl, Taisei Kaizoji. International Journal of Computational Science and Engineering. 2018, No. 4
2. Data Missing Mechanism and Missing Data Real-Time Processing Methods in the Construction Monitoring of Steel Structures [J]. Luo Y. F., Ye Z. W., Guo X. N. Advances in Structural Engineering. 2015, No. 4
3. Imputing missing values in unevenly spaced clinical time series data to build an effective temporal classification framework [J]. Nancy Jane Y., Khanna Nehemiah H., Arputharaj Kannan. Computational Statistics & Data Analysis. 2017
4. Computational intelligence methods for processing misaligned, unevenly sampled time series containing missing data [C]. Cismondi Federico, Fialho Andre S., Vieira Susana M. IEEE Symposium on Computational Intelligence and Data Mining. 2011
5. A comparison of computational methods to calculate effective connectivity from functional magnetic resonance imaging time series data [D]. Witt, Suzanne T. 2008
6. Spectral estimation in unevenly sampled space of periodically expressed microarray time series data [O]. Alan Wee-Chung Liew, Jun Xian, Shuanhu Wu. 2007
7. Imputation of missing data in time series by different computation methods in various data set applications [O]. Dhiraj Magare, Sushil Labde, Manoj Gofane. 2020
8. Approaches in Highly Parameterized Inversion: TSPROC, a General Time-Series Processor to Assist in Model Calibration and Results Summarization. Chapter 7 of Section C, Computer Programs, Book 7, Automated Data Processing and Computations. Great Lakes Rest [R]. Westenbroek, S. M., Doherty, J., Walker, J. F. 2012
