首页> 美国卫生研究院文献>other >Combining Fourier and Lagged k-Nearest Neighbor Imputation for Biomedical Time Series Data

【2h】

Combining Fourier and Lagged k-Nearest Neighbor Imputation for Biomedical Time Series Data

机译：结合傅里叶和滞后k最近邻插补的生物医学时间序列数据

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Most clinical and biomedical data contain missing values. A patient’s record may be split across multiple institutions, devices may fail, and sensors may not be worn at all times. While these missing values are often ignored, this can lead to bias and error when the data are mined. Further, the data are not simply missing at random. Instead the measurement of a variable such as blood glucose may depend on its prior values as well as that of other variables. These dependencies exist across time as well, but current methods have yet to incorporate these temporal relationships as well as multiple types of missingness. To address this, we propose an imputation method (FLk-NN) that incorporates time lagged correlations both within and across variables by combining two imputation methods, based on an extension to k-NN and the Fourier transform. This enables imputation of missing values even when all data at a time point is missing and when there are different types of missingness both within and across variables. In comparison to other approaches on three biological datasets (simulated and actual Type 1 diabetes datasets, and multi-modality neurological ICU monitoring) the proposed method has the highest imputation accuracy. This was true for up to half the data being missing and when consecutive missing values are a significant fraction of the overall time series length.

机译：大多数临床和生物医学数据均包含缺失值。患者的记录可能分散在多个机构中，设备可能会发生故障，并且传感器可能不会始终佩戴。尽管通常会忽略这些丢失的值，但在挖掘数据时会导致偏差和错误。此外，数据并非简单地随机丢失。相反，诸如血糖之类的变量的测量可能取决于其先前值以及其他变量的值。这些依赖关系也跨时间存在，但是当前的方法尚未结合这些时间关系以及多种类型的缺失。为了解决这个问题，我们提出了一种插补方法（FLk-NN），它基于对k-NN的扩展和傅立叶变换，通过组合两种插补方法，在变量内部和变量之间结合了时间滞后的相关性。即使一个时间点的所有数据都丢失，并且变量内部和变量之间都存在不同类型的缺失，这也可以估算缺失值。与在三个生物学数据集（模拟和实际的1型糖尿病数据集，以及多模态神经病学ICU监测）上的其他方法相比，该方法具有最高的估算准确性。对于多达一半的数据丢失以及连续的丢失值占整个时间序列长度的很大一部分来说，这是正确的。

著录项

期刊名称 other
作者
Shah Atiqur Rahman; Yuxiao Huang; Jan Claassen; Nathaniel Heintzman; Samantha Kleinberg;
展开▼
作者单位

展开▼
年(卷),期 -1(58),-1
年度 -1
页码 198–207
总页数 35
原文格式 PDF
正文语种
中图分类
关键词
missing data imputation time series biomedical data;

机译：缺失数据;估算;时间序列;生物医学数据;

相似文献

外文文献
中文文献
专利

1. Combining Fourier and lagged k-nearest neighbor imputation for biomedical time series data [J] . Journal of biomedical informatics. . 2015 ,第Null期

机译：结合傅里叶和滞后k近邻插值获得生物医学时间序列数据
2. APPLICATION OF EMPIRICAL MODE DECOMPOSITION COMBINED WITH k-NEAREST NEIGHBORS APPROACH IN FINANCIAL TIME SERIES FORECASTING [J] . AIJING LIN∗ PENGJIAN SHANGGUOCHEN FENG and BO ZHONG Fluctuation and Noise Letters . 2012 ,第2期

机译：经验模态分解与k-近邻法相结合在财务时间序列预测中的应用
3. APPLICATION OF EMPIRICAL MODE DECOMPOSITION COMBINED WITH k-NEAREST NEIGHBORS APPROACH IN FINANCIAL TIME SERIES FORECASTING [J] . AIJING LIN, PENGJIAN SHANG, GUOCHEN FENG, Fluctuation and Noise Letters: FNL: An Interdisciplinary Scientific Journal on Random Processes in Physical, Biological and Technological Systems . 2012 ,第2期

机译：经验模态分解与k-近邻法相结合在财务时间序列预测中的应用
4. Discretization of Time Series Dataset Using Relative Frequency and K-Nearest Neighbor Approach [C] . Azuraliza Abu Bakar, Almahdi Mohammed Ahmed, Abdul Razak Hamdan International conference on advanced data mining and applications;ADMA 2010 . 2010

机译：使用相对频率和K最近邻方法离散化时间序列数据集
5. Statistical Modeling of Carbon Dioxide and Cluster Analysis of Time Dependent Information: Lag Target Time Series Clustering, Multi-Factor Time Series Clustering, and Multi-Level Time Series Clustering [D] . Kim, Doo Young. 2016

机译：二氧化碳的统计建模和时间相关信息的聚类分析：滞后目标时间序列聚类，多因素时间序列聚类和多级时间序列聚类
6. 3D QSAR STUDIES ON A SERIES OF QUINAZOLINE DERRIVATIVES AS TYROSINE KINASE (EGFR) INHIBITOR: THE K-NEAREST NEIGHBOR MOLECULAR FIELD ANALYSIS APPROACH [O] . Malleshappa N. Noolvi, Harun M. Patel 2010

机译：一系列喹唑啉衍生物作为酪氨酸激酶（EGFR）抑制剂的3D QSAR研究：K-近邻神经分子分析方法
7. Combining Fourier and lagged k-nearest neighbor imputation for biomedical time series data [O] . Rahman Shah Atiqur, Huang Yuxiao, Claassen Jan, 2015

机译：结合傅里叶和滞后k近邻插值获得生物医学时间序列数据

Combining Fourier and Lagged k-Nearest Neighbor Imputation for Biomedical Time Series Data

摘要

著录项

相似文献

相关主题

期刊订阅