Research on a Block-wise Imputation Algorithm for High-Dimensional Correlated Missing Data

Yang Jie (杨杰); Yang Hu (杨虎); Wang Lubin (王鲁滨); Jin Xin (金鑫); Guo Hua (郭华); Yu Liangliang (于亮亮)

Abstract

This paper studies imputation methods for high-dimensional missing data with correlated variables and proposes a block-wise imputation algorithm. The core idea is to take the relationships among variables into account during imputation: each missing value is filled using only the data correlated with it, which reduces the influence of uncorrelated data and improves imputation accuracy. The algorithm can also process missing data in parallel, which improves imputation efficiency and makes it well suited to high-dimensional missing data. To partition missing data whose block structure is unknown, the paper further proposes a blocking algorithm based on k-means clustering. Extensive simulation experiments and experiments on real data sets show that, for correlated data, the block-wise imputation algorithm makes effective use of the correlation information and thereby improves imputation accuracy.
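The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' method. It assumes that variables (columns) are grouped by k-means on their absolute-correlation profiles, that each missing value is then predicted by a least-squares regression on the other columns of its own block, and that every column has at least one observed value. The function names `cluster_columns`, `impute_block`, and `blockwise_impute`, the farthest-first initialization, and the regression step are all illustrative choices.

```python
import numpy as np


def _assign(profiles, centers):
    # Squared Euclidean distance from every profile to every center.
    d = ((profiles[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d.argmin(axis=1)


def cluster_columns(X, k, n_iter=50):
    """Group the columns of X into k blocks (hypothetical helper).

    Each column is described by its vector of absolute correlations with
    all columns, computed on a mean-filled copy (an assumption made here
    for simplicity), and these profiles are clustered with plain k-means.
    """
    X = np.asarray(X, dtype=float)
    filled = np.where(np.isnan(X), np.nanmean(X, axis=0), X)
    profiles = np.abs(np.corrcoef(filled, rowvar=False))
    # Deterministic farthest-first initialization of the k centers.
    centers = [profiles[0]]
    for _ in range(1, k):
        d = np.min([((profiles - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(profiles[d.argmax()])
    centers = np.array(centers)
    labels = _assign(profiles, centers)
    for _ in range(n_iter):
        # Recompute each center; keep the old one if a cluster is empty.
        centers = np.array([
            profiles[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        new_labels = _assign(profiles, centers)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels


def impute_block(block):
    """Fill NaNs in one block using only that block's own columns."""
    filled = np.where(np.isnan(block), np.nanmean(block, axis=0), block)
    out = filled.copy()
    if block.shape[1] == 1:
        return out  # a single-column block falls back to mean imputation
    for j in range(block.shape[1]):
        miss = np.isnan(block[:, j])
        if not miss.any() or miss.all():
            continue
        # Regress column j on the other (mean-filled) columns of the block,
        # fitting only on rows where column j is observed.
        others = np.delete(filled, j, axis=1)
        A = np.column_stack([np.ones(len(others)), others])
        coef, *_ = np.linalg.lstsq(A[~miss], block[~miss, j], rcond=None)
        out[miss, j] = A[miss] @ coef
    return out


def blockwise_impute(X, labels):
    """Impute every block independently and reassemble the matrix."""
    X = np.asarray(X, dtype=float)
    out = X.copy()
    for b in np.unique(labels):
        cols = np.flatnonzero(labels == b)
        out[:, cols] = impute_block(X[:, cols])
    return out
```

Because each block is imputed without reference to any other block, the loop in `blockwise_impute` could be dispatched to a worker pool, which is one way to read the parallelism claimed in the abstract.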

Bibliographic details

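Source
计算机科学与探索 (Journal of Frontiers of Computer Science and Technology) | 2017, No. 10 | pp. 1557-1569 | 13 pages
Authors
Yang Jie (杨杰); Yang Hu (杨虎); Wang Lubin (王鲁滨); Jin Xin (金鑫); Guo Hua (郭华); Yu Liangliang (于亮亮)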
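Author affiliations

School of Information, Central University of Finance and Economics, Beijing 100081;

School of Information, Central University of Finance and Economics, Beijing 100081;

School of Information, Central University of Finance and Economics, Beijing 100081;

School of Information, Central University of Finance and Economics, Beijing 100081;

Information and Communication Branch, State Grid Jingzhou Power Supply Company, Jingzhou, Hubei 434000;

Information and Communication Branch, State Grid Liaoning Electric Power Co., Ltd., Shenyang 110000;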

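Original format: PDF
Language of text: Chinese
CLC classification: programming, software engineering
Keywords
high-dimensional correlated data; missing data; block-wise imputation algorithm
Indexed: 2023-07-25 21:27:43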

