Clustering with Missing Values: No Imputation Required

Abstract

Clustering algorithms can identify groups in large data sets, such as star catalogs and hyperspectral images. In general, clustering methods cannot analyze items that have missing data values. Common solutions either fill in the missing values (imputation) or ignore the missing data (marginalization). Imputed values are treated as just as reliable as the truly observed data, but they are only as good as the assumptions used to create them. In contrast, we present a method for encoding partially observed features as a set of supplemental soft constraints and introduce the KSC algorithm, which incorporates constraints into the clustering process. In experiments on artificial data and data from the Sloan Digital Sky Survey, we show that soft constraints are an effective way to enable clustering with missing values.
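The two common workarounds named in the abstract can be illustrated with a minimal sketch. This is not the KSC algorithm, whose details the abstract does not give. It assumes missing values are encoded as NaN in a NumPy array, and the function names `mean_impute` and `marginalized_sq_dist` are hypothetical, chosen only for this illustration.

```python
import numpy as np


def mean_impute(X):
    """Imputation (assumed variant: column mean).

    Replaces each NaN with the mean of the observed values in its column.
    """
    col_means = np.nanmean(X, axis=0)
    return np.where(np.isnan(X), col_means, X)


def marginalized_sq_dist(x, center):
    """Marginalization: squared Euclidean distance over observed features only.

    Features that are NaN in x are skipped entirely.
    """
    observed = ~np.isnan(x)
    diff = x[observed] - center[observed]
    return float(np.dot(diff, diff))


# Toy data: the second item is missing its second feature.
X = np.array([
    [1.0, 2.0],
    [3.0, np.nan],
    [5.0, 6.0],
])
center = np.array([3.0, 10.0])  # a hypothetical cluster center

# Imputation invents a value (the column mean) and then treats it as observed.
X_imputed = mean_impute(X)
dist_imputed = float(np.sum((X_imputed[1] - center) ** 2))

# Marginalization ignores the missing feature altogether.
dist_marginalized = marginalized_sq_dist(X[1], center)

print(X_imputed[1], dist_imputed, dist_marginalized)
```

In this toy case the imputed value dominates the distance to the center, while the marginalized distance ignores the missing feature completely. That gap is the kind of sensitivity to imputation assumptions the abstract warns about.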

Bibliographic details

Author: Wagstaff, Kiri
Year: 2004
Pages: 1-10
Total pages: 10
Original format: PDF
Language: English
Chinese Library Classification: Industrial technology
Keywords:
ALGORITHMS; CLUSTER ANALYSIS; MULTISENSOR FUSION; CONSTRAINTS; clustering; constraints; missing values; data analysis; CODING; SKY SURVEYS (ASTRONOMY); DATA PROCESSING; ACCURACY;

Similar literature

1. Scalable and Accurate Missing Value Imputation with Least-Missing-Column-Values-Impute-First and K-NN Clustering Strategies [J]. Advanced Science Letters. 2016, No. 10
2. Order-Sensitive Imputation for Clustered Missing Values [J]. Qian Ma, Yu Gu, Wang-Chien Lee. IEEE Transactions on Knowledge and Data Engineering. 2019, No. 1
3. Multiple Imputation based Clustering Validation (MIV) for Big Longitudinal Trial Data with Missing Values in eHealth [J]. Zhang Zhaoyang, Fang Hua, Wang Honggang. Journal of Medical Systems. 2016, No. 6
4. Order-Sensitive Imputation for Clustered Missing Values (Extended Abstract) [C]. Qian Ma, Yu Gu, Wang-Chien Lee. IEEE International Conference on Data Engineering. 2019
5. GRU-DF: An RNN Model with Dynamic Imputation for Missing Values in Multivariate Time Series [D]. Berretta Magarinos, Matias Bartolome. 2019
6. Multiple Imputation based Clustering Validation (MIV) for Big Longitudinal Trial Data with Missing Values in eHealth [O]. Zhaoyang Zhang, Hua Fang, Honggang Wang
7. Multiple Imputation based Clustering Validation (MIV) for Big Longitudinal Trial Data with Missing Values in eHealth [O]. Zhaoyang Zhang, Hua Fang, Honggang Wang. 2016
