...
首页> 外文期刊>Statistical Analysis and Data Mining >Symbolic Missing Data Imputation in Principal Component Analysis
【24h】

Symbolic Missing Data Imputation in Principal Component Analysis

机译:主成分分析中的符号缺失数据插补

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The concept of symbolic data has been developed with the aim of representing variables whose measurement is affected by some internal variation. This idea has been mainly concerned with the need of aggregating individuals in order to summarize large datasets into smaller matrices of manageable size, retaining as much of the original knowledge as possible. Nevertheless it is often applied also with variables structured from their outset as symbolic variables, although measured on single individuals. This paper deals with the latter framework, and aims at showing that symbolic data analysis techniques can be applied to the field of missing values treatment. The algorithm for a symbolic imputation technique in principal component analysis is presented as a generalization of the basic strategy called interval imputation. An illustrative example and a real data case study show how the proposed technique works.
机译:已经开发出符号数据的概念,其目的是代表变量的度量受某些内部变化影响。这个想法主要涉及聚集个人以便将大型数据集汇总为可管理大小的较小矩阵的需求,同时保留尽可能多的原始知识。然而,尽管对单个个体进行了测量,但通常也将其从一开始就构造为符号变量的变量也应用。本文讨论了后者的框架,旨在表明符号数据分析技术可以应用于缺失值处理领域。主成分分析中的符号插补技术算法是对称为区间插补的基本策略的概括。一个说明性示例和一个实际数据案例研究显示了所提出的技术如何工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号