首页> 外文会议>IEEE 5th International Bio-Inspired Computing: Theories and Applications >A framework for high dimensional data reduction in the microarray domain
【24h】

A framework for high dimensional data reduction in the microarray domain

机译:在微阵列领域进行高维数据缩减的框架

获取原文
获取外文期刊封面目录资料

摘要

Microarray analysis and visualization is very helpful for biologists and clinicians to understand gene expression in cells and to facilitate diagnosis and treatment of patients. However, a typical microarray dataset has thousands of features and a very small number of observations. This very high dimensional data has a massive amount of information which often contains some noise, non-useful information and small number of relevant features for disease or genotype. This paper proposes a framework for very high dimensional data reduction based on three technologies: feature selection, linear dimensionality reduction and non-linear dimensionality reduction. In this paper, feature selection based on mutual information will be proposed for filtering features and selecting the most relevant features with the minimum redundancy. A kernel linear dimensionality reduction method is also used to extract the latent variables from a high dimensional data set. In addition, a non-linear dimensionality reduction based on local linear embedding is used to reduce the dimension and visualize the data. Experimental results are presented to show the outputs of each step and the efficiency of this framework.
机译:微阵列分析和可视化对于生物学家和临床医生了解细胞中的基因表达并促进患者的诊断和治疗非常有帮助。但是,典型的微阵列数据集具有数千个特征和很少的观测值。这种非常高维的数据具有大量信息,其中通常包含一些噪声,无用信息以及少量疾病或基因型的相关特征。本文提出了一种基于三种技术的超高维数据约简框架:特征选择,线性维约化和非线性维约化。在本文中,将提出基于互信息的特征选择,以过滤特征并以最小的冗余度选择最相关的特征。核线性降维方法也用于从高维数据集中提取潜在变量。另外,基于局部线性嵌入的非线性降维被用于减小维数并可视化数据。给出实验结果以显示每个步骤的输出以及该框架的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号