A framework for high dimensional data reduction in the microarray domain

Abstract

Microarray analysis and visualization help biologists and clinicians understand gene expression in cells and support the diagnosis and treatment of patients. However, a typical microarray dataset has thousands of features and very few observations. Such very high dimensional data carries a large amount of information, which often includes noise and uninformative measurements alongside a small number of features relevant to a disease or genotype. This paper proposes a framework for very high dimensional data reduction based on three techniques: feature selection, linear dimensionality reduction and non-linear dimensionality reduction. Feature selection based on mutual information is used to filter features and select the most relevant ones with minimum redundancy. A kernel linear dimensionality reduction method is also used to extract latent variables from the high dimensional dataset. In addition, non-linear dimensionality reduction based on locally linear embedding is used to reduce the dimension and visualize the data. Experimental results show the output of each step and the efficiency of the framework.
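
The abstract does not state the exact selection criterion, so the following is only a minimal sketch of the first stage, assuming a greedy "maximum relevance, minimum redundancy" rule over discretized expression values. The function names (`mutual_information`, `select_features`) and the toy data are hypothetical and are not taken from the paper.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Mutual information (in bits) between two equal-length discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * log2((c / n) / ((px[x] / n) * (py[y] / n)))
        for (x, y), c in pxy.items()
    )


def select_features(columns, labels, k):
    """Greedily pick k features by relevance minus mean redundancy.

    columns: dict mapping a feature name to its list of discretized values.
    labels:  list of class labels, one per observation.
    This scoring rule is an assumption, not the paper's stated method.
    """
    relevance = {name: mutual_information(col, labels) for name, col in columns.items()}
    selected = []
    while len(selected) < min(k, len(columns)):
        best_name, best_score = None, float("-inf")
        for name, col in columns.items():
            if name in selected:
                continue
            # Mean mutual information with the features already chosen.
            redundancy = (
                sum(mutual_information(col, columns[s]) for s in selected) / len(selected)
                if selected
                else 0.0
            )
            score = relevance[name] - redundancy
            if score > best_score:
                best_name, best_score = name, score
        selected.append(best_name)
    return selected


# Toy data (hypothetical): the label is determined jointly by gene_a and gene_b.
labels = [0, 0, 1, 1, 2, 2, 3, 3]
columns = {
    "gene_a": [0, 0, 0, 0, 1, 1, 1, 1],
    "gene_a_copy": [0, 0, 0, 0, 1, 1, 1, 1],  # fully redundant with gene_a
    "gene_b": [0, 0, 1, 1, 0, 0, 1, 1],
    "noise": [0, 1, 0, 1, 0, 1, 0, 1],  # independent of the label
}
chosen = select_features(columns, labels, 2)
print(chosen)  # ['gene_a', 'gene_b']
```

On this toy input the duplicate of `gene_a` is skipped because its redundancy cancels its relevance, which is the behaviour the "minimum redundancy" wording suggests. Real microarray values are continuous, so a discretization step or a continuous mutual information estimator would be needed first.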

Bibliographic record

Source: IEEE 5th International Bio-Inspired Computing: Theories and Applications | 2010 | pp. 903-907 | 5 pages
Original format: PDF
CLC classification: Theory, methods
Keywords: Feature Selection; Linear Dimension Reduction; Non-Linear Dimension Reduction

Similar literature

1. Solution of the time-domain inverse resistivity problem in the model reduction framework part I. one-dimensional problem with SISO data [J]. Druskin V., Simoncini V., Zaslavsky M. SIAM Journal on Scientific Computing. 2013, No. 3

2. Rasch-based high-dimensionality data reduction and class prediction with applications to microarray gene expression data [J]. Andrej Kastrin, Borut Peterlin. Expert Systems with Applications. 2010, No. 7

3. Dimension reduction methods for microarrays with application to censored survival data [J]. Lexin Li, Hongzhe Li. Bioinformatics. 2004, No. 18

4. A framework for high dimensional data reduction in the microarray domain [C]. {missing} International Conference on Bio-Inspired Computing: Theories and Applications. 2010

5. A Differential Privacy-Based and Dimensionality Reduction Framework to Optimize the Generalizability of Machine Learning Techniques for High-Dimensional Biological Data [D]. Le, Trang T. 2017

6. Importance of data structure in comparing two dimension reduction methods for classification of microarray gene expression data [O]. Caroline Truntzer, Catherine Mercier, Jacques Estève. 2007

7. A Framework for High Dimensional Data Reduction in the Microarray Domain [O]. Ali Anaissi, Paul J. Kennedy, Madhu Goyal. 2010
