首页> 美国卫生研究院文献>Heliyon >On linear dimension reduction based on diagonalization of scatter matrices for bioinformatics downstream analyses
【2h】

On linear dimension reduction based on diagonalization of scatter matrices for bioinformatics downstream analyses

机译:基于生物信息学下游分析的散射矩阵对角化的线性尺寸减小

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Dimension reduction is often a preliminary step in the analysis of data sets with a large number of variables. Most classical, both supervised and unsupervised, dimension reduction methods such as principal component analysis (PCA), independent component analysis (ICA) or sliced inverse regression (SIR) can be formulated using one, two or several different scatter matrix functionals. Scatter matrices can be seen as different measures of multivariate dispersion and might highlight different features of the data and when compared might reveal interesting structures. Such analysis then searches for a projection onto an interesting (signal) part of the data, and it is also important to know the correct dimension of the signal subspace. These approaches usually make either no model assumptions or work in wide classes of semiparametric models. Theoretical results in the literature are however limited to the case where the sample size exceeds the number of variables which is hardly ever true for data sets encountered in bioinformatics. In this paper, we briefly review the relevant literature and explore if the dimension reduction tools can be used to find relevant and interesting subspaces for small-n-large-p data sets. We illustrate the methods with a microarray dataset of prostate cancer patients and healthy controls.
机译:尺寸减小通常是分析具有大量变量的数据集的初步步骤。可以使用一个,两个或几个不同的散射矩阵功能来配制大多数经典的监督和无监督和无监督的尺寸减少方法,例如主成分分析(PCA),独立分析分析(ICA)或切片逆回归(SIR)。散射矩阵可以被视为多元色散的不同措施,并且可以突出数据的不同特征,并且比较可能会揭示有趣的结构。然后,这样的分析将投影搜索到数据的有趣(信号)部分,并且对于知道信号子空间的正确维度也很重要。这些方法通常在宽的半甲型模型中没有模型假设或工作。然而,文献中的理论结果仅限于样本大小超过变量数量的变量,这对于在生物信息学中遇到的数据集几乎没有真实的情况。在本文中,我们简要介绍了相关的文献,并探索了尺寸减少工具,如果尺寸减少工具可用于查找用于小型N大P数据集的相关和有趣的子空间。我们说明了具有前列腺癌患者和健康对照的微阵列数据集的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号