...
首页> 外文期刊>BioTechniques >Sequential projection pursuit principal component analysis - dealing with missing data associated with new-omics technologies
【24h】

Sequential projection pursuit principal component analysis - dealing with missing data associated with new-omics technologies

机译:顺序投影追踪主成分分析-处理与新组学技术相关的缺失数据

获取原文
获取原文并翻译 | 示例
           

摘要

Principal Component Analysis (PCA) is a common exploratory tool used to evaluate large complex data sets. The resulting lower-dimensional representations are often valuable for pattern visualization, clustering, or classification of the data. However, PCA cannot be applied directly to many -omics data sets generated by newer technologies such as label-free mass spectrometry due to large numbers of non-random missing values. Here we present a sequential projection pursuit PCA (sppPCA) method for defining principal components in the presence of missing data. Our results demonstrate that this approach generates robust and informative low-dimensional data representations compared to commonly used imputation approaches.
机译:主成分分析(PCA)是用于评估大型复杂数据集的常用探索工具。所得的低维表示形式通常对于数据的模式可视化,聚类或分类很有价值。但是,由于大量的非随机缺失值,因此PCA无法直接应用于由较新技术(例如无标签质谱)生成的许多组学数据集。在这里,我们提出了一种顺序投影追踪PCA(sppPCA)方法,用于在缺少数据的情况下定义主要成分。我们的结果表明,与常用的插补方法相比,此方法可生成健壮且内容丰富的低维数据表示形式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号