...
首页> 外文期刊>PLoS Computational Biology >mixOmics: An R package for ‘omics feature selection and multiple data integration
【24h】

mixOmics: An R package for ‘omics feature selection and multiple data integration

机译:mixOmics:R组,用于“组学功能选择和多数据集成

获取原文

摘要

The advent of high throughput technologies has led to a wealth of publicly available ‘omics data coming from different sources, such as transcriptomics, proteomics, metabolomics. Combining such large-scale biological data sets can lead to the discovery of important biological insights, provided that relevant information can be extracted in a holistic manner. Current statistical approaches have been focusing on identifying small subsets of molecules (a ‘molecular signature’) to explain or predict biological conditions, but mainly for a single type of ‘omics. In addition, commonly used methods are univariate and consider each biological feature independently. We introduce mixOmics, an R package dedicated to the multivariate analysis of biological data sets with a specific focus on data exploration, dimension reduction and visualisation. By adopting a systems biology approach, the toolkit provides a wide range of methods that statistically integrate several data sets at once to probe relationships between heterogeneous ‘omics data sets. Our recent methods extend Projection to Latent Structure (PLS) models for discriminant analysis, for data integration across multiple ‘omics data or across independent studies, and for the identification of molecular signatures. We illustrate our latest mixOmics integrative frameworks for the multivariate analyses of ‘omics data available from the package.
机译:高通量技术的出现导致了来自不同来源(例如转录组学,蛋白质组学和代谢组学)的大量公开的omics数据。如果可以以整体方式提取相关信息,则将此类大规模生物数据集进行组合可以导致发现重要的生物学见解。当前的统计方法一直侧重于识别小分子子集(“分子标记”)以解释或预测生物学状况,但主要针对单一类型的“组学”。此外,常用的方法是单变量的,并且独立考虑每个生物学特征。我们介绍了mixOmics,这是一个R包,专用于生物数据集的多变量分析,特别着重于数据探索,降维和可视化。通过采用系统生物学方法,该工具包提供了广泛的方法,这些方法可以一次统计地集成多个数据集,以探究异构'组学数据集之间的关系。我们最近的方法将“投影到潜在结构(PLS)”模型扩展到了判别分析,跨多个组学数据或独立研究的数据集成以及分子标记的识别。我们举例说明了最新的mixOmics集成框架,可对软件包中可用的组学数据进行多变量分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号