首页> 外文期刊>Scientific reports. >Identification of Reliable Components in Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS): a Data-Driven Approach across Metabolic Processes
【24h】

Identification of Reliable Components in Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS): a Data-Driven Approach across Metabolic Processes

机译:识别多变量曲线分辨率 - 交替的最小二乘(MCR-ALS)的可靠组件:跨代谢过程的数据驱动方法

获取原文
           

摘要

There is an increasing need to use multivariate statistical methods for understanding biological functions, identifying the mechanisms of diseases, and exploring biomarkers. In addition to classical analyses such as hierarchical cluster analysis, principal component analysis, and partial least squares discriminant analysis, various multivariate strategies, including independent component analysis, non-negative matrix factorization, and multivariate curve resolution, have recently been proposed. However, determining the number of components is problematic. Despite the proposal of several different methods, no satisfactory approach has yet been reported. To resolve this problem, we implemented a new idea: classifying a component as “reliable” or “unreliable” based on the reproducibility of its appearance, regardless of the number of components in the calculation. Using the clustering method for classification, we applied this idea to multivariate curve resolution-alternating least squares (MCR-ALS). Comparisons between conventional and modified methods applied to proton nuclear magnetic resonance (1H-NMR) spectral datasets derived from known standard mixtures and biological mixtures (urine and feces of mice) revealed that more plausible results are obtained by the modified method. In particular, clusters containing little information were detected with reliability. This strategy, named “cluster-aided MCR-ALS,” will facilitate the attainment of more reliable results in the metabolomics datasets.
机译:越来越需要使用多元统计方法来了解生物学功能,鉴定疾病的机制,以及探索生物标志物。除了诸如层次聚类分析,主成分分析和局部最小二乘判别分析的典型分析之外,最近提出了各种多元组分分析,包括独立分量分析,非负数矩阵分解和多变量曲线分辨率。但是,确定组件的数量是有问题的。尽管提出了几种不同的方法,但尚未报告令人满意的方法。为了解决这个问题,我们实现了一个新的想法:根据其外观的再现性,将组件分类为“可靠”或“不可靠”,无论计算中的组件数量如何。使用群集方法进行分类,我们将此想法应用于多变量曲线分辨率 - 交替的最小二乘(MCR-ALS)。应用于质子核磁共振的常规和修饰方法的比较( 1 1 h-nmr)源自已知的标准混合物和生物混合物(小鼠尿液和粪便)的光谱数据集显示出更多的合理结果通过修改方法。特别是,以可靠性检测包含少数信息的集群。这个策略名为“群集辅助MCR-ALS”,将促进在代谢组合数据集中获得更可靠的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号