首页> 美国卫生研究院文献>Scientific Reports >Identification of Reliable Components in Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS): a Data-Driven Approach across Metabolic Processes
【2h】

Identification of Reliable Components in Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS): a Data-Driven Approach across Metabolic Processes

机译:多元曲线分辨率交替最小二乘(MCR-ALS)中可靠成分的识别:一种跨代谢过程的数据驱动方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

There is an increasing need to use multivariate statistical methods for understanding biological functions, identifying the mechanisms of diseases, and exploring biomarkers. In addition to classical analyses such as hierarchical cluster analysis, principal component analysis, and partial least squares discriminant analysis, various multivariate strategies, including independent component analysis, non-negative matrix factorization, and multivariate curve resolution, have recently been proposed. However, determining the number of components is problematic. Despite the proposal of several different methods, no satisfactory approach has yet been reported. To resolve this problem, we implemented a new idea: classifying a component as “reliable” or “unreliable” based on the reproducibility of its appearance, regardless of the number of components in the calculation. Using the clustering method for classification, we applied this idea to multivariate curve resolution-alternating least squares (MCR-ALS). Comparisons between conventional and modified methods applied to proton nuclear magnetic resonance (1H-NMR) spectral datasets derived from known standard mixtures and biological mixtures (urine and feces of mice) revealed that more plausible results are obtained by the modified method. In particular, clusters containing little information were detected with reliability. This strategy, named “cluster-aided MCR-ALS,” will facilitate the attainment of more reliable results in the metabolomics datasets.
机译:越来越需要使用多元统计方法来理解生物学功能,识别疾病机制和探索生物标志物。除了经典的分析(例如层次聚类分析,主成分分析和偏最小二乘判别分析)外,最近还提出了各种多元策略,包括独立成分分析,非负矩阵分解和多元曲线分辨率。但是,确定部件的数量是有问题的。尽管提出了几种不同方法的建议,但尚未报告令人满意的方法。为了解决此问题,我们实现了一个新想法:根据组件外观的可重复性将其分为“可靠”或“不可靠”,而与计算中的组件数量无关。使用聚类方法进行分类,我们将此想法应用于多变量曲线分辨率-交替最小二乘(MCR-ALS)。从已知标准混合物和生物混合物(小鼠尿液和粪便)获得的质子核磁共振( 1 H-NMR)光谱数据集的常规方法和改进方法之间的比较表明,通过以下方法可获得更合理的结果修改后的方法。特别地,可靠地检测到包含很少信息的集群。该策略称为“集群辅助MCR-ALS”,将有助于在代谢组学数据集中获得更可靠的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号