首页> 美国卫生研究院文献>other >Extraction interpretation and validation of information for comparing samples in metabolic LC/MS data sets
【2h】

Extraction interpretation and validation of information for comparing samples in metabolic LC/MS data sets

机译:提取解释和验证信息以比较代谢LC / MS数据集中的样品

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

LC/MS is an analytical technique that, due to its high sensitivity, has become increasingly popular for the generation of metabolic signatures in biological samples and for the building of metabolic data bases. However, to be able to create robust and interpretable (transparent) multivariate models for the comparison of many samples, the data must fulfil certain specific criteria: (i) that each sample is characterized by the same number of variables, (ii) that each of these variables is represented across all observations, and (iii) that a variable in one sample has the same biological meaning or represents the same metabolite in all other samples. In addition, the obtained models must have the ability to make predictions of, e.g. related and independent samples characterized accordingly to the model samples. This method involves the construction of a representative data set, including automatic peak detection, alignment, setting of retention time windows, summing in the chromatographic dimension and data compression by means of alternating regression, where the relevant metabolic variation is retained for further modelling using multivariate analysis. This approach has the advantage of allowing the comparison of large numbers of samples based on their LC/MS metabolic profiles, but also of creating a means for the interpretation of the investigated biological system. This includes finding relevant systematic patterns among samples, identifying influential variables, verifying the findings in the raw data, and finally using the models for predictions. The presented strategy was here applied to a population study using urine samples from two cohorts, Shanxi (People’s Republic of China) and Honolulu (USA). The results showed that the evaluation of the extracted information data using partial least square discriminant analysis (PLS-DA) provided a robust, predictive and transparent model for the metabolic differences between the two populations. The presented findings suggest that this is a general approach for data handling, analysis, and evaluation of large metabolic LC/MS data sets.
机译:LC / MS是一种分析技术,由于其高灵敏度,已变得越来越流行,用于在生物样品中生成代谢特征以及用于建立代谢数据库。但是,为了能够创建健壮且可解释的(透明)多元模型来比较多个样本,数据必须满足某些特定条件:(i)每个样本都具有相同数量的变量,(ii)每个样本这些变量中的所有变量均在所有观察结果中表示,并且(iii)一个样品中的变量在所有其他样品中具有相同的生物学含义或表示相同的代谢物。另外,所获得的模型必须具有例如预测的能力。相关样本和独立样本的特征与模型样本相对应。该方法涉及构建代表性数据集,包括自动峰检测,比对,保留时间窗的设置,色谱维数求和和通过交替回归进行的数据压缩,其中相关的代谢变异保留下来,可用于使用多变量进行进一步建模分析。这种方法的优点是可以根据其LC / MS代谢谱比较大量样品,还可以创建一种方法来解释所研究的生物系统。这包括在样本中找到相关的系统模式,识别有影响力的变量,验证原始数据中的发现,最后使用模型进行预测。此处提出的策略已用于使用来自两个队列(山西(中华人民共和国)和檀香山(美国))的尿液样本进行的人群研究中。结果表明,使用偏最小二乘判别分析(PLS-DA)对提取的信息数据进行评估,为两个人群之间的代谢差异提供了可靠,可预测和透明的模型。提出的发现表明,这是用于大型代谢LC / MS数据集的数据处理,分析和评估的通用方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号