...
首页> 外文期刊>Analytical chemistry >Visualization, Quantification, and Alignment of Spectral Drift in Population Scale Untargeted Metabolomics Data
【24h】

Visualization, Quantification, and Alignment of Spectral Drift in Population Scale Untargeted Metabolomics Data

机译:人口规模无标准的代谢组数据中的可视化,量化和对准的光谱漂移

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Untargeted liquid-chromatography mass spectrometry (LC-MS)-based metabolomics analysis of human biospecimens has become among the most promising strategies for probing the underpinnings of human health and disease. Analysis of spectral data across population scale cohorts, however, is precluded by day-to-day nonlinear signal drifts in LC retention time or batch effects that complicate comparison of thousands of untargeted peaks. To date, there exists no efficient means of visualization and quantitative assessment of signal drift, correction of drift when present, and automated filtering of unstable spectral features, particularly across thousands of data files in population scale experiments. Herein, we report the development of a set of R-based scripts that allow for pre-and postprocessing of raw LC-MS data. These methods can be integrated with existing data analysis workflows by providing initial preprocessing bulk nonlinear retention time correction at the raw data level. Further, this approach provides postprocessing visualization and quantification of peak alignment accuracy, as well as peak-reliability-based parsing of processed data through hierarchical clustering of signal profiles. In a metabolomics data set derived from similar to 3000 human plasma samples, we find that application of our alignment tools resulted in substantial improvement in peak alignment accuracy, automated data filtering, and ultimately statistical power for detection of metabolite correlates of clinical measures. These tools will enable metabolomics studies of population scale cohorts.
机译:未确定的液相色谱质谱(LC-MS)基础的人类生物起草的代谢分析已成为探讨人类健康和疾病的基础的最有希望的策略之一。然而,跨人类规模群体的光谱数据分析,但是在LC保留时间或批量效应中的日常非线性信号漂移,使得复杂的数千个未确定峰的比较。迄今为止,存在无效的可视化方法和定量评估信号漂移,当存在时漂移的校正,以及自动过滤不稳定的光谱特征,特别是在人口比例实验中的数千个数据文件中。这里,我们报告了一组基于R基脚本的开发,允许预先处理原始LC-MS数据。通过在原始数据级别提供初始预处理散装非线性保留时间校正,可以将这些方法与现有的数据分析工作流集成在一起。此外,该方法提供了后处理的可视化和峰值对准精度的量化,以及通过信号配置文件的分层聚类的基于峰值可靠性的处理数据解析。在从类似于3000个人的等离子体样本的代谢组数据集中,我们发现我们的对准工具的应用导致峰值对准精度,自动化数据滤波和最终统计功率的显着提高,以检测代谢物相关的临床测量。这些工具将使人口规模队列的代谢组学研究。

著录项

  • 来源
    《Analytical chemistry》 |2017年第3期|共6页
  • 作者单位

    Univ Calif San Diego Dept Med La Jolla CA 92093 USA;

    Harvard Med Sch Brigham &

    Womens Hosp Dept Med Div Cardiovasc Boston MA 02115 USA;

    Harvard Med Sch Brigham &

    Womens Hosp Dept Med Div Cardiovasc Boston MA 02115 USA;

    Univ Calif San Diego Dept Med La Jolla CA 92093 USA;

    Framingham Heart Dis Epidemiol Study Framingham MA 01702 USA;

    Harvard Med Sch Brigham &

    Womens Hosp Dept Med Div Cardiovasc Boston MA 02115 USA;

    Univ Calif San Diego Dept Med La Jolla CA 92093 USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 分析化学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号