首页> 外文期刊>BMC Bioinformatics >PyMS: a Python toolkit for processing of gas chromatography-mass spectrometry (GC-MS) data. Application and comparative study of selected tools
【24h】

PyMS: a Python toolkit for processing of gas chromatography-mass spectrometry (GC-MS) data. Application and comparative study of selected tools

机译:PyMS:用于处理气相色谱-质谱(GC-MS)数据的Python工具包。所选工具的应用和比较研究

获取原文
           

摘要

Background Gas chromatography–mass spectrometry (GC-MS) is a technique frequently used in targeted and non-targeted measurements of metabolites. Most existing software tools for processing of raw instrument GC-MS data tightly integrate data processing methods with graphical user interface facilitating interactive data processing. While interactive processing remains critically important in GC-MS applications, high-throughput studies increasingly dictate the need for command line tools, suitable for scripting of high-throughput, customized processing pipelines. Results PyMS comprises a library of functions for processing of instrument GC-MS data developed in Python. PyMS currently provides a complete set of GC-MS processing functions, including reading of standard data formats (ANDI- MS/NetCDF and JCAMP-DX), noise smoothing, baseline correction, peak detection, peak deconvolution, peak integration, and peak alignment by dynamic programming. A novel common ion single quantitation algorithm allows automated, accurate quantitation of GC-MS electron impact (EI) fragmentation spectra when a large number of experiments are being analyzed. PyMS implements parallel processing for by-row and by-column data processing tasks based on Message Passing Interface (MPI), allowing processing to scale on multiple CPUs in distributed computing environments. A set of specifically designed experiments was performed in-house and used to comparatively evaluate the performance of PyMS and three widely used software packages for GC-MS data processing (AMDIS, AnalyzerPro, and XCMS). Conclusions PyMS is a novel software package for the processing of raw GC-MS data, particularly suitable for scripting of customized processing pipelines and for data processing in batch mode. PyMS provides limited graphical capabilities and can be used both for routine data processing and interactive/exploratory data analysis. In real-life GC-MS data processing scenarios PyMS performs as well or better than leading software packages. We demonstrate data processing scenarios simple to implement in PyMS, yet difficult to achieve with many conventional GC-MS data processing software. Automated sample processing and quantitation with PyMS can provide substantial time savings compared to more traditional interactive software systems that tightly integrate data processing with the graphical user interface.
机译:背景技术气相色谱-质谱(GC-MS)是一种经常用于代谢物的有针对性和非目标性测量的技术。现有的大多数用于处理原始仪器GC-MS数据的软件工具都将数据处理方法与图形用户界面紧密集成在一起,从而促进了交互式数据处理。尽管交互式处理在GC-MS应用程序中仍然至关重要,但是高通量研究越来越表明对命令行工具的需求,这些工具适用于对高通量定制化处理管道进行脚本编写。结果PyMS包含一个函数库,用于处理用Python开发的仪器GC-MS数据。 PyMS当前提供了一套完整的GC-MS处理功能,包括读取标准数据格式(ANDI-MS / NetCDF和JCAMP-DX),噪声平滑,基线校正,峰检测,峰去卷积,峰积分和通过动态编程。当分析大量实验时,一种新颖的通用离子单定量算法可以自动,准确地定量GC-MS电子碰撞(EI)碎片光谱。 PyMS基于消息传递接口(MPI)对行和列数据处理任务实现并行处理,从而允许在分布式计算环境中的多个CPU上进行扩展处理。在内部进行了一组专门设计的实验,用于比较评估PyMS的性能以及三个用于GC-MS数据处理的广泛使用的软件包(AMDIS,AnalyzerPro和XCMS)。结论PyMS是一种用于处理原始GC-MS数据的新型软件包,特别适合于编写定制处理管道的脚本以及用于批处理模式的数据处理。 PyMS提供有限的图形功能,可用于常规数据处理和交互式/探索性数据分析。在实际的GC-MS数据处理方案中,PyMS的性能与领先的软件包相同或更好。我们演示了在PyMS中易于实现的数据处理方案,但是在许多常规GC-MS数据处理软件中却难以实现。与将数据处理与图形用户界面紧密集成的更传统的交互式软件系统相比,使用PyMS进行自动样品处理和定量分析可以节省大量时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号