Journal: Proteomics

MaSS‐Simulator: A Highly Configurable Simulator for Generating MS/MS Datasets for Benchmarking of Proteomics Algorithms


Abstract

Mass spectrometry (MS)‐based proteomics has become an essential tool in the study of proteins. With the advent of modern MS machines, huge amounts of data are being generated, which can only be processed by novel algorithmic tools. However, in the absence of data benchmarks and ground‐truth datasets, testing algorithmic integrity and reproducibility is a challenging problem. To this end, MaSS‐Simulator is presented: an easy‐to‐use simulator that can be configured to simulate MS/MS datasets for a wide variety of conditions with known ground truths. MaSS‐Simulator offers many configuration options that give the user a great degree of control over the test datasets, which can enable rigorous and large‐scale testing of any proteomics algorithm. MaSS‐Simulator is assessed by comparing its output against experimentally generated spectra and spectra obtained from the NIST spectral library collections. The results show that MaSS‐Simulator‐generated spectra match real spectra closely and have a relative‐error distribution centered around 25%. In contrast, the theoretical spectra for the same peptides have a relative‐error distribution centered around 150%. MaSS‐Simulator will enable developers to specifically highlight the capabilities of their algorithms and provide strong evidence of any pitfalls they might face. Source code, executables, and a user manual for MaSS‐Simulator can be downloaded from https://github.com/pcdslab/MaSS-Simulator.
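The abstract uses "theoretical spectra" as its baseline but does not say how they were computed. As a rough illustration only, the Python sketch below follows the conventional textbook definition: the singly charged b‐ and y‐ion m/z values of an unmodified peptide, computed from monoisotopic residue masses. The function name theoretical_spectrum and the constant names are hypothetical, and this is not taken from the MaSS‐Simulator source code.

```python
# Illustrative sketch only; not MaSS-Simulator's implementation.
# Monoisotopic residue masses (Da) for the 20 standard amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.010565   # monoisotopic mass of H2O (Da)
PROTON = 1.007276   # mass of a proton (Da)


def theoretical_spectrum(peptide):
    """Return (label, m/z) pairs for singly charged b- and y-ions.

    Assumes an unmodified peptide made of the 20 standard residues.
    Peaks are returned sorted by m/z; no intensities are modeled.
    """
    masses = [RESIDUE_MASS[aa] for aa in peptide]
    n = len(masses)
    peaks = []
    for i in range(1, n):
        # b_i: first i residues plus one proton.
        peaks.append((f"b{i}", sum(masses[:i]) + PROTON))
        # y_i: last i residues plus water plus one proton.
        peaks.append((f"y{i}", sum(masses[-i:]) + WATER + PROTON))
    return sorted(peaks, key=lambda peak: peak[1])


if __name__ == "__main__":
    for label, mz in theoretical_spectrum("GAK"):
        print(f"{label}\t{mz:.4f}")
```

A spectrum of this kind contains only fragment positions, with no intensities, noise peaks, or missing ions, which is one plausible reason such spectra sit further from experimental data than simulated ones.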
