MaSS-Simulator: A highly configurable simulator for generating MS/MS datasets for benchmarking of proteomics algorithms

Abstract

Mass spectrometry (MS) based proteomics has become an essential tool in the study of proteins. Modern MS machines generate huge amounts of data that can only be processed by novel algorithmic tools. However, in the absence of data benchmarks and ground-truth datasets, testing the integrity of algorithms and reproducing their results are challenging problems. To this end, we present MaSS-Simulator, an easy-to-use simulator that can be configured to simulate MS/MS datasets for a wide variety of conditions with known ground truths. MaSS-Simulator offers many configuration options that give the user a great degree of control over the test datasets, which enables rigorous and large-scale testing of any proteomics algorithm. We assessed MaSS-Simulator by comparing its output against experimentally generated spectra and spectra obtained from the NIST spectral library collections. Our results showed that spectra generated by MaSS-Simulator matched real spectra closely and had a relative-error distribution centered around 25%. In contrast, the theoretical spectra for the same peptides had a relative-error distribution centered around 150%. MaSS-Simulator will enable developers to highlight the specific capabilities of their algorithms and provide strong evidence of any pitfalls they might face. Source code, executables and a user manual for MaSS-Simulator can be downloaded from
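
The abstract contrasts simulated spectra with "theoretical spectra" for the same peptides. As a rough illustration of what a theoretical spectrum commonly means in MS/MS work, the sketch below lists the singly charged b- and y-ion m/z values of an unmodified peptide using standard monoisotopic residue masses. This is not MaSS-Simulator's code: the function name `theoretical_spectrum` is hypothetical, and the sketch assumes no modifications, no intensities, and charge state 1 only.

```python
# Monoisotopic residue masses (Da) for the 20 standard amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.010565   # mass of H2O, added to y ions
PROTON = 1.007276   # mass of the charge-carrying proton


def theoretical_spectrum(peptide):
    """Return sorted m/z values of singly charged b and y ions.

    Hypothetical helper for illustration only; assumes an unmodified
    peptide written in one-letter amino acid codes.
    """
    masses = [RESIDUE_MASS[aa] for aa in peptide]
    peaks = []
    # Cleave at every peptide bond: the prefix gives a b ion,
    # the suffix gives the complementary y ion.
    for i in range(1, len(masses)):
        b_ion = sum(masses[:i]) + PROTON
        y_ion = sum(masses[i:]) + WATER + PROTON
        peaks.append(round(b_ion, 4))
        peaks.append(round(y_ion, 4))
    return sorted(peaks)


if __name__ == "__main__":
    for mz in theoretical_spectrum("PEPTIDE"):
        print(mz)
```

A spectrum built this way contains only fragment positions, with no noise peaks or intensity variation, which is consistent with the abstract's observation that theoretical spectra deviate more from real spectra than simulated ones do.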

Bibliographic details

  • Journal: other
  • Authors: Muaaz Gul Awan; Fahad Saeed
  • Volume (issue): 18(20)
  • Pages: e1800206
  • Total pages: 7
  • Original format: PDF
  • Record added: 2022-08-21 11:06:42
