首页> 外文期刊>Bioinformatics >Improved model-based, platform-independent feature extraction for mass spectrometry
【24h】

Improved model-based, platform-independent feature extraction for mass spectrometry

机译:改进的基于模型,与平台无关的质谱特征提取

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: Mass spectrometry (MS) is increasingly being used for biomedical research. The typical analysis of MS data consists of several steps. Feature extraction is a crucial step since subsequent analyses are performed only on the detected features. Current methodologies applied to low-resolution MS, in which features are peaks or wavelet functions, are parameter-sensitive and inaccurate in the sense that peaks and wavelet functions do not directly correspond to the underlying molecules under observation. In highresolution MS, the model-based approach is more appealing as it can provide a better representation of the MS signals by incorporating information about peak shapes and isotopic distributions. Current model-based techniques are computationally expensive; various algorithms have been proposed to improve the computational efficiency of this paradigm. However, these methods cannot deal well with overlapping features, especially when they are merged to create one broad peak. In addition, no method has been proven to perform well across different MS platforms. Results: We suggest a new model- based approach to feature extraction in which spectra are decomposed into a mixture of distributions derived from peptide models. By incorporating kernelbased smoothing and perceptual similarity for matching distributions, our statistical framework improves existing methodologies in terms of computational efficiency and the accuracy of the results. Our model is parameterized by physical properties and is therefore applicable to different MS instruments and settings. We validate our approach on simulated data, and show that the performance is higher than commonly used tools on real high- and low-resolution MS, and MS/MS data sets.
机译:动机:质谱(MS)越来越多地用于生物医学研究。 MS数据的典型分析包括几个步骤。特征提取是关键步骤,因为仅对检测到的特征执行后续分析。当前应用于低分辨率MS的方法(其特征是峰或小波函数)对参数敏感,并且在峰和小波函数不直接对应于所观察的基础分子的意义上是不准确的。在高分辨率质谱仪中,基于模型的方法更具吸引力,因为它可以通过合并有关峰形和同位素分布的信息来更好地表示质谱仪信号。当前基于模型的技术在计算上是昂贵的。已经提出了各种算法来提高该范例的计算效率。但是,这些方法不能很好地处理重叠特征,特别是将它们合并以创建一个宽峰时。此外,没有方法被证明能在不同的MS平台上很好地执行。结果:我们建议一种基于模型的新特征提取方法,其中将光谱分解为从肽模型衍生的分布的混合物。通过结合基于核的平滑和感知相似度以匹配分布,我们的统计框架在计算效率和结果准确性方面改进了现有方法。我们的模型通过物理属性进行参数化,因此适用于不同的MS仪器和设置。我们验证了我们在模拟数据上的方法,并表明在实际的高分辨率和低分辨率MS以及MS / MS数据集上,其性能高于常用工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号