...
首页> 外文期刊>Computational Biology and Bioinformatics, IEEE/ACM Transactions on >Peakbin Selection in Mass Spectrometry Data Using a Consensus Approach with Estimation of Distribution Algorithms
【24h】

Peakbin Selection in Mass Spectrometry Data Using a Consensus Approach with Estimation of Distribution Algorithms

机译:估计分布算法估计的共识方法在质谱数据中选择峰箱

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Progress is continuously being made in the quest for stable biomarkers linked to complex diseases. Mass spectrometers are one of the devices for tackling this problem. The data profiles they produce are noisy and unstable. In these profiles, biomarkers are detected as signal regions (peaks), where control and disease samples behave differently. Mass spectrometry (MS) data generally contain a limited number of samples described by a high number of features. In this work, we present a novel class of evolutionary algorithms, estimation of distribution algorithms (EDA), as an efficient peak selector in this MS domain. There is a trade-of f between the reliability of the detected biomarkers and the low number of samples for analysis. For this reason, we introduce a consensus approach, built upon the classical EDA scheme, that improves stability and robustness of the final set of relevant peaks. An entire data workflow is designed to yield unbiased results. Four publicly available MS data sets (two MALDI-TOF and another two SELDI-TOF) are analyzed. The results are compared to the original works, and a new plot (peak frequential plot) for graphically inspecting the relevant peaks is introduced. A complete online supplementary page, which can be found at http://www.sc.ehu.es/ccwbayes/members/ruben/ms, includes extended info and results, in addition to Matlab scripts and references.
机译:在寻求与复杂疾病有关的稳定生物标志物方面正在不断取得进展。质谱仪是解决该问题的装置之一。他们生成的数据配置文件嘈杂且不稳定。在这些配置文件中,将生物标记物检测为信号区域(峰值),其中对照样本和疾病样本的行为有所不同。质谱(MS)数据通常包含数量有限的样品,这些样品由大量特征描述。在这项工作中,我们提出了一种新颖的进化算法,即分布算法(EDA)估计,作为该MS域中的有效峰值选择器。在检测到的生物标志物的可靠性与少量样品进行分析之间需要权衡f。因此,我们引入一种基于经典EDA方案的共识方法,该方法可提高最终相关峰集的稳定性和鲁棒性。整个数据工作流程旨在产生公正的结果。分析了四个公开可用的MS数据集(两个MALDI-TOF和另一个两个SELDI-TOF)。将结果与原始工作进行比较,并引入了用于以图形方式检查相关峰的新图(峰值频率图)。完整的在线补充页面可在http://www.sc.ehu.es/ccwbayes/members/ruben/ms上找到,除了Matlab脚本和参考之外,还包括扩展的信息和结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号