首页> 外文会议>International Conference on Security, Pattern Analysis, and Cybernetics >Gradient boosting model for unbalanced quantitative mass spectra quality assessment
【24h】

Gradient boosting model for unbalanced quantitative mass spectra quality assessment

机译:不平衡定量质谱质量评估的梯度升压模型

获取原文

摘要

A method for controlling the quality of isotope labeled mass spectra is described here. In such mass spectra, the profiles of labeled (heavy) and unlabeled (light) peptide pairs provide us valuable information about the studied biological samples in different conditions. The core task of quality control in quantitative LC-MS experiment is to filter out low quality spectra or the peptides with error profiles. The most common used method for this problem is training a classifier for the spectra data to separate it into positive (high quality) and negative (low quality) ones. However, the small number of error profiles always makes the training data dominated by the positive samples, i.e., class imbalance problem. So the Syntheic minority over-sampling technique (SMOTE) is employed to handle the unbalanced data and then applied extreme gradient boosting (Xgboost) model as the classifier. We assessed the different heavy-light peptide ratio samples by the trained Xgboost classifier, and found that the SMOTE Xgboost classifier increases the reliability of peptide ratio estimations significantly.
机译:这里描述了一种用于控制标记标记的质谱的质量的方法。在这种质谱中,标记(重)和未标记的(Light)肽对的曲线为我们提供了有关在不同条件下研究的生物样品的有价值的信息。定量LC-MS实验中质量控制的核心任务是滤除低质量光谱或具有误差型材的肽。对于此问题的最常见的使用方法是培训用于光谱数据的分类器,以将其分成正(高质量)和负(低质量)。然而,少量错误配置文件始终使培训数据由正示例主导,即类不平衡问题。因此,使用合成的少数群体过度采样技术(SMOTE)来处理不平衡数据,然后将极端梯度升压(XGBoost)模型应用为分类器。我们评估由受过训练的分类器Xgboost不同重 - 轻肽比样品,并发现该SMOTE Xgboost分类显著增加肽比估计的可靠性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号