首页> 外文会议>Engineering in Medicine and Biology Society, 2004. IEMBS '04. 26th Annual International Conference of the IEEE >Modeling protein tandem mass spectrometry data with an extended linear regression strategy
【24h】

Modeling protein tandem mass spectrometry data with an extended linear regression strategy

机译:使用扩展的线性回归策略为蛋白质串联质谱数据建模

获取原文

摘要

Tandem mass spectrometry (MS/MS) has emerged as a cornerstone of proteomics owing in part to robust spectral interpretation algorithm. The intensity patterns presented in mass spectra are useful information for identification of peptides and proteins. However, widely used algorithms can not predicate the peak intensity patterns exactly. We have developed a systematic analytical approach based on a family of extended regression models, which permits routine, large scale protein expression profile modeling. By proving an important technical result that the regression coefficient vector is just the eigenvector corresponding to the least eigenvalue of a space transformed version of the original data, this extended regression problem can be reduced to a SVD decomposition problem, thus gain the robustness and efficiency. To evaluate the performance of our model, from 60,960 spectra, we chose 2,859 with high confidence, non redundant matches as training data, based on this specific problem, we derived some measurements of goodness of fit to show that our modeling method is reasonable. The issues of overfitting and underfitting are also discussed. This extended regression strategy therefore offers an effective and efficient framework for in-depth investigation of complex mammalian proteomes.
机译:串联质谱(MS / MS)已成为蛋白质组学的基石,部分原因是强大的光谱解释算法。质谱图中显示的强度模式是鉴定肽和蛋白质的有用信息。但是,广泛使用的算法无法准确预测峰强度模式。我们已经基于一系列扩展的回归模型开发了一种系统的分析方法,该模型可以进行常规的大规模蛋白质表达谱建模。通过证明一个重要的技术结果,即回归系数向量仅仅是与原始数据的空间变换版本的最小特征值相对应的特征向量,可以将此扩展回归问题简化为SVD分解问题,从而提高了鲁棒性和效率。为了评估我们模型的性能,从60,960个光谱中,我们选择了2859个具有高置信度,非冗余匹配的数据作为训练数据,基于这个特定问题,我们得出了一些拟合优度的度量,以证明我们的建模方法是合理的。还讨论了过度拟合和拟合不足的问题。因此,这种扩展的回归策略为深入研究复杂的哺乳动物蛋白质组学提供了有效且有效的框架。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号