...
首页> 外文期刊>BMC Genomics >Predicting sumoylation sites using support vector machines based on various sequence features, conformational flexibility and disorder
【24h】

Predicting sumoylation sites using support vector machines based on various sequence features, conformational flexibility and disorder

机译:使用支持向量机基于各种序列特征,构象柔韧性和无序性预测磺酰化位点

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Background Sumoylation, which is a reversible and dynamic post-translational modification, is one of the vital processes in a cell. Before a protein matures to perform its function, sumoylation may alter its localization, interactions, and possibly structural conformation. Abberations in protein sumoylation has been linked with a variety of disorders and developmental anomalies. Experimental approaches to identification of sumoylation sites may not be effective due to the dynamic nature of sumoylation, laborsome experiments and their cost. Therefore, computational approaches may guide experimental identification of sumoylation sites and provide insights for further understanding sumoylation mechanism. Results In this paper, the effectiveness of using various sequence properties in predicting sumoylation sites was investigated with statistical analyses and machine learning approach employing support vector machines. These sequence properties were derived from windows of size 7 including position-specific amino acid composition, hydrophobicity, estimated sub-window volumes, predicted disorder, and conformational flexibility. 5-fold cross-validation results on experimentally identified sumoylation sites revealed that our method successfully predicts sumoylation sites with a Matthew's correlation coefficient, sensitivity, specificity, and accuracy equal to 0.66, 73%, 98%, and 97%, respectively. Additionally, we have showed that our method compares favorably to the existing prediction methods and basic regular expressions scanner. Conclusions By using support vector machines, a new, robust method for sumoylation site prediction was introduced. Besides, the possible effects of predicted conformational flexibility and disorder on sumoylation site recognition were explored computationally for the first time to our knowledge as an additional parameter that could aid in sumoylation site prediction.
机译:背景Sumoylation是一种可逆的动态翻译后修饰,是细胞中的重要过程之一。在蛋白质成熟以执行其功能之前,SUMO化可能会改变其定位,相互作用以及可能的结构构象。蛋白SUMO化中的异常与多种疾病和发育异常有关。由于SUMO化的动态性质,繁琐的实验及其成本,用于识别SUMO化位点的实验方法可能无效。因此,计算方法可以指导SUMO化位点的实验鉴定,并为进一步理解SUMO化机制提供见解。结果在本文中,通过统计分析和使用支持向量机的机器学习方法,研究了使用各种序列特性来预测磺酰化位点的有效性。这些序列特性来自大小为7的窗口,包括位置特异性氨基酸组成,疏水性,估计的子窗口体积,预测的障碍和构象柔韧性。通过实验确定的磺酰化位点的5倍交叉验证结果表明,我们的方法成功预测了磺酰化位点,其马修相关系数,灵敏度,特异性和准确性分别等于0.66、73%,98%和97%。此外,我们已经表明,与现有的预测方法和基本正则表达式扫描仪相比,我们的方法具有优势。结论使用支持向量机,提出了一种新的,可靠的用于磺酰化位点预测的方法。此外,据我们所知,这是首次通过预测构象柔韧性和无序性对SUMO化位点识别的可能影响,作为有助于SUMO化位点预测的附加参数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号