首页> 外文会议>International Conference on Computer and Information Technology >Improved protein disorder predictor by smoothing output
【24h】

Improved protein disorder predictor by smoothing output

机译:通过平滑输出改进了蛋白质异常预测因子

获取原文

摘要

Intrinsically disorder regions (IDRs) or, proteins (IDPs) are associated with important biological functions, while lacking stable structure in their native state. The phenomena of disordered proteins or residues are abundant in nature and are extensively involved in critical human diseases and hence impacting drug discovery. Thus, the study using disorder prediction is becoming crucial in the proteomic research. The large scale growth of genome database demands high performance computational methods for identification of protein disorder. We developed a canonical support vector machine based disorder predictor, DisPredict by integrating RBF kernel. It employs novel feature set for accurate characterization of disorder which outperformed two leading predictors: the neural network based SPINE-D and Meta predictor MFDp based on ten-fold cross validation. We propose a post processing of probabilities to further improve the accuracy, named DisPredict1.1 which yields outstanding performance further both in binary annotation and real valued probability prediction per residue in both short and long disordered regions. It provides highest Mathews Correlation Coefficient (MCC), competitive Area Under receiver operating characteristic Curve (AUC) and lowest Mean Absolute Error (MAE) when compared with twenty existing predictors of several kinds on independent benchmark dataset. DisPredict is available online.
机译:本质上,无序区域(IDR)或蛋白质(IDP)与重要的生物学功能相关,而在其天然状态下却缺乏稳定的结构。蛋白质或残基无序现象在自然界中十分丰富,广泛涉及人类的关键疾病,因此会影响药物的发现。因此,使用障碍预测的研究在蛋白质组学研究中变得至关重要。基因组数据库的大规模增长需要用于识别蛋白质异常的高性能计算方法。我们通过集成RBF内核开发了基于规范支持向量机的疾病预测器DisPredict。它采用新颖的特征集对疾病进行准确的表征,其性能优于两个主要的预测指标:基于神经网络的SPINE-D和基于十倍交叉验证的元预测指标MFDp。我们提出了概率的后处理,以进一步提高准确性,将其命名为DisPredict1.1,它在二进制注释和短期和长期无序区域中每个残基的实值概率预测上均具有出色的性能。与独立基准数据集上已有的二十种现有预测变量相比,它提供了最高的Mathews相关系数(MCC),有竞争力的接收机工作特性曲线下面积(AUC)和最低的平均绝对误差(MAE)。 DisPredict可在线获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号