首页> 外文会议>International Conference on Computer and Information Technology >Improved protein disorder predictor by smoothing output
【24h】

Improved protein disorder predictor by smoothing output

机译:通过平滑输出改善蛋白质障碍预测器

获取原文

摘要

Intrinsically disorder regions (IDRs) or, proteins (IDPs) are associated with important biological functions, while lacking stable structure in their native state. The phenomena of disordered proteins or residues are abundant in nature and are extensively involved in critical human diseases and hence impacting drug discovery. Thus, the study using disorder prediction is becoming crucial in the proteomic research. The large scale growth of genome database demands high performance computational methods for identification of protein disorder. We developed a canonical support vector machine based disorder predictor, DisPredict by integrating RBF kernel. It employs novel feature set for accurate characterization of disorder which outperformed two leading predictors: the neural network based SPINE-D and Meta predictor MFDp based on ten-fold cross validation. We propose a post processing of probabilities to further improve the accuracy, named DisPredict1.1 which yields outstanding performance further both in binary annotation and real valued probability prediction per residue in both short and long disordered regions. It provides highest Mathews Correlation Coefficient (MCC), competitive Area Under receiver operating characteristic Curve (AUC) and lowest Mean Absolute Error (MAE) when compared with twenty existing predictors of several kinds on independent benchmark dataset. DisPredict is available online.
机译:本质障碍区域(IDRS)或蛋白质(IDPS)与重要的生物学功能相关,同时缺乏原生状态的稳定结构。无序蛋白质或残留物的现象本质上很丰富,并且广泛参与危重人类疾病,从而遭受影响的药物发现。因此,使用紊乱预测的研究在蛋白质组学研究中成为至关重要。基因组数据库的大规模生长需要高性能计算方法,用于鉴定蛋白质障碍。我们开发了一种规范支持向量机基础的疾病预测因子,通过集成RBF内核而脱颖而出。它采用了新颖的特征,可用于精确表征的紊乱,这两种前导预测因子优于两个领先的预测因子:基于神经网络的基于神经网络的脊柱-D和元预测器MFDP基于十倍交叉验证。我们提出了一种后期处理概率,以进一步提高名为Disprictict1.1的准确性,这在短期和长期无序区域中的二元注释和真实值概率预测中进一步产生了出色的性能。它在与独立基准数据集中的二十多种现有预测器相比,提供最高的Mathews相关系数(MCC),接收器操作特征曲线(AUC)和最低平均绝对误差(MAE)。 Disprictict在线提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号