...
首页> 外文期刊>Journal of Theoretical Biology >Neural network and SVM classifiers accurately predict lipid binding proteins, irrespective of sequence homology
【24h】

Neural network and SVM classifiers accurately predict lipid binding proteins, irrespective of sequence homology

机译:神经网络和SVM分类剂准确地预测脂质结合蛋白,无论序列同源性如何

获取原文
获取原文并翻译 | 示例
           

摘要

Due to the central roles of lipid binding proteins (LBPs) in many biological processes, sequence based identification of LBPs is of great interest. The major challenge is that LBPs are diverse in sequence, structure, and function which results in low accuracy of sequence homology based methods. Therefore, there is a need for developing alternative functional prediction methods irrespective of sequence similarity. To identify LBPs from non-LBPs, the performances of support vector machine (SVM) and neural network were compared in this study. Comprehensive protein features and various techniques were employed to create datasets. Five-fold cross-validation (CV) and independent evaluation (IE) tests were used to assess the validity of the two methods. The results indicated that SVM outperforms neural network. SVM achieved 89.28% (CV) and 89.55% (IE) overall accuracy in identification of LBPs from non-LBPs and 92.06% (CV) and 92.90% (IE) (in average) for classification of different LBPs classes. Increasing the number and the range of extracted protein features as well as optimization of the SVM parameters significantly increased the efficiency of LBPs class prediction in comparison to the only previous report in this field. Altogether, the results showed that the SVM algorithm can be run on broad, computationally calculated protein features and offers a promising tool in detection of LBPs classes. The proposed approach has the potential to integrate and improve the common sequence alignment based methods. (C) 2014 Elsevier Ltd. All rights reserved.
机译:由于脂质结合蛋白(Lbps)在许多生物学过程中,基于序列的LBP鉴定具有很大的兴趣。主要挑战是LBPS序列,结构和功能是多样的,导致基于序列同源性的方法的低精度。因此,无论序列相似度如何,都需要开发替代功能预测方法。为了从非LBPS识别LBP,在本研究中比较了支持向量机(SVM)和神经网络的性能。采用综合蛋白质特征和各种技术来创建数据集。使用五倍的交叉验证(CV)和独立评估(即)测试来评估两种方法的有效性。结果表明SVM优于神经网络。 SVM达到89.28%(CV)和89.55%(即)鉴定来自非LBP的LBP和92.06%(CV)和92.90%(IE)(IE)(平均)的总体准确性,用于分类不同的LBPS类。增加了提取的蛋白质特征的数量和范围以及SVM参数的优化显着提高了LBPS类预测的效率与该字段中唯一的先前报告相比。结果,结果表明,SVM算法可以在广泛的计算计算的蛋白质特征上运行,并在检测Lbps类中提供有希望的工具。所提出的方法有可能集成和改善基于常见的序列对准方法。 (c)2014年elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号