...
首页> 外文期刊>Journal of proteome research >Machine-Learning-Based Prediction of Cell-Penetrating Peptides and Their Uptake Efficiency with Improved Accuracy
【24h】

Machine-Learning-Based Prediction of Cell-Penetrating Peptides and Their Uptake Efficiency with Improved Accuracy

机译:基于机器学习的细胞穿透肽预测及其提高精度的吸收效率

获取原文
获取原文并翻译 | 示例

摘要

Cell-penetrating peptides (CPPs) can enter cells as a variety of biologically active conjugates and have various biomedical applications. To offset the cost and effort of designing novel CPPs in laboratories, computational methods are necessitated to identify candidate CPPs before in vitro experimental studies. We developed a two-layer prediction framework called machine-learning-based prediction of cell-penetrating peptides (MLCPPs). The first-layer predicts whether a given peptide is a CPP or non-CPP, whereas the second-layer predicts the uptake efficiency of the predicted CPPs. To construct a two-layer prediction framework, we employed four different machine-learning methods and five different compositions including amino acid composition (AAC), dipeptide composition, amino acid index, composition transition distribution, and physicochemical properties (PCPs). In the first layer, hybrid features (combination of AAC and PCP) and extremely randomized tree outperformed state-of-the-art predictors in CPP prediction with an accuracy of 0.896 when tested on independent data sets, whereas in the second layer, hybrid features obtained through feature selection protocol and random forest produced an accuracy of 0.725 that is better than state-of-the-art predictors. We anticipate that our method MLCPP will become a valuable tool for predicting CPPs and their uptake efficiency and might facilitate hypothesis-driven experimental design. The MLCPP server interface along with the benchmarking and independent data sets are freely accessible at www.thegleelab.org/MLCPP.
机译:细胞穿透肽(CPP)可以进入细胞作为各种生物活性缀合物并具有各种生物医学应用。为了抵消在实验室中设计新的CPP的成本和精力,需要在体外​​实验研究之前识别候选CPP的计算方法。我们开发了一种称为机器学习的预测框架的两层预测框架,其对细胞穿透肽(MLCPPS)的预测。第一层预测给定的肽是CPP或非CPP,而第二层预测预测CPP的摄取效率。为了构建双层预测框架,我们采用四种不同的机器学习方法和五种不同的组合物,包括氨基酸组合物(AAC),二肽组合物,氨基酸指数,组成转变分布和物理化学性质(PCP)。在第一层中,混合特征(AAC和PCP的组合)和极其随机的树在CPP预测中优于最终的最先进的预测器,其在独立数据集上测试时的精度为0.896,而在第二层中,混合特征通过特征选择协议和随机森林获得的精度为0.725,比最先进的预测器更好。我们预计我们的方法MLCPP将成为预测CPP的宝贵工具及其吸收效率,并且可以促进假设驱动的实验设计。 MLCPP服务器接口以及基准测试和独立数据集可在www.thegleelab.org/mlcpp上自由访问。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号