...
首页> 外文期刊>Frontiers in Bioengineering and Biotechnology >Ensembles of Hydrophobicity Scales as Potent Classifiers for Chimeric Virus-Like Particle Solubility – An Amino Acid Sequence-Based Machine Learning Approach
【24h】

Ensembles of Hydrophobicity Scales as Potent Classifiers for Chimeric Virus-Like Particle Solubility – An Amino Acid Sequence-Based Machine Learning Approach

机译:疏水性尺度的合奏作为嵌合病毒样颗粒溶解度的有效分类器 - 一种基于氨基酸序列的机器学习方法

获取原文
           

摘要

Virus-like particles (VLPs) are protein-based nanoscale structures that show high potential as immunotherapeutics or cargo delivery vehicles. Chimeric VLPs are decorated with foreign peptides resulting in structures that confer immune responses against the displayed epitope. However, insertion of foreign sequences often results in insoluble proteins, calling for methods capable of assessing a VLP candidate’s solubility in silico. The prediction of VLP solubility requires a model that can identify critical hydrophobicity-related parameters, distinguishing between VLP-forming aggregation and aggregation leading to insoluble virus protein clusters. Therefore, we developed and implemented a soft ensemble vote classifier (sEVC) framework based on chimeric hepatitis B core antigen (HBcAg) amino acid sequences and 91 publicly available hydrophobicity scales. Based on each hydrophobicity scale, an individual decision tree was induced as classifier in the sEVC. An embedded feature selection algorithm and stratified sampling proved beneficial for model construction. With a learning experiment, model performance in the space of model training set size and number of included classifiers in the sEVC was explored. Additionally, seven models were created from training data of 24-384 chimeric HBcAg constructs, which were validated by 100-fold Monte Carlo cross-validation. The models predicted external test sets of 184-544 chimeric HBcAg constructs. Best models showed a Matthew’s correlation coefficient of 0.6 on the validation and the external test set. Feature selection was evaluated for classifiers with best and worst performance in the chimeric HBcAg VLP solubility scenario. Analysis of the associated hydrophobicity scales allowed for retrieval of biological information related to the mechanistic backgrounds of VLP solubility, suggesting a special role of arginine for VLP assembly and solubility. In the future, the developed sEVC could further be applied to hydrophobicity-related problems in other domains, such as monoclonal antibodies.
机译:病毒样颗粒(VLP)是基于蛋白质的纳米级结构,其显示为免疫治疗剂或货物递送车辆的高潜力。嵌合VLP用外来肽装饰,导致赋予针对显示的表位的免疫应答的结构。然而,将外序列的插入通常导致不溶性蛋白质,呼吁能够评估VLP候选溶解度在硅中的方法。 VLP溶解度的预测需要一种可以识别关键疏水性相关参数的模型,区分VLP形成聚集和聚集导致不溶性病毒蛋白质簇。因此,我们开发并实现了基于嵌合乙型肝炎核酸抗原(HBCAG)氨基酸序列的软合成票分类器(SEVC)框架,91个公共可用疏水性尺度。基于每个疏水性比例,在SEVC中诱导单个决策树作为分类器。嵌入式特征选择算法和分层采样证明了模型结构的有益。探讨了学习实验,探讨了模型培训集合规模和综合分类器的空间中的模型性能。此外,七种模型是从24-384嵌合HBCAG构造的训练数据创建的,这些模型由100倍的蒙特卡罗交叉验证验证。模型预测了184-544嵌合HBCAG构建体的外部测试组。最佳型号显示了Matthew的相关系数> 0.6的验证和外部测试集。在嵌合HBCAG VLP溶解度方案中为具有最佳和最差性能的分类器评估功能选择。允许检索与VLP溶解度有关的生物学信息的相关疏水性尺度分析,表明精氨酸对VLP组装和溶解度的特殊作用。未来,开发的SEVC可以进一步应用于其他结构域中的疏水性相关问题,例如单克隆抗体。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号