首页> 美国卫生研究院文献>other >Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein
【2h】

Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein

机译:在蛋白质的理化特性水平上了解HA亚型的潜在机制

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The evolution of the influenza A virus to increase its host range is a major concern worldwide. Molecular mechanisms of increasing host range are largely unknown. Influenza surface proteins play determining roles in reorganization of host-sialic acid receptors and host range. In an attempt to uncover the physic-chemical attributes which govern HA subtyping, we performed a large scale functional analysis of over 7000 sequences of 16 different HA subtypes. Large number (896) of physic-chemical protein characteristics were calculated for each HA sequence. Then, 10 different attribute weighting algorithms were used to find the key characteristics distinguishing HA subtypes. Furthermore, to discover machine leaning models which can predict HA subtypes, various Decision Tree, Support Vector Machine, Naïve Bayes, and Neural Network models were trained on calculated protein characteristics dataset as well as 10 trimmed datasets generated by attribute weighting algorithms. The prediction accuracies of the machine learning methods were evaluated by 10-fold cross validation. The results highlighted the frequency of Gln (selected by 80% of attribute weighting algorithms), percentage/frequency of Tyr, percentage of Cys, and frequencies of Try and Glu (selected by 70% of attribute weighting algorithms) as the key features that are associated with HA subtyping. Random Forest tree induction algorithm and RBF kernel function of SVM (scaled by grid search) showed high accuracy of 98% in clustering and predicting HA subtypes based on protein attributes. Decision tree models were successful in monitoring the short mutation/reassortment paths by which influenza virus can gain the key protein structure of another HA subtype and increase its host range in a short period of time with less energy consumption. Extracting and mining a large number of amino acid attributes of HA subtypes of influenza A virus through supervised algorithms represent a new avenue for understanding and predicting possible future structure of influenza pandemics.
机译:甲型流感病毒的进化以扩大其宿主范围是世界范围内的主要关注。增加宿主范围的分子机制在很大程度上是未知的。流感表面蛋白在宿主唾液酸受体和宿主范围的重组中起决定性作用。为了揭示控制HA亚型的理化属性,我们对16种不同HA亚型的7000多个序列进行了大规模功能分析。对于每个HA序列,计算了大量(896)的理化蛋白质特性。然后,使用10种不同的属性加权算法来找到区分HA子类型的关键特征。此外,为了发现可以预测HA亚型的机器学习模型,在计算出的蛋白质特征数据集以及由属性加权算法生成的10个修剪数据集上训练了各种决策树,支持向量机,朴素贝叶斯和神经网络模型。通过10倍交叉验证对机器学习方法的预测准确性进行了评估。结果强调了Gln的频率(由80%的属性加权算法选择),Tyr的百分比/频率,Cys的百分比以及Try和Glu的频率(由70%的属性加权算法选择)是关键特征。与HA子类型关联。随机森林树归纳算法和支持向量机的RBF核函数(通过网格搜索进行缩放)在基于蛋白质属性的聚类和预测HA亚型中显示出98%的高精度。决策树模型成功地监测了短突变/重排路径,流感病毒可通过该短突变/重排路径获得另一种HA亚型的关键蛋白质结构,并在短时间内以较低的能耗增加其宿主范围。通过监督算法提取和挖掘甲型流感病毒HA亚型的大量氨基酸属性,为理解和预测流感大流行的未来结构提供了一条新途径。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号