首页> 外文会议>International conference on intelligent computing >A New Representation Method of H1N1 Influenza Virus and Its Application
【24h】

A New Representation Method of H1N1 Influenza Virus and Its Application

机译:H1N1流感病毒的新表达方法及其应用

获取原文

摘要

Based on the 38,899 pieces of H1N1 virus protein sequences from 1902 to 2013 in the world, the 1805 H1N1 virus sequences with HA and NA protein are selected according to viruses occurred at the same time and place. A new representation of feature vector for protein sequences is proposed by the physicochemical properties of amino acids and coarse graining theories. The 20 kinds of amino acids are divided into 4 classes and connected with each other to construct 16-dimensional feature vectors to represent HA and NA protein sequence, respectively. The whole protein sequence is represented by a 32-dimensional feature vector, which combines the feature vectors of HA and NA protein sequences, and the optimal cluster of the H1N1 influenza virus is obtained by the structural clustering. The relationship between HA and NA protein structures and the outbreak of H1N1 virus protein sequences is analyzed by selecting the representative elements and constructing evolutionary tree. The results show that the new representation of feature vector for protein sequences is reasonable, and large amount of data confirms that HA and NA protein sequences play a direct and important role in the outbreak of H1N1 influenza virus.
机译:根据全球1902年至2013年的38899条H1N1病毒蛋白序列,根据同时发生的病毒,选择含有HA和NA蛋白的1805个H1N1病毒序列。通过氨基酸的理化性质和粗粒理论,提出了一种蛋白质序列特征向量的新表示形式。将20种氨基酸分为4类,并相互连接以构建分别代表HA和NA蛋白序列的16维特征向量。完整的蛋白质序列由32维特征向量表示,该向量将HA和NA蛋白质序列的特征向量结合在一起,并通过结构聚类获得了H1N1流感病毒的最佳簇。通过选择代表性元件并构建进化树,分析了HA和NA蛋白结构与H1N1病毒蛋白序列爆发之间的关系。结果表明,蛋白质序列特征向量的新表示是合理的,大量数据证实HA和NA蛋白质序列在H1N1流感病毒的爆发中起着直接而重要的作用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号