首页> 外文期刊>Journal of Theoretical Biology >A novel model for protein sequence similarity analysis based on spectral radius
【24h】

A novel model for protein sequence similarity analysis based on spectral radius

机译:基于光谱半径的蛋白质序列相似性分析的新模型

获取原文
获取原文并翻译 | 示例
           

摘要

Advances in sequencing technologies led to rapid increase in the number and diversity of biological sequences, which facilitated development in the sequence research. In this paper, we present a new method for analyzing protein sequence similarity. We calculated the spectral radii of 20 amino acids (AAs) and put forward a novel 2-D graphical representation of protein sequences. To characterize protein sequences numerically, three groups of features were extracted and related to statistical, dynamics measurements and fluctuation complexity of the sequences. With the obtained feature vector, two models utilizing Gaussian Kernel similarity and Cosine similarity were built to measure the similarity between sequences. We applied our method to analyze the similarities/dissimilarities of four data sets. Both proposed models received consistent results with improvements when compared to that obtained by the ClustalW analysis. The novel approach we present in this study may therefore benefit protein research in medical and scientific fields. (C) 2018 Elsevier Ltd. All rights reserved.
机译:测序技术的进步导致生物序列的数量和多样性快速增加,促进了序列研究的发展。在本文中,我们提出了一种分析蛋白质序列相似性的新方法。我们计算了20个氨基酸(AAS)的光谱半径,并提出了蛋白质序列的新型2-D图形表示。为了在数值上表征蛋白质序列,提取三组特征,与统计,动力学测量和序列的波动复杂性相关。利用所获得的特征向量,建立了使用高斯内核相似性和余弦相似性的两个模型来测量序列之间的相似性。我们应用了我们的方法来分析四个数据集的相似之处/异化。与通过Clustalw分析所获得的相比,两个提出的模型都接受了一致的结果随着改进而改进。因此,我们本研究中存在的新方法可能会使医学和科学领域的蛋白质研究受益。 (c)2018年elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号