【24h】

Spectral clustering of protein sequences

机译:蛋白质序列的光谱聚类

获取原文

摘要

A major challenge in bioinformatics is the grouping together of protein sequences into functionally similar families. Large scale clustering of protein sequences may help to identify novel relationships and may also be of use in structural genomics. This paper explores the use of graph-theoretic spectral methods for clustering protein sequences. Using the leading eigenvectors of a matrix derived from similarity information between protein sequences, we were able to obtain meaningful clusters on quite diverse sets of proteins. The results presented show how this method is often able to identify correctly the superfamilies to which the sequences belong.
机译:生物信息学的主要挑战是将蛋白质序列分组到功能相似的家族中。蛋白质序列的大规模聚类可能有助于鉴定新的关系,也可能在结构基因组学中使用。本文探讨了使用图论光谱方法对蛋白质序列进行聚类的方法。使用从蛋白质序列之间的相似性信息得出的矩阵的本征特征向量,我们能够在非常多样化的蛋白质集合上获得有意义的簇。给出的结果显示了该方法通常如何能够正确识别序列所属的超家族。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号