首页> 美国卫生研究院文献>BioData Mining >Partitioning clustering algorithms for protein sequence data sets
【2h】

Partitioning clustering algorithms for protein sequence data sets

机译:蛋白质序列数据集的分区聚类算法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

BackgroundGenome-sequencing projects are currently producing an enormous amount of new sequences and cause the rapid increasing of protein sequence databases. The unsupervised classification of these data into functional groups or families, clustering, has become one of the principal research objectives in structural and functional genomics. Computer programs to automatically and accurately classify sequences into families become a necessity. A significant number of methods have addressed the clustering of protein sequences and most of them can be categorized in three major groups: hierarchical, graph-based and partitioning methods. Among the various sequence clustering methods in literature, hierarchical and graph-based approaches have been widely used. Although partitioning clustering techniques are extremely used in other fields, few applications have been found in the field of protein sequence clustering. It is not fully demonstrated if partitioning methods can be applied to protein sequence data and if these methods can be efficient compared to the published clustering methods.
机译:背景技术基因组测序项目目前正在产生大量新序列,并导致蛋白质序列数据库的迅速增加。将这些数据无监督地分为功能组或家族,聚类,已成为结构和功能基因组学的主要研究目标之一。自动且准确地将序列分类为家族的计算机程序成为必要。大量方法解决了蛋白质序列的聚类问题,其中大多数可以分为三大类:分层方法,基于图的方法和分区方法。在文献中的各种序列聚类方法中,分层和基于图的方法已被广泛使用。尽管分区聚类技术在其他领域中被广泛使用,但是在蛋白质序列聚类领域中却发现了很少的应用。尚未完全证明分区方法是否可以应用于蛋白质序列数据,以及与公开的聚类方法相比这些方法是否有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号