首页> 外文会议>Workshop on Genome Informatics >A graph-based clustering method for a large set of sequences using a graph partitioning algorithm
【24h】

A graph-based clustering method for a large set of sequences using a graph partitioning algorithm

机译:一种基于图的聚类方法,用于使用曲线图分区算法的大组序列

获取原文

摘要

A graph-based clustering method is proposed to cluster protein sequences into families, which automatically improves clusters of the conventional single linkage clustering method. Our approach formulates sequence clustering problem as a kind of graphpartitioning problem in a weighted linkage graph, which vertices correspond to sequences, edges correspond to higher similarities than given threshold and are weighted by their similarities. The effectiveness of our method is shown in comparison with InterPro families in all mouse proteins in SWISS-PROT. The result clusters match to InterPro families much better than the single linkage clustering method. 77 percent of proteins in InterPro families are classified into appropriate clusters.
机译:将基于图的聚类方法提出给将蛋白质序列聚类为家族,其自动改善常规单连杆聚类方法的簇。我们的方法将序列聚类问题制定为加权连杆图中的图形聚类问题,该曲线图中的顶点对应于序列,边缘对应于比给定阈值更高的相似性,并且由它们的相似性加权。与Swiss-prot中的所有小鼠蛋白质中的Interpro系列相比,我们的方法的有效性如瑞士科学。结果集群与Interpro系列匹配比单个链接聚类方法更好。 77%的Interpro系列蛋白质被分类为适当的集群。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号