首页> 美国卫生研究院文献>BMC Bioinformatics >Evaluation and improvements of clustering algorithms for detecting remote homologous protein families
【2h】

Evaluation and improvements of clustering algorithms for detecting remote homologous protein families

机译:用于检测远程同源蛋白家族的聚类算法的评估和改进

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

BackgroundAn important problem in computational biology is the automatic detection of protein families (groups of homologous sequences). Clustering sequences into families is at the heart of most comparative studies dealing with protein evolution, structure, and function. Many methods have been developed for this task, and they perform reasonably well (over 0.88 of F-measure) when grouping proteins with high sequence identity. However, for highly diverged proteins the performance of these methods can be much lower, mainly because a common evolutionary origin is not deduced directly from sequence similarity. To the best of our knowledge, a systematic evaluation of clustering methods over distant homologous proteins is still lacking.
机译:背景技术计算生物学中的一个重要问题是蛋白质家族(同源序列组)的自动检测。将序列聚类为家族是处理蛋白质进化,结构和功能的大多数比较研究的核心。已经开发出许多方法来完成此任务,并且在对具有高序列同一性的蛋白质进行分组时,它们表现良好(超过F值的0.88)。但是,对于高度差异较大的蛋白质,这些方法的性能可能会低得多,这主要是因为不能直接从序列相似性推论出共同的进化起源。据我们所知,仍然缺乏对远距离同源蛋白的聚类方法的系统评价。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号