首页> 外国专利> METHODS OF CLUSTERING GENE AND PROTEIN SEQUENCES

METHODS OF CLUSTERING GENE AND PROTEIN SEQUENCES

机译:聚类基因和蛋白质序列的方法

摘要

The invention relates to methods for clustering gene and protein sequences. In particular, it involves generation of networks of sequences where the interconnections are based upon a measure of similarity. The invention also provides methods of optimizing and improving the networks by re-wiring of the network based upon overlap of the nearest neighbors of given pairs of nodes. The invention further provides methods of identifying clusters of sequences within the networks and the optimized networks based upon the topology of the network. The clusters identified represent groups of sequences that are related by function and/or evolution. The invention has particular applicability in annotation of sequences in databases and identification of functional homologs which can be very useful for novel therapeutic and diagnostic targets based upon such targets belonging to a cluster or family that contains a known sequence such as a diagnostic sequence, antigen or other therapeutic target.
机译:本发明涉及使基因和蛋白质序列聚类的方法。特别地,它涉及序列网络的生成,其中互连基于相似性的度量。本发明还提供了基于给定节点对的最近邻居的重叠通过重新布线网络来优化和改善网络的方法。本发明还提供了基于网络拓扑来识别网络和优化网络内的序列簇的方法。所识别的簇代表通过功能和/或进化相关的序列组。本发明特别适用于数据库中序列的注释和功能同源物的鉴定,这对于基于属于含有已知序列如诊断序列,抗原或抗原的簇或家族的靶标的新型治疗和诊断靶标非常有用。其他治疗目标。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号