METHODS OF CLUSTERING GENE AND PROTEIN SEQUENCES
Abstract
The invention relates to methods for clustering gene and protein sequences. In particular, it involves generating networks of sequences in which the interconnections are based on a measure of similarity. The invention also provides methods of optimizing the networks by re-wiring them based on the overlap between the nearest neighbors of given pairs of nodes. The invention further provides methods of identifying clusters of sequences within the networks, and within the optimized networks, based on network topology. The identified clusters represent groups of sequences that are related by function and/or evolution. The invention is particularly applicable to annotating sequences in databases and to identifying functional homologs. Such homologs can point to novel therapeutic and diagnostic targets when they belong to a cluster or family that contains a known sequence, such as a diagnostic sequence, an antigen, or another therapeutic target.
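
The three steps named in the abstract (similarity network, neighbor-overlap re-wiring, topology-based clustering) can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not specify the similarity measure, the overlap criterion, or the cluster definition, so everything here is an assumption chosen for illustration. The k-mer Jaccard `similarity`, the `threshold` and `min_overlap` parameters, the use of closed neighborhoods in `neighbor_overlap`, and connected components as the cluster definition are all hypothetical stand-ins.

```python
from itertools import combinations


def kmers(seq, k=3):
    """Return the set of overlapping k-mers in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def similarity(a, b, k=3):
    """Jaccard similarity of k-mer sets (assumed stand-in for the
    unspecified similarity measure, e.g. an alignment score)."""
    ka, kb = kmers(a, k), kmers(b, k)
    union = ka | kb
    return len(ka & kb) / len(union) if union else 0.0


def build_network(seqs, threshold=0.3):
    """Connect every pair of sequences whose similarity meets the threshold."""
    adj = {name: set() for name in seqs}
    for u, v in combinations(seqs, 2):
        if similarity(seqs[u], seqs[v]) >= threshold:
            adj[u].add(v)
            adj[v].add(u)
    return adj


def neighbor_overlap(adj, u, v):
    """Jaccard overlap of the closed neighborhoods of u and v
    (each node counts as its own neighbor; an assumption)."""
    nu, nv = adj[u] | {u}, adj[v] | {v}
    return len(nu & nv) / len(nu | nv)


def rewire(adj, min_overlap=0.5):
    """Return a new network with an edge between exactly those pairs whose
    neighbor overlap is at least min_overlap (edges may be added or dropped)."""
    new_adj = {name: set() for name in adj}
    for u, v in combinations(adj, 2):
        if neighbor_overlap(adj, u, v) >= min_overlap:
            new_adj[u].add(v)
            new_adj[v].add(u)
    return new_adj


def clusters(adj):
    """Return the connected components as sorted lists of node names."""
    seen, result = set(), []
    for start in adj:
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(adj[node] - component)
        seen |= component
        result.append(sorted(component))
    return sorted(result)
```

Under these assumptions, calling `clusters(rewire(build_network(seqs)))` on a dictionary that maps sequence names to sequence strings returns the candidate families. In this sketch, re-wiring drops a weak bridge between two otherwise dense groups because the bridged nodes share few neighbors, so the two groups come out as separate clusters.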