首页> 外国专利> METHODS OF CLUSTERING GENE AND PROTEIN SEQUENCES

METHODS OF CLUSTERING GENE AND PROTEIN SEQUENCES

机译:聚类基因和蛋白质序列的方法

摘要

The invention relates to methods for clustering gene and protein sequences. Inparticular, it involves generationof networks of sequences where the interconnections are based upon a measureof similarity. The invention also provides methodsof optimizing and improving the networks by re-wiring of the network basedupon overlap of the nearest neighbors of given pairsof nodes. The invention further provides methods of identifying clusters ofsequences within the networks and the optimizednetworks based upon the topology of the network. The clusters identifiedrepresent groups of sequences that are related by functionand/or evolution. The invention has particular applicability in annotation ofsequences in databases and identification of functionalhomologs which can be very useful for novel therapeutic and diagnostic targetsbased upon such targets belonging to a cluster orfamily that contains a known sequence such as a diagnostic sequence, antigenor other therapeutic target.
机译:本发明涉及使基因和蛋白质序列聚类的方法。在特别是它涉及到世代互连基于度量的序列网络的数量相似。本发明还提供了方法通过重新连接基于网络的网络来优化和改进网络的过程在给定对的最近邻居重叠时节点。本发明进一步提供了识别聚类的方法。网络内的序列和优化基于网络拓扑的网络。确定的集群代表与功能相关的序列组和/或进化。本发明特别适用于注释数据库中的序列和功能识别同源物对于新型治疗和诊断靶标可能非常有用基于属于集群的目标或包含已知序列(例如诊断序列,抗原)的家族或其他治疗目标。

著录项

  • 公开/公告号CA2633793A1

    专利类型

  • 公开/公告日2007-06-28

    原文格式PDF

  • 申请/专利权人 NOVARTIS VACCINES AND DIAGNOSTICS S.R.L.;

    申请/专利号CA20062633793

  • 申请日2006-12-19

  • 分类号C07K14/195;G06F19/22;G06F19/26;A61K31/7088;A61K39/02;A61K48;C07K14;C07K16;C07K16/12;C12N15/31;C12Q1/68;C40B30/02;G01N33/68;A61K39;C40B30/04;C40B30/06;C40B40/10;

  • 国家 CA

  • 入库时间 2022-08-21 20:53:32

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号