
A protein family classification method for analysis of large DNA sequences

Abstract

A method is described for identification and classification of proteins encoded in large DNA sequences. Previously, an automated system was introduced for the general detection of amino acid sequence motifs within diverse protein families. The system generated a database consisting of aligned sequence segments (blocks) that correspond to the most highly conserved regions of proteins. This database of blocks can be searched using protein queries for sensitive detection of homology based on the detection of both local and global similarities. We show that this database searching approach can also be used to detect distant relatives encoded in very large DNA sequences. The approach is illustrated by the detection of known and new relationships in the 315 kilobase sequence of yeast chromosome III.
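
The abstract does not spell out how a DNA sequence is compared against protein blocks. One common way to do this, sketched here as an illustration only, is to translate the DNA in all six reading frames and slide a block-derived scoring matrix along each translation. The function names, the uniform background frequencies, the pseudocount, and the stop-codon penalty below are all assumptions made for this sketch, not details of the authors' system.

```python
from itertools import product
from math import log

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
ALPHABET = "ACDEFGHIKLMNPQRSTVWY"


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna):
    """Translate complete codons; unknown codons become 'X', stops '*'."""
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))


def six_frames(dna):
    """Map frame labels (+1..+3, -1..-3) to conceptual translations."""
    dna = dna.upper()
    rc = reverse_complement(dna)
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(dna[offset:])
        frames[f"-{offset + 1}"] = translate(rc[offset:])
    return frames


def block_to_scores(block, pseudocount=1.0):
    """Per-column log-odds scores from aligned, equal-length segments.

    Assumes a uniform amino acid background and a flat pseudocount;
    both are simplifications chosen for this sketch.
    """
    columns = []
    for j in range(len(block[0])):
        counts = {a: pseudocount for a in ALPHABET}
        for segment in block:
            counts[segment[j]] += 1
        total = sum(counts.values())
        columns.append({a: log(counts[a] / total * len(ALPHABET))
                        for a in ALPHABET})
    return columns


def best_hit(dna, block, penalty=-5.0):
    """Return (score, frame, start, window) for the best-scoring window.

    `start` is a residue offset within the translated frame. Stop codons
    and unknown residues receive `penalty` (an arbitrary assumed value).
    """
    scores = block_to_scores(block)
    width = len(scores)
    best = None
    for frame, protein in six_frames(dna).items():
        for start in range(len(protein) - width + 1):
            window = protein[start:start + width]
            s = sum(col.get(res, penalty) for col, res in zip(scores, window))
            if best is None or s > best[0]:
                best = (s, frame, start, window)
    return best


# Toy example: a made-up 3-segment block and a short DNA string.
toy_block = ["GKST", "GKSS", "GKTT"]
toy_dna = "CCGGTAAATCTACTAA"
hit = best_hit(toy_dna, toy_block)
print(hit[1], hit[2], hit[3])
```

Because every frame is scored independently, a conserved segment can be found without knowing gene boundaries or strand in advance, which is what makes this style of search usable on a sequence the size of a whole chromosome. A real search would also need a calibrated score threshold, which this sketch omits.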