首页> 外文会议>Workshop on Genome Informatics >Comprehensive functional identification of prokaryotic transmembrane proteins by binary topology pattern
【24h】

Comprehensive functional identification of prokaryotic transmembrane proteins by binary topology pattern

机译:二元拓扑模式综合功能鉴定原核跨膜蛋白

获取原文

摘要

The functions of more than one half of proteins in proteome are not annotated yet. The functions of transmembrane (TM) protein, which corresponds to one fourth in a proteome, is known only a little, because of difficulty in determining TM protein structure experimentally. Accordingly, a lot of efforts have been made in an attempt to predict TM topology which is considered to correspond to the fold in the case of globular protein. Because, it is known that TM protein function can be identified by its TM topology, at least roughly. We have developed a TM protein function identification method using the binary topology pattern based on the number of segments and loop length. The topology pattern is expressed as a sequence of "1", "0" and "*": "1" and"0" mean the long and short loop based on a defined threshold length, respectively, and "*" means the binary loop length is not defined. The topology pattern of a query TM protein is associated to a particular function when it corresponds to the topology pattern of a TM protein having a known function. In previous work, a common threshold length was used for all the loops. This time, we defined different threshold lengths for individual loops to improve identification accuracy. By using this method, weidentified comprehensively functions of putative TM proteins encoded within 39 microbial genomes, classifying the TM proteins into defined functional groups. We also compared the accuracy of our method in functional identification with one by BLAST.
机译:蛋白质组中超过一半的蛋白质的功能尚未注释。跨膜(TM)蛋白的功能,其对应于蛋白质组中的四分之一,仅仅是一点,因为难以在实验上确定TM蛋白质结构。因此,已经尝试预测TM拓扑的许多努力,该拓扑被认为对应于球状蛋白质的情况。因为,已知TM蛋白质功能可以通过其TM拓扑识别,至少粗略地。我们使用基于段数和循环长度的二进制拓扑模式开发了TM蛋白质功能识别方法。拓扑模式表示为“1”,“0”和“*”序列:“1”和“0”表示基于定义的阈值长度,“*”表示二进制循环长度未定义。当它对应于具有已知功能的TM蛋白的拓扑模式时,查询TM蛋白的拓扑模式与特定功能相关联。在以前的工作中,所有环路都使用公共阈值长度。这次,我们为各个环路定义了不同的阈值长度,以提高识别准确性。通过使用该方法,Weittified在39个微生物基因组内编码的推定TM蛋白的全面函数,将TM蛋白分类为定义的官能团。我们还将我们的方法的准确性与爆炸进行了一个。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号