...
首页> 外文期刊>PLoS Computational Biology >Expansion of Protein Domain Repeats
【24h】

Expansion of Protein Domain Repeats

机译:蛋白质结构域重复序列的扩展

获取原文

摘要

Many proteins, especially in eukaryotes, contain tandem repeats of several domains from the same family. These repeats have a variety of binding properties and are involved in protein–protein interactions as well as binding to other ligands such as DNA and RNA. The rapid expansion of protein domain repeats is assumed to have evolved through internal tandem duplications. However, the exact mechanisms behind these tandem duplications are not well-understood. Here, we have studied the evolution, function, protein structure, gene structure, and phylogenetic distribution of domain repeats. For this purpose we have assigned Pfam-A domain families to 24 proteomes with more sensitive domain assignments in the repeat regions. These assignments confirmed previous findings that eukaryotes, and in particular vertebrates, contain a much higher fraction of proteins with repeats compared with prokaryotes. The internal sequence similarity in each protein revealed that the domain repeats are often expanded through duplications of several domains at a time, while the duplication of one domain is less common. Many of the repeats appear to have been duplicated in the middle of the repeat region. This is in strong contrast to the evolution of other proteins that mainly works through additions of single domains at either terminus. Further, we found that some domain families show distinct duplication patterns, e.g., nebulin domains have mainly been expanded with a unit of seven domains at a time, while duplications of other domain families involve varying numbers of domains. Finally, no common mechanism for the expansion of all repeats could be detected. We found that the duplication patterns show no dependence on the size of the domains. Further, repeat expansion in some families can possibly be explained by shuffling of exons. However, exon shuffling could not have created all repeats.
机译:许多蛋白质,特别是在真核生物中,含有来自同一家族的多个结构域的串联重复序列。这些重复序列具有多种结合特性,并参与蛋白质间相互作用以及与其他配体(例如DNA和RNA)的结合。假定蛋白结构域重复序列的快速扩增是通过内部串联重复进化而来的。但是,这些串联重复背后的确切机制尚不清楚。在这里,我们研究了结构域重复序列的进化,功能,蛋白质结构,基因结构和系统发育分布。为此,我们将Pfam-A域家族分配给了24个蛋白质组,这些蛋白质组在重复区域中具有更敏感的域分配。这些发现证实了以前的发现,即与原核生物相比,真核生物,尤其是脊椎动物,含有更高比例重复蛋白质。每种蛋白质的内部序列相似性表明,结构域重复经常通过一次复制多个结构域而扩展,而一个结构域的复制则不那么常见。许多重复似乎已在重复区域的中间重复。这与其他蛋白质的进化形成强烈反差,其他蛋白质的进化主要通过在两个末端添加单个结构域起作用。此外,我们发现某些域家族显示出不同的重复模式,例如,星云素域主要一次以七个域为单位扩展,而其他域家族的重复涉及不同数量的域。最后,没有检测到所有重复序列扩展的通用机制。我们发现复制模式显示不依赖于域的大小。此外,某些家族的重复扩张可能是通过外显子改组来解释的。但是,外显子改组不可能产生所有重复。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号