Journal: Molecular genetics and genomics: MGG

Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences

Abstract

Protein evolution plays an important role in the evolution of each genome. Because proteins are functional molecules, their different parts or sites are generally under different selective constraints, particularly from purifying selection. Most previous studies of protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to how different parts within each protein of a given genome evolve. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains and unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped into two classes: SNPs occurring within protein domains and SNPs occurring within unassigned regions. With this classification, we found that the density of synonymous SNPs is significantly greater within domains than within unassigned regions, whereas the density of non-synonymous SNPs shows the opposite pattern. We also found signatures of purifying selection on both domains and unassigned regions, and the strength of selection on domains is significantly greater than that on unassigned regions. In addition, among all human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution.
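
The classification step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline. The function names (`merge_intervals`, `in_domain`, `snp_density`), the use of 1-based inclusive residue coordinates, and the choice of SNPs per residue as the density unit are all assumptions made for illustration, and the input values in the usage note are invented toy data rather than results from the paper.

```python
from bisect import bisect_right


def merge_intervals(intervals):
    """Merge overlapping or adjacent 1-based inclusive (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous interval: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [(s, e) for s, e in merged]


def in_domain(pos, merged):
    """Return True if residue position `pos` lies inside a merged interval."""
    starts = [s for s, _ in merged]
    i = bisect_right(starts, pos) - 1  # last interval starting at or before pos
    return i >= 0 and merged[i][0] <= pos <= merged[i][1]


def snp_density(protein_length, domains, snps):
    """Compute SNP density per residue for each (region, SNP type) pair.

    protein_length: protein length in residues (assumed unit).
    domains: (start, end) domain intervals, e.g. taken from a PfamA annotation.
    snps: (residue_position, kind) pairs, kind being "syn" or "nonsyn".
    """
    merged = merge_intervals(domains)
    domain_len = sum(e - s + 1 for s, e in merged)
    lengths = {"domain": domain_len, "unassigned": protein_length - domain_len}
    counts = {(region, kind): 0 for region in lengths for kind in ("syn", "nonsyn")}
    for pos, kind in snps:
        region = "domain" if in_domain(pos, merged) else "unassigned"
        counts[(region, kind)] += 1
    # A region of length zero has no defined density.
    return {
        key: counts[key] / lengths[key[0]] if lengths[key[0]] else float("nan")
        for key in counts
    }
```

For example, `snp_density(100, [(11, 30), (25, 50)], [(12, "syn"), (5, "nonsyn")])` treats residues 11 to 50 as one merged domain region of 40 residues and the remaining 60 residues as unassigned. Comparing the resulting densities across many proteins, with an appropriate statistical test, would be the next step; that test is not sketched here.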