...
首页> 外文期刊>BMC Evolutionary Biology >Transcription-related mutations and GC content drive variation in nucleotide substitution rates across the genomes of Arabidopsis thaliana and Arabidopsis lyrata
【24h】

Transcription-related mutations and GC content drive variation in nucleotide substitution rates across the genomes of Arabidopsis thaliana and Arabidopsis lyrata

机译:转录相关突变和GC含量驱动拟南芥和拟南芥基因组中核苷酸取代率的变化

获取原文

摘要

Background There has been remarkably little study of nucleotide substitution rate variation among plant nuclear genes, in part because orthology is difficult to establish. Orthology is even more problematic for intergenic regions of plant nuclear genomes, because plant genomes generally harbor a wealth of repetitive DNA. In theory orthologous intergenic data is valuable for studying rate variation because nucleotide substitutions in these regions should be under little selective constraint compared to coding regions. As a result, evolutionary rates in intergenic regions may more accurately reflect genomic features, like recombination and GC content, that contribute to nucleotide substitution. Results We generated a set of 66 intergenic sequences in Arabidopsis lyrata, a close relative of Arabidopsis thaliana. The intergenic regions included transposable element (TE) remnants and regions flanking the TEs. We verified orthology of these amplified regions both by comparison of existing A. lyrata – A. thaliana genetic maps and by using molecular features. We compared substitution rates among the 66 intergenic loci, which exhibit ~5-fold rate variation, and compared intergenic rates to a set of 64 orthologous coding sequences. Our chief observations were that the average rate of nucleotide substitution is slower in intergenic regions than in synonymous sites, that rate variation in both intergenic and coding regions correlate with GC content, that GC content alone is not sufficient to explain differences in rates between intergenic and coding regions, and that rates of evolution in intergenic regions correlate negatively with gene density. Conclusion Our observations indicated that mutation rates vary among genomics regions as a function of base composition, suggesting that previous observations of "selective constraint" on non-coding regions could more accurately be attributed to a GC effect instead of selection. The negative correlation between nucleotide substitution rate and gene density provides a potential neutral explanation for a previously documented correlation between gene density and polymorphism levels within A. thaliana. Finally, we discuss potential forces that could contribute to rapid synonymous rates, and provide evidence to suggest that transcription-related mutation contributes to rate differences between intergenic and synonymous sites.
机译:背景技术关于植物核基因之间核苷酸取代率变异的研究很少,部分是因为难以确定正畸。对于植物核基因组的基因间区域而言,正交学的问题甚至更大,因为植物基因组通常包含大量重复的DNA。从理论上讲,直系同源基因数据对于研究速率变化是有价值的,因为与编码区相比,这些区域中的核苷酸取代应受到很小的选择约束。结果,基因间区域的进化速率可以更准确地反映有助于核苷酸取代的基因组特征,例如重组和GC含量。结果我们在拟南芥(Arabidopsis thaliana)的近亲中产生了一组66个基因间序列。基因间区域包括转座因子(TE)残基和位于TEs侧翼的区域。我们通过比较现有的拟南芥-拟南芥遗传图谱和使用分子特征来验证这些扩增区域的正字法。我们比较了66个基因间位点之间的替代率,这些基因间位点表现出约5倍的速率变化,并将基因间率与一组64个直系同源编码序列进行了比较。我们的主要观察结果是,基因间区域的平均核苷酸取代速度比同义位点慢,基因间和编码区域的速度变化均与GC含量相关,仅GC含量不足以解释基因间和核苷酸之间的比率差异。编码区,以及基因间区的进化速率与基因密度负相关。结论我们的观察结果表明,基因组区域之间的突变率随碱基组成的变化而变化,这表明先前对非编码区域“选择性约束”的观察结果可以更准确地归因于GC效应而不是选择。核苷酸取代率与基因密度之间的负相关性为拟南芥中基因密度与多态性水平之间先前记录的相关性提供了潜在的中性解释。最后,我们讨论了可能促成快速同义率的潜在力量,并提供了证据表明转录相关的突变导致了基因间和同义位点之间的速率差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号