首页> 外文期刊>GigaScience >Assembly of the 373k gene space of the polyploid sugarcane genome reveals reservoirs of functional diversity in the world's leading biomass crop
【24h】

Assembly of the 373k gene space of the polyploid sugarcane genome reveals reservoirs of functional diversity in the world's leading biomass crop

机译:多倍体甘蔗基因组373k基因空间的组装揭示了世界领先的生物量农作物的功能多样性库

获取原文
           

摘要

Background Sugarcane cultivars are polyploid interspecific hybrids of giant genomes, typically with 10–13 sets of chromosomes from 2 Saccharum species. The ploidy, hybridity, and size of the genome, estimated to have 10 Gb, pose a challenge for sequencing. Results Here we present a gene space assembly of SP80-3280, including 373,869 putative genes and their potential regulatory regions. The alignment of single-copy genes in diploid grasses to the putative genes indicates that we could resolve 2–6 (up to 15) putative homo(eo)logs that are 99.1% identical within their coding sequences. Dissimilarities increase in their regulatory regions, and gene promoter analysis shows differences in regulatory elements within gene families that are expressed in a species-specific manner. We exemplify these differences for sucrose synthase (SuSy) and phenylalanine ammonia-lyase (PAL), 2 gene families central to carbon partitioning. SP80-3280 has particular regulatory elements involved in sucrose synthesis not found in the ancestor Saccharum spontaneum . PAL regulatory elements are found in co-expressed genes related to fiber synthesis within gene networks defined during plant growth and maturation. Comparison with sorghum reveals predominantly bi-allelic variations in sugarcane, consistent with the formation of 2 “subgenomes” after their divergence ~3.8–4.6 million years ago and reveals single-nucleotide variants that may underlie their differences. Conclusions This assembly represents a large step towards a whole-genome assembly of a commercial sugarcane cultivar. It includes a rich diversity of genes and homo(eo)logous resolution for a representative fraction of the gene space, relevant to improve biomass and food production.
机译:背景甘蔗品种是巨大基因组的多倍体种间杂种,通常具有来自2个蔗糖物种的10–13套染色体。基因组的倍性,杂合性和大小估计超过10 Gb,这给测序带来了挑战。结果在这里,我们介绍了SP80-3280的基因空间装配,包括373,869个推定基因及其潜在的调控区域。将二倍体草中的单拷贝基因与推定基因的比对表明,我们可以解析2–6(最多15个)推定同源(eo)基因,它们在其编码序列中具有99.1%的同一性。差异在其调控区域增加,基因启动子分析显示基因家族内调控物种以物种特异性方式表达的差异。我们举例说明了蔗糖合酶(SuSy)和苯丙氨酸氨裂合酶(PAL)这两个对碳分配至关重要的基因家族的差异。 SP80-3280具有祖先自发性蔗糖中未发现的参与蔗糖合成的特殊调节元件。在植物生长和成熟过程中定义的基因网络内,在与纤维合成有关的共表达基因中发现了PAL调控元件。与高粱的比较揭示了甘蔗中主要的双等位基因变异,与在约3.8-460万年前的两个“亚基因组”分化后形成一致,并揭示了可能是其差异的单核苷酸变异。结论该组装代表了商业甘蔗品种全基因组组装的一大进步。它包括丰富的基因多样性和基因空间代表性部分的同源(同源)分辨率,与提高生物量和粮食生产有关。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号