BMC Genomics

A conifer genomics resource of 200,000 spruce (Picea spp.) ESTs and 6,464 high-quality, sequence-finished full-length cDNAs for Sitka spruce (Picea sitchensis)



Abstract

Background: Members of the pine family (Pinaceae), especially species of spruce (Picea spp.) and pine (Pinus spp.), dominate many of the world's temperate and boreal forests. These conifer forests are of critical importance for global ecosystem stability and biodiversity. They also provide the majority of the world's wood and fiber supply and serve as a renewable resource for other industrial biomaterials. In contrast to angiosperms, functional and comparative genomics research on conifers, or other gymnosperms, is limited by the lack of a relevant reference genome sequence. Sequence-finished full-length (FL)cDNAs and large collections of expressed sequence tags (ESTs) are essential for gene discovery, functional genomics, and for future efforts of conifer genome annotation.

Results: As part of a conifer genomics program to characterize defense against insects and adaptation to local environments, and to discover genes for the production of biomaterials, we developed 20 standard, normalized or full-length enriched cDNA libraries from Sitka spruce (P. sitchensis), white spruce (P. glauca), and interior spruce (P. glauca-engelmannii complex). We sequenced and analyzed 206,875 3'- or 5'-end ESTs from these libraries, and developed a resource of 6,464 high-quality sequence-finished FLcDNAs from Sitka spruce. Clustering and assembly of 147,146 3'-end ESTs resulted in 19,941 contigs and 26,804 singletons, representing 46,745 putative unique transcripts (PUTs). The 6,464 FLcDNAs were all obtained from a single Sitka spruce genotype and represent 5,718 PUTs.

Conclusion: This paper provides detailed annotation and quality assessment of a large EST and FLcDNA resource for spruce. The 6,464 Sitka spruce FLcDNAs represent the third largest sequence-verified FLcDNA resource for any plant species, behind only rice (Oryza sativa) and Arabidopsis (Arabidopsis thaliana), and the only substantial FLcDNA resource for a gymnosperm. Our emphasis on capturing FLcDNAs and ESTs from cDNA libraries representing herbivore-, wound- or elicitor-treated induced spruce tissues, along with incorporating normalization to capture rare transcripts, resulted in a rich resource for functional genomics and proteomics studies. Sequence comparisons against five plant genomes and the non-redundant GenBank protein database revealed that a substantial number of spruce transcripts have no obvious similarity to known angiosperm gene sequences. Opportunities for future applications of the sequence and clone resources for comparative and functional genomics are discussed.
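
The counts quoted in the abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are hypothetical, the numbers are copied from the abstract, and the derived ratios are simple quotients that the authors do not report themselves.

```python
# Counts as quoted in the abstract (variable names are illustrative).
ests_total = 206_875      # 3'- or 5'-end ESTs sequenced
ests_3prime = 147_146     # 3'-end ESTs used for clustering and assembly
contigs = 19_941
singletons = 26_804
puts_reported = 46_745    # putative unique transcripts (PUTs)
flcdnas = 6_464           # sequence-finished full-length cDNAs
flcdna_puts = 5_718       # PUTs represented by the FLcDNAs

# Each contig and each singleton counts as one PUT.
puts = contigs + singletons
assert puts == puts_reported

# Derived ratios (not stated in the abstract; computed here for orientation).
frac_3prime = ests_3prime / ests_total      # share of ESTs that are 3'-end reads
ests_per_put = ests_3prime / puts           # mean 3'-end ESTs per PUT
flcdnas_per_put = flcdnas / flcdna_puts     # mean FLcDNAs per represented PUT

print(f"PUTs: {puts}")
print(f"3'-end share of ESTs: {frac_3prime:.3f}")
print(f"3'-end ESTs per PUT: {ests_per_put:.2f}")
print(f"FLcDNAs per PUT: {flcdnas_per_put:.2f}")
```

The only figure this confirms against the abstract is the PUT total (contigs plus singletons); the ratios are averages and say nothing about how ESTs are distributed across individual transcripts.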
