Journal: BMC Genomics

Comparative high-throughput transcriptome sequencing and development of SiESTa, the Silene EST annotation database



Abstract

Background The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing of cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene species and one Dianthus species as an outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database.

Results A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO) terms, and thousands of single-nucleotide polymorphisms (SNPs) were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49%) that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch.

Conclusion The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to further develop Silene as a plant model system. The genes characterized will be useful for future research not only in the species included in the present study, but also in related species for which no genomic resources are yet available. Our results demonstrate the efficiency of massively parallel transcriptome sequencing in a comparative framework as an approach for developing genomic resources in diverse groups of non-model organisms.
