...
首页> 外文期刊>BMC Plant Biology >A newly-developed community microarray resource for transcriptome profiling in Brassica species enables the confirmation of Brassica -specific expressed sequences
【24h】

A newly-developed community microarray resource for transcriptome profiling in Brassica species enables the confirmation of Brassica -specific expressed sequences

机译:一种用于芸苔属物种转录组分析的新开发的社区微阵列资源,可以确认芸苔属特异性表达序列

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Background The Brassica species include an important group of crops and provide opportunities for studying the evolutionary consequences of polyploidy. They are related to Arabidopsis thaliana , for which the first complete plant genome sequence was obtained and their genomes show extensive, although imperfect, conserved synteny with that of A. thaliana . A large number of EST sequences, derived from a range of different Brassica species, are available in the public database, but no public microarray resource has so far been developed for these species. Results We assembled unigenes using ~800,000 EST sequences, mainly from three species: B. napus , B. rapa and B. oleracea . The assembly was conducted with the aim of co-assembling ESTs of orthologous genes (including homoeologous pairs of genes in B. napus from each of the A and C genomes), but resolving assemblies of paralogous, or paleo-homoeologous, genes ( i.e . the genes related by the ancestral genome triplication observed in diploid Brassica species). 90,864 unique sequence assemblies were developed. These were incorporated into the BAC sequence annotation for the Brassica rapa Genome Sequencing Project, enabling the identification of cognate genomic sequences for a proportion of them. A 60-mer oligo microarray comprising 94,558 probes was developed using the unigene sequences. Gene expression was analysed in reciprocal resynthesised B. napus lines and the B. oleracea and B. rapa lines used to produce them. The analysis showed that significant expression could consistently be detected in leaf tissue for 35,386 unigenes. Expression was detected across all four genotypes for 27,355 unigenes, genome-specific expression patterns were observed for 7,851 unigenes and 180 unigenes displayed other classes of expression pattern. Principal component analysis (PCA) clearly resolved the individual microarray datasets for B. rapa , B. oleracea and resynthesised B. napus . Quantitative differences in expression were observed between the resynthesised B. napus lines for 98 unigenes, most of which could be classified into non-additive expression patterns, including 17 that showed cytoplasm-specific patterns. We further characterized the unigenes for which A genome-specific expression was observed and cognate genomic sequences could be identified. Ten of these unigenes were found to be Brassica -specific sequences, including two that originate from complex loci comprising gene clusters. Conclusion We succeeded in developing a Brassica community microarray resource. Although expression can be measured for the majority of unigenes across species, there were numerous probes that reported in a genome-specific manner. We anticipate that some proportion of these will represent species-specific transcripts and the remainder will be the consequence of variation of sequences within the regions represented by the array probes. Our studies demonstrated that the datasets obtained from the arrays can be used for typical analyses, including PCA and the analysis of differential expression. We have also demonstrated that Brassica -specific transcripts identified in silico in the sequence assembly of public EST database accessions are indeed reported by the array. These would not be detectable using arrays designed using A. thaliana sequences.
机译:背景技术芸苔属物种包括重要的农作物,为研究多倍体的进化结果提供了机会。它们与拟南芥有关,为此获得了第一个完整的植物基因组序列,并且它们的基因组显示出与拟南芥广泛而保守的同义同义。在公共数据库中可以找到许多来自不同芸苔属物种的EST序列,但是到目前为止,尚未为这些物种开发公共微阵列资源。结果我们使用约80万个EST序列组装了单基因,主要来自三种物种:油菜,油菜和油菜。进行组装的目的是为了共同组装直系同源基因的EST(包括来自A和C基因组的油菜双歧杆菌中的同源基因对),但解析旁系同源或古同源基因的组装(即。二倍体芸苔属中观察到的与祖先基因组三倍体相关的基因)。开发了90,864个独特的序列装配。这些被并入到芸苔基因组测序项目的BAC序列注释中,从而能够鉴定其中一部分的同源基因组序列。使用unigene序列开发了包含94,558个探针的60聚体寡核苷酸微阵列。在相互重新合成的甘蓝型油菜(B. napus)品系以及用于生产它们的油菜(B. oleracea)和B. rapa品系中分析基因表达。分析表明,可以在叶组织中持续检测到35386个单基因的显着表达。在所有四种基因型中检测到27,355个单基因的表达,观察到7,851个单基因的基因组特异性表达模式,而180个单基因显示了其他类别的表达模式。主成分分析(PCA)清楚地解析了B. rapa,B。oleracea和重新合成的B. napus的各个微阵列数据集。在重新合成的甘蓝型油菜品系中发现了98个unigenes在表达上的数量差异,其中大多数可分为非加性表达模式,包括17种显示细胞质特异性模式的表达模式。我们进一步表征了观察到基因组特异性表达并可以鉴定同源基因组序列的单基因。发现这些单基因中的十个是芸苔属特异性序列,包括两个源自包含基因簇的复杂基因座的序列。结论我们成功开发了芸苔属社区微阵列资源。尽管可以测量物种中大多数单基因的表达,但是有许多探针以基因组特异性方式报道。我们预计其中的某些比例将代表物种特异性转录本,其余的将是阵列探针代表的区域内序列变化的结果。我们的研究表明,从阵列获得的数据集可用于典型分析,包括PCA和差异表达分析。我们还证明了在阵列上确实报告了在公共EST数据库登录品的序列组装中在计算机上鉴定的芸苔属特异性转录本。使用拟南芥序列设计的阵列将无法检测到这些。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号