首页> 美国卫生研究院文献>Genetics >Expressed Sequence Tags With cDNA Termini: Previously Overlooked Resources for Gene Annotation and Transcriptome Exploration in Chlamydomonas reinhardtii
【2h】

Expressed Sequence Tags With cDNA Termini: Previously Overlooked Resources for Gene Annotation and Transcriptome Exploration in Chlamydomonas reinhardtii

机译:用cDNA Termini表达序列标签:莱茵衣藻基因注释和转录组探索的先前被忽视的资源

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Many of Chlamydomonas reinhardtii expressed sequence tags (ESTs) in GenBank dbEST and community EST assemblies were either over- or undertrimmed in terms of their cDNA termini, which are defined as the diagnostic sequence elements that delineate 3′/5′ ends of mRNA transcripts. Overtrimming represents a loss of directional, positional, and structural information of transcript ends whereas undertrimming causes unclean spurious sequences retained in ESTs that exert deleterious impacts on downstream EST-based applications. We examined 309,278 raw EST sequencing trace files of C. reinhardtii and found that only 57% had cDNA termini that matched the expected structures specified in their cDNA library constructions while satisfying our minimum length requirement for their final clean sequences. Using GMAP, 156,963 individual ESTs were mapped to the genome successfully, with their in silico-verified cDNA termini anchored to the genome. Our data analysis suggested strong macro- and microheterogeneity of 3′/5′ end positions of individual transcripts derived from the same genes in C. reinhardtii. This work annotating differential ends of individual transcripts in the draft genome presents the research community with a new stream of data that will facilitate accurate determination of gene structures, genome annotation, and exploration of the transcriptome and mRNA metabolism in C. reinhardtii.
机译:GenBank dbEST和社区EST装配中的许多衣藻衣原体表达序列标签(EST)的cDNA末端被过高或过低,其被定义为描绘mRNA转录本3'/ 5'末端的诊断序列元件。过度修剪表示转录本末端的方向,位置和结构信息的丢失,而过度修剪会导致保留在EST中的不干净的伪序列,从而对基于EST的下游应用程序产生有害影响。我们检查了309,278个莱茵衣原体的原始EST测序跟踪文件,发现只有57%的cDNA末端符合其cDNA文库构建中指定的预期结构,同时满足了我们对其最终清洁序列的最小长度要求。使用GMAP,成功将156,963个单个EST定位到基因组,并将其计算机验证的cDNA末端锚定在基因组上。我们的数据分析表明,来自莱茵衣藻中相同基因的单个转录本的3'/ 5'末端位置具有较强的宏观和微观异质性。这项注释基因组草案中各个转录物差异末端的工作为研究界提供了新的数据流,这些数据将有助于准确确定基因结构,基因组注释以及探索莱茵衣藻的转录组和mRNA代谢。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号