首页> 美国卫生研究院文献>Bioinformatics >A cautionary note for retrocopy identification: DNA-based duplication of intron-containing genes significantly contributes to the origination of single exon genes
【2h】

A cautionary note for retrocopy identification: DNA-based duplication of intron-containing genes significantly contributes to the origination of single exon genes

机译:回顾性鉴定的注意事项:含内含子基因的基于DNA的复制极大地促进了单个外显子基因的起源

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Motivation: Retrocopies are important genes in the genomes of almost all higher eukaryotes. However, the annotation of such genes is a non-trivial task. Intronless genes have often been considered to be retroposed copies of intron-containing paralogs. Such categorization relies on the implicit premise that alignable regions of the duplicates should be long enough to cover exon–exon junctions of the intron-containing genes, and thus intron loss events can be inferred. Here, we examined the alternative possibility that intronless genes could be generated by partial DNA-based duplication of intron-containing genes in the fruitfly genome.>Results: By building pairwise protein-, transcript- and genome-level DNA alignments between intronless genes and their corresponding intron-containing paralogs, we found that alignments do not cover exon–exon junctions in 40% of cases and thus no intron loss could be inferred. For these cases, the candidate parental proteins tend to be partially duplicated, and intergenic sequences or neighboring genes are included in the intronless paralog. Moreover, we observed that it is significantly less likely for these paralogs to show inter-chromosomal duplication and testis-dominant transcription, compared to the remaining 60% of cases with evidence of clear intron loss (retrogenes). These lines of analysis reveal that DNA-based duplication contributes significantly to the 40% of cases of single exon gene duplication. Finally, we performed an analogous survey in the human genome and the result is similar, wherein 34% of the cases do not cover exon–exon junctions. Thus, genome annotation for retrogene identification should discard candidates without clear evidence of intron loss.>Contact: ; >Supplementary information: are available at Bioinformatics online.
机译:>动机:复本是几乎所有高级真核生物基因组中的重要基因。但是,注释此类基因并非易事。无内含子基因通常被认为是含内含子旁系同源物的翻版副本。这种分类基于隐含的前提,即重复序列的可对齐区域应足够长,以覆盖含内含子基因的外显子-外显子连接,因此可以推断内含子丢失事件。在这里,我们研究了通过将果蝇基因组中含内含子的基因进行基于DNA的部分复制而生成无内含子基因的另一种可能性。>结果:通过建立蛋白质,转录本和基因组水平的成对表达无内含子基因及其对应的含内含子旁系同源物之间的DNA比对,我们发现在40%的情况下,比对不覆盖外显子-外显子连接,因此无法推断出内含子丢失。对于这些情况,候选亲本蛋白往往会部分复制,并且无内含子旁系同源基因中包含基因间序列或邻近基因。而且,我们观察到这些旁系同源物显示染色体间重复和睾丸显性转录的可能性要小得多,而其余60%​​的病例具有明显的内含子缺失(逆转录酶)。这些分析表明,基于DNA的复制在单外显子基因复制的40%病例中起了重要作用。最后,我们在人类基因组中进行了类似的调查,结果相似,其中34%的病例未涵盖外显子-外显子连接。因此,用于逆转录基因鉴定的基因组注释应丢弃没有明显内含子丢失证据的候选基因。>联系人:; >补充信息:可在线访问生物信息学。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号