首页> 美国卫生研究院文献>Scientific Reports >Resequencing of the common marmoset genome improves genome assemblies and gene-coding sequence analysis
【2h】

Resequencing of the common marmoset genome improves genome assemblies and gene-coding sequence analysis

机译:普通mar猴基因组的重测序可改善基因组装配和基因编码序列分析

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The first draft of the common marmoset (Callithrix jacchus) genome was published by the Marmoset Genome Sequencing and Analysis Consortium. The draft was based on whole-genome shotgun sequencing, and the current assembly version is Callithrix_jacches-3.2.1, but there still exist 187,214 undetermined gap regions and supercontigs and relatively short contigs that are unmapped to chromosomes in the draft genome. We performed resequencing and assembly of the genome of common marmoset by deep sequencing with high-throughput sequencing technology. Several different sequence runs using Illumina sequencing platforms were executed, and 181 Gbp of high-quality bases including mate-pairs with long insert lengths of 3, 8, 20, and 40 Kbp were obtained, that is, approximately 60× coverage. The resequencing significantly improved the MGSAC draft genome sequence. The N50 of the contigs, which is a statistical measure used to evaluate assembly quality, doubled. As a result, 51% of the contigs (total length: 299 Mbp) that were unmapped to chromosomes in the MGSAC draft were merged with chromosomal contigs, and the improved genome sequence helped to detect 5,288 new genes that are homologous to human cDNAs and the gaps in 5,187 transcripts of the Ensembl gene annotations were completely filled.
机译:普通mar猴(Callithrix jacchus)基因组的初稿由the猴基因组测序和分析联合会出版。该草案基于全基因组shot弹枪测序,当前的装配版本为Callithrix_jacches-3.2.1,但仍存在187,214个未确定的缺口区域和超重叠群和相对短的重叠群,它们未映射到草案基因组的染色体。我们通过高通量测序技术进行深度测序,对普通mar猴的基因组进行了重新测序和组装。使用Illumina测序平台进行了几种不同的序列运行,并获得了181 Gbp的高质量碱基,包括具有3、8、20和40 Kbp的长插入长度的伴侣对,即大约60倍的覆盖率。重测序显着改善了MGSAC草稿基因组序列。重叠群的N50(一种用于评估装配质量的统计量)增加了一倍。结果,未映射到MGSAC草图中的染色体的重叠群(全长299 Mbp)中有51%与染色体重叠群融合在一起,改进的基因组序列有助于检测5288个与人cDNA和人类cDNA同源的新基因。 Ensembl基因注释的5,187个转录本中的缺口被完全填补。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号