首页> 美国卫生研究院文献>GigaScience >De novo genome assembly of Camptotheca acuminata a natural source of the anti-cancer compound camptothecin
【2h】

De novo genome assembly of Camptotheca acuminata a natural source of the anti-cancer compound camptothecin

机译:喜树(Camtoptheca acuminata)的从头基因组组装喜树碱是抗癌化合物喜树碱的天然来源

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Camptotheca acuminata is 1 of a limited number of species that produce camptothecin, a pentacyclic quinoline alkaloid with anti-cancer activity due to its ability to inhibit DNA topoisomerase. While transcriptome studies have been performed previously with various camptothecin-producing species, no genome sequence for a camptothecin-producing species is available to date. We generated a high-quality de novo genome assembly for C. acuminata representing 403 174 860 bp on 1394 scaffolds with an N50 scaffold size of 1752 kbp. Quality assessments of the assembly revealed robust representation of the genome sequence including genic regions. Using a novel genome annotation method, we annotated 31 825 genes encoding 40 332 gene models. Based on sequence identity and orthology with validated genes from Catharanthus roseus as well as Pfam searches, we identified candidate orthologs for genes potentially involved in camptothecin biosynthesis. Extensive gene duplication including tandem duplication was widespread in the C. acuminata genome, with 2571 genes belonging to 997 tandem duplicated gene clusters. To our knowledge, this is the first genome sequence for a camptothecin-producing species, and access to the C. acuminata genome will permit not only discovery of genes encoding the camptothecin biosynthetic pathway but also reagents that can be used for heterologous expression of camptothecin and camptothecin analogs with novel pharmaceutical applications.
机译:喜树是产生喜树碱的有限种类之一,喜树碱是一种五环喹啉生物碱,由于具有抑制DNA拓扑异构酶的能力而具有抗癌活性。尽管先前已经对各种喜树碱生产物种进行了转录组研究,但迄今为止尚无关于喜树碱生产物种的基因组序列。我们为C. acuminata生成了高质量的从头基因组装配体,代表了1394个支架上的403 174 860 bp,N50支架大小为1752 kbp。大会的质量评估揭示了包括基因区域的基因组序列的鲁棒表示。使用一种新颖的基因组注释方法,我们注释了31 825个编码40 332个基因模型的基因。基于序列同一性和正交性以及来自Catharanthus roseus的经过验证的基因以及Pfam搜索,我们确定了可能与喜树碱生物合成相关的基因的候选直系同源物。广泛的基因重复包括串联重复在C. acuminata基因组中很普遍,其中2571个基因属于997个串联重复的基因簇。据我们所知,这是喜树碱生产物种的第一个基因组序列,进入尖锐梭菌基因组不仅可以发现编码喜树碱生物合成途径的基因,还可以发现可用于喜树碱和喜树碱异源表达的试剂。喜树碱类似物具有新颖的药物应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号