Genetics and Molecular Research

Identification of genes encoding hypothetical proteins in open-reading frame expressed sequence tags from mammalian stages of Trypanosoma cruzi

Abstract

Approximately 50% of the predicted protein-coding genes of the Trypanosoma cruzi CL Brener strain are annotated as hypothetical or conserved hypothetical proteins. To further characterize these genes, we generated 1161 open-reading frame expressed sequence tags (ORESTES) from the mammalian stages of the VL10 human strain. Sequence clustering resulted in 435 clusters, consisting of 339 singletons and 96 contigs. Significant matches to the T. cruzi predicted gene database were found for ~94% of the contigs and ~69% of the singletons. These included genes encoding surface proteins, which are known to be intensely expressed in the parasite's mammalian stages and are implicated in host cell invasion and/or immune evasion mechanisms. Among the 151 contigs and singletons with similarity to predicted hypothetical or conserved hypothetical protein-coding genes, 83% showed no match with T. cruzi EST and/or proteome databases. These ORESTES are the first experimental evidence that the corresponding genes are in fact transcribed. Sequences with no significant match were searched against several T. cruzi and National Center for Biotechnology Information non-redundant sequence databases. The ORESTES analysis indicated that 124 predicted conserved hypothetical protein-coding genes and 27 predicted hypothetical protein-coding genes annotated in the CL Brener genome are transcribed in the VL10 mammalian stages. Six ORESTES annotated as hypothetical protein-coding genes and showing no match to EST and/or proteome databases were confirmed by Northern blot in VL10. The generation of this set of ORESTES complements the T. cruzi genome annotation and suggests new stage-regulated genes encoding hypothetical proteins.
