首页> 外文期刊>Microbiology >Genome update: the 1000th genome – a cautionary tale
【24h】

Genome update: the 1000th genome – a cautionary tale

机译:基因组更新:1000个基因组 - 一种警示故事

获取原文
       

摘要

There are now more than 1000 sequenced prokaryotic genomes deposited inpublic databases and available for analysis. Currently, although the sequencedatabases GenBank, DNA Database of Japan and EMBL are synchronized continually,there are slight differences in content at the genomes level for a varietyof logistical reasons, including differences in format and loading errors,such as those caused by file transfer protocol interruptions. This means thatthe 1000th genome will be different in the various databases. Some of thedata on the highly accessed web pages are inaccurate, leading to false conclusionsfor example about the largest bacterial genome sequenced. Biological diversityis far greater than many have thought. For example, analysis of multiple Escherichia coli genomes has led to an estimate of around 45 000 genefamilies — more genes than are recognized in the human genome. Moreover,of the 1000 genomes available, not a single protein is conserved across allgenomes. Excluding the members of the Archaea, only a total of fourgenes are conserved in all bacteria: two protein genes and two RNA genes.
机译:现在有超过1000个测序的原核基因组,存放在公共数据库中,可用于分析。目前,虽然日本和embl的DNA数据库,日本和embl的DNA数据库不断同步,但基因组水平的内容略有差异,以各种后勤原因,包括格式和加载错误的差异,例如由文件传输协议中断引起的差异。这意味着在各种数据库中,1000个基因组将不同。高度访问的网页上的一些TheData是不准确的,导致关于最大细菌基因组测序的例子的错误结论。生物多样性远远超过许多人的想法。例如,对多个大肠杆菌基因组的分析导致约45000个基因组的估计 - 比人类基因组在人类基因组中识别的更多基因。此外,在可获得的1000个基因组中,在allgenomes中没有保守单个蛋白质。不包括古代成员,所有细菌中只保守了四根蛋白质:两种蛋白质基因和两个RNA基因。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号