Journal: Nucleic Acids Research

Loose ends: almost one in five human genes still have unresolved coding status



Abstract

Seventeen years after the sequencing of the human genome, the human proteome is still under revision. One in eight of the 22 210 coding genes listed by the Ensembl/GENCODE, RefSeq and UniProtKB reference databases is annotated differently across the three sets. We have carried out an in-depth investigation of the 2764 genes classified as coding by one or more sets of manual curators and as non-coding by others. Data from large-scale genetic variation analyses suggest that most are not under protein-like purifying selection and so are unlikely to code for functional proteins. A further 1470 genes annotated as coding in all three reference sets have characteristics that are typical of non-coding genes or pseudogenes. These potential non-coding genes also appear to be undergoing neutral evolution and have considerably less supporting transcript and protein evidence than other coding genes. We believe that the three reference databases currently overestimate the number of human coding genes by at least 2000, complicating and adding noise to large-scale biomedical experiments. Determining which potential non-coding genes do not code for proteins is a difficult but vitally important task, since the human reference proteome is a fundamental pillar of most basic research and supports almost all large-scale biomedical projects.
