Journal: Bioinformatics

PhyloCSF: a comparative genomics method to distinguish protein coding and non-coding regions



Abstract

Motivation: As high-throughput transcriptome sequencing provides evidence for novel transcripts in many species, there is a renewed need for accurate methods to classify small genomic regions as protein coding or non-coding. We present PhyloCSF, a novel comparative genomics method that analyzes a multispecies nucleotide sequence alignment to determine whether it is likely to represent a conserved protein-coding region, based on a formal statistical comparison of phylogenetic codon models.

Results: We show that PhyloCSF's classification performance in 12-species Drosophila genome alignments exceeds that of all other methods we compared in a previous study. We anticipate that this method will be widely applicable as the transcriptomes of many additional species, tissues and subcellular compartments are sequenced, particularly in the context of ENCODE and modENCODE, and as interest grows in long non-coding RNAs, which are often initially recognized by their lack of protein-coding potential rather than by conserved RNA secondary structures.
