首页> 外文期刊>RNA >RNAcode: robust discrimination of coding and noncoding regions in comparative sequence data.
【24h】

RNAcode: robust discrimination of coding and noncoding regions in comparative sequence data.

机译:RNA码:比较序列数据中编码区和非编码区的鲁棒区分。

获取原文
获取原文并翻译 | 示例
           

摘要

With the availability of genome-wide transcription data and massive comparative sequencing, the discrimination of coding from noncoding RNAs and the assessment of coding potential in evolutionarily conserved regions arose as a core analysis task. Here we present RNAcode, a program to detect coding regions in multiple sequence alignments that is optimized for emerging applications not covered by current protein gene-finding software. Our algorithm combines information from nucleotide substitution and gap patterns in a unified framework and also deals with real-life issues such as alignment and sequencing errors. It uses an explicit statistical model with no machine learning component and can therefore be applied "out of the box," without any training, to data from all domains of life. We describe the RNAcode method and apply it in combination with mass spectrometry experiments to predict and confirm seven novel short peptides in Escherichia coli and to analyze the coding potential of RNAs previously annotated as "noncoding." RNAcode is open source software and available for all major platforms at http://wash.github.com/rnacode.
机译:随着全基因组转录数据和大量比较测序的可用性,将区分非编码RNA的编码和评估进化保守区域中的编码潜力作为核心分析任务。在这里,我们介绍RNAcode,该程序可检测多个序列比对中的编码区,该程序针对当前蛋白质基因发现软件未涵盖的新兴应用进行了优化。我们的算法在一个统一的框架中结合了来自核苷酸取代和缺口模式的信息,还处理了比对和测序错误等现实问题。它使用没有机器学习组件的显式统计模型,因此无需任何培训即可“开箱即用”应用于来自生活各个领域的数据。我们描述了RNAcode方法,并将其与质谱实验结合使用,以预测和确认大肠杆菌中的七个新型短肽,并分析先前标注为“非编码”的RNA的编码潜力。 RNAcode是开源软件,可用于所有主要平台,网址为http://wash.github.com/rnacode。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号