首页> 外文期刊>Nucleic Acids Research >Integration of mapped RNA-Seq reads into automatic training of eukaryotic gene finding algorithm
【24h】

Integration of mapped RNA-Seq reads into automatic training of eukaryotic gene finding algorithm

机译:将映射的RNA-Seq读码整合到真核基因发现算法的自动训练中

获取原文
获取原文并翻译 | 示例
           

摘要

We present a new approach to automatic training of a eukaryotic ab initio gene finding algorithm. With the advent of Next-Generation Sequencing, automatic training has become paramount, allowing genome annotation pipelines to keep pace with the speed of genome sequencing. Earlier we developed GeneMark-ES, currently the only gene finding algorithm for eukaryotic genomes that performs automatic training in unsupervised ab initio mode. The new algorithm, GeneMark-ET augments GeneMark-ES with a novel method that integrates RNA-Seq read alignments into the self-training procedure. Use of 'assembled' RNA-Seq transcripts is far from trivial; significant error rate of assembly was revealed in recent assessments. We demonstrated in computational experiments that the proposed method of incorporation of 'unassembled' RNA-Seq reads improves the accuracy of gene prediction; particularly, for the 1.3 GB genome of Aedes aegypti the mean value of prediction Sensitivity and Specificity at the gene level increased over GeneMark-ES by 24.5%. In the current surge of genomic data when the need for accurate sequence annotation is higher than ever, GeneMark-ET will be a valuable addition to the narrow arsenal of automatic gene prediction tools.
机译:我们提出了一种自动训练真核生物从头算基因寻找算法的新方法。随着下一代测序的出现,自动训练已变得至关重要,这使得基因组注释流程可以跟上基因组测序的速度。较早之前,我们开发了GeneMark-ES,这是目前唯一用于真核基因组的基因发现算法,可以在无监督的从头开始模式下进行自动训练。新算法GeneMark-ET通过将RNA-Seq读取比对整合到自训练程序中的新方法增强了GeneMark-ES。使用“组装的” RNA-Seq转录本并非易事。最近的评估显示出明显的组装错误率。我们在计算实验中证明,提出的“未组装的” RNA-Seq读取方法的整合可提高基因预测的准确性;特别是,对于埃及伊蚊的1.3 GB基因组,在基因水平上预测敏感性和特异性的平均值比GeneMark-ES高出24.5%。在当前的基因组数据激增时,对精确序列注释的需求比以往任何时候都高,GeneMark-ET将成为自动基因预测工具的缩影。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号