Venue: IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology

Assembly independent functional annotation of short-read data using SOFA: Short-ORF functional annotation

Abstract

Despite the rise of next-generation sequencing technologies, the microbial communities driving matter and energy transformations in complex ecosystems such as soils cannot yet be described accurately using assembly-based approaches. Here we present SOFA, an open-source pipeline enabling comparative functional annotation of unassembled short-read data. The pipeline attempts to merge mate pairs in FASTQ files, predicts open reading frames (ORFs) on merged and unmerged reads as short as 70 bp, and completes an additional step we term 'deduplication'. Deduplication prevents the double counting of ORFs predicted from unmerged paired-end reads by checking for homologous annotations that span the same ORF, allowing for quantitatively accurate predictions. The effectiveness of SOFA is validated with both simulated and bona fide soil metagenomes, and empirical results are compared to existing strategies for obtaining accurate ORF counts and to an analytical model of read duplication. SOFA enables downstream processing stages within the existing MetaPathways pipeline and is available for download as a standalone application at https://github.com under the MIT license.
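
The deduplication idea described in the abstract can be illustrated with a minimal sketch. This is not SOFA's actual implementation: the `OrfHit` record, the `deduplicated_counts` function, and the example identifiers are all hypothetical, and the sketch assumes that two ORFs from the mates of an unmerged pair "span the same ORF" whenever they carry the same annotation target, which is a simplification of whatever homology check the pipeline performs.

```python
from collections import namedtuple

# Hypothetical record: one annotated ORF predicted from a single read.
# pair_id identifies the read pair, mate is 1 or 2, and target is the
# reference annotation the ORF was matched to.
OrfHit = namedtuple("OrfHit", ["pair_id", "mate", "target"])


def deduplicated_counts(hits):
    """Count ORF annotations per target, counting each (pair, target) once.

    Assumption: if both mates of an unmerged pair hit the same target,
    they come from the same underlying ORF and must not be counted twice.
    """
    seen = set()
    counts = {}
    for hit in hits:
        key = (hit.pair_id, hit.target)
        if key in seen:
            continue  # the other mate already contributed this annotation
        seen.add(key)
        counts[hit.target] = counts.get(hit.target, 0) + 1
    return counts


hits = [
    OrfHit("pair1", 1, "geneA"),
    OrfHit("pair1", 2, "geneA"),  # same annotation from the other mate
    OrfHit("pair2", 1, "geneA"),
    OrfHit("pair2", 2, "geneB"),  # mates hit different targets: both count
]
counts = deduplicated_counts(hits)
print(counts)
```

Without the `seen` check, `geneA` would be counted three times instead of twice, which is the double counting that the abstract says deduplication is meant to prevent.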
