Journal: Applications in Plant Sciences

HybPiper: Extracting Coding Sequence and Introns for Phylogenetics from High-Throughput Sequencing Reads Using Target Enrichment

Abstract

Premise of the study: Using sequence data generated via target enrichment for phylogenetics requires reassembly of high-throughput sequence reads into loci, presenting a number of bioinformatics challenges. We developed HybPiper as a user-friendly platform for assembly of gene regions, extraction of exon and intron sequences, and identification of paralogous gene copies. We test HybPiper using baits designed to target 333 phylogenetic markers and 125 genes of functional significance in Artocarpus (Moraceae).

Methods and Results: HybPiper implements parallel execution of sequence assembly in three phases: read mapping, contig assembly, and target sequence extraction. The pipeline was able to recover nearly complete gene sequences for all genes in 22 species of Artocarpus. HybPiper also recovered more than 500 bp of nontargeted intron sequence in over half of the phylogenetic markers and identified paralogous gene copies in Artocarpus.

Conclusions: HybPiper was designed for Linux and Mac OS X and is freely available at https://github.com/mossmatters/HybPiper.