首页> 外文期刊>Applied Microbiology >DECIPHER, a Search-Based Approach to Chimera Identification for 16S rRNA Sequences
【24h】

DECIPHER, a Search-Based Approach to Chimera Identification for 16S rRNA Sequences

机译:DECIPHER,一种基于搜索的16S rRNA序列嵌合体鉴定方法

获取原文
       

摘要

DECIPHER is a new method for finding 16S rRNA chimeric sequences by the use of a search-based approach. The method is based upon detecting short fragments that are uncommon in the phylogenetic group where a query sequence is classified but frequently found in another phylogenetic group. The algorithm was calibrated for full sequences (fs_DECIPHER) and short sequences (ss_DECIPHER) and benchmarked against WigeoN (Pintail), ChimeraSlayer, and Uchime using artificially generated chimeras. Overall, ss_DECIPHER and Uchime provided the highest chimera detection for sequences 100 to 600 nucleotides long (79% and 81%, respectively), but Uchime's performance deteriorated for longer sequences, while ss_DECIPHER maintained a high detection rate (89%). Both methods had low false-positive rates (1.3% and 1.6%). The more conservative fs_DECIPHER, benchmarked only for sequences longer than 600 nucleotides, had an overall detection rate lower than that of ss_DECIPHER (75%) but higher than those of the other programs. In addition, fs_DECIPHER had the lowest false-positive rate among all the benchmarked programs (<0.20%). DECIPHER was outperformed only by ChimeraSlayer and Uchime when chimeras were formed from closely related parents (less than 10% divergence). Given the differences in the programs, it was possible to detect over 89% of all chimeras with just the combination of ss_DECIPHER and Uchime. Using fs_DECIPHER, we detected between 1% and 2% additional chimeras in the RDP, SILVA, and Greengenes databases from which chimeras had already been removed with Pintail or Bellerophon. DECIPHER was implemented in the R programming language and is directly accessible through a webpage or by downloading the program as an R package (http://DECIPHER.cee.wisc.edu).
机译:DECIPHER是通过使用基于搜索的方法来查找16S rRNA嵌合序列的新方法。该方法基于检测在系统发育组中不常见的短片段,其中对查询序列进行分类但在另一个系统发育组中经常发现。该算法针对完整序列(fs_DECIPHER)和短序列(ss_DECIPHER)进行了校准,并使用人工生成的嵌合体针对WigeoN(Pintail),ChimeraSlayer和Uchime进行了基准测试。总体而言,ss_DECIPHER和Uchime对长度为100至600个核苷酸的序列提供了最高的嵌合体检测(分别为79%和81%),但对于更长的序列,Uchime的性能却有所下降,而ss_DECIPHER则保持了较高的检测率(89%)。两种方法的假阳性率均较低(1.3%和1.6%)。较为保守的fs_DECIPHER(仅针对长度超过600个核苷酸的序列进行了基准测试)的总体检测率低于ss_DECIPHER(75%),但高于其他程序。此外,在所有基准测试程序中,fs_DECIPHER的假阳性率最低(<0.20%)。当嵌合体由亲缘关系密切的亲代组成时,DECIPHER的表现仅优于ChimeraSlayer和Uchime(差异小于10%)。鉴于程序的差异,仅通过ss_DECIPHER和Uchime的组合就可以检测出超过89%的嵌合体。使用fs_DECIPHER,我们在RDP,SILVA和Greengenes数据库中检测到了1%到2%的其他嵌合体,这些数据库中已经使用Pintail或Bellerophon去除了嵌合体。 DECIPHER是用R编程语言实现的,可以直接通过网页或通过将程序下载为R包(http://DECIPHER.cee.wisc.edu)来访问。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号