首页> 外文期刊>Nucleic Acids Research >TAREAN: a computational tool for identification and characterization of satellite DNA from unassembled short reads
【24h】

TAREAN: a computational tool for identification and characterization of satellite DNA from unassembled short reads

机译:陷良:用于识别和表征卫星DNA的计算工具,来自无组织短读数

获取原文
获取原文并翻译 | 示例
           

摘要

Satellite DNA is one of the major classes of repetitive DNA, characterized by tandemly arranged repeat copies that form contiguous arrays up to megabases in length. This type of genomic organization makes satellite DNA difficult to assemble, which hampers characterization of satellite sequences by computational analysis of genomic contigs. Here, we present tandem repeat analyzer (TAREAN), a novel computational pipeline that circumvents this problem by detecting satellite repeats directly from unassembled short reads. The pipeline first employs graph-based sequence clustering to identify groups of reads that represent repetitive elements. Putative satellite repeats are subsequently detected by the presence of circular structures in their cluster graphs. Consensus sequences of repeat monomers are then reconstructed from the most frequent k-mers obtained by decomposing read sequences from corresponding clusters. The pipeline performance was successfully validated by analyzing low-pass genome sequencing data from five plant species where satellite DNA was previously experimentally characterized. Moreover, novel satellite repeats were predicted for the genome of Vicia faba and three of these repeats were verified by detecting their sequences on metaphase chromosomes using fluorescence in situ hybridization.
机译:卫星DNA是重复性DNA的主要类别之一,其特征在于串联排列的重复拷贝,其长度形成典型的载体。这种类型的基因组组织使卫星DNA难以组装,其通过基因组Contig的计算分析妨碍了卫星序列的表征。在这里,我们呈现串联重复分析仪(Tarean),通过直接从未组装的短读取来检测卫星重复来缩短这个问题的新型计算管道。管道首先采用基于图形的序列聚类,以识别代表重复元素的读取组。随后通过它们的簇图中存在圆形结构来检测推定的卫星重复。然后通过通过分解来自相应簇的读取序列获得的最常见的K-MERS重建重复单体的共有序列。通过分析来自五种植物物种的低通基因组测序数据,成功验证了管道性能,其中卫星DNA先前进行了实验表征。此外,预测了新的卫星重复对vicia faba的基因组,并且通过在原位杂交中检测它们的中期染色体上的序列来验证这些重复的三种。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号