G3: Genes, Genomes, Genetics

BlobToolKit – Interactive Quality Assessment of Genome Assemblies

Abstract

Reconstruction of target genomes from sequence data produced by instruments that are agnostic as to the species-of-origin may be confounded by contaminant DNA. Contaminants may be introduced during sample processing or co-extracted alongside the target DNA; if insufficient care is taken during the assembly process, the final assembled genome may be a mixture of data from several species. Such assemblies can confound sequence-based biological inference and, when deposited in public databases, may be included in downstream analyses by users unaware of underlying problems. We present BlobToolKit, a software suite to aid researchers in identifying and isolating non-target data in draft and publicly available genome assemblies. BlobToolKit can be used to process assembly, read and analysis files for fully reproducible interactive exploration in the browser-based Viewer. BlobToolKit can be used during assembly to filter non-target DNA, helping researchers produce assemblies with high biological credibility. We have been running an automated BlobToolKit pipeline on eukaryotic assemblies publicly available in the International Nucleotide Sequence Database Collaboration and are making the results available through a public instance of the Viewer. We aim to complete analysis of all publicly available genomes and then keep pace with the flow of new genomes. We have worked to embed these views into the presentation of genome assemblies at the European Nucleotide Archive, providing an indication of assembly quality alongside the public record with links out to allow full exploration in the Viewer.