Journal: Journal of Computational Biology: A Journal of Computational Molecular Cell Biology

EasyQC: Tool with Interactive User Interface for Efficient Next-Generation Sequencing Data Quality Control



Abstract

The advent of next-generation sequencing (NGS) technologies has revolutionized genomic research. Millions of sequences are generated in a short period of time, providing researchers with intriguing insights. Many NGS platforms have evolved over time, and their efficiency has steadily increased. Still, primarily because of the sequencing chemistry, glitches in the sequencing machine, and human handling errors, some artifacts tend to remain in the final sequence data set. These sequence errors have a profound impact on downstream analyses and may provide misleading information. Hence, filtering out these erroneous reads has become essential, and a myriad of tools are available for this purpose. However, many of them are accessible through a command-line interface that requires the user to enter each command manually. Here, we report EasyQC, a tool for NGS data quality control (QC) with a graphical user interface that provides options to trim NGS reads based on quality, length, homopolymers, and ambiguous bases. EasyQC also includes a format converter, a paired-end merger, an adapter trimmer, and a graph generator that produces quality distribution, length distribution, GC content, and base composition graphs. Comparison of raw and processed sequence data sets using EasyQC suggested a significant increase in the overall quality of the sequences. In tests with NGS data sets on a standalone desktop, EasyQC proved to be relatively fast. EasyQC is developed using Perl modules and runs on Windows and Linux platforms. With its various QC features, easy interface for end users, and cross-platform compatibility, EasyQC would be a valuable addition to the existing tools, facilitating better downstream analyses.
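The four trimming criteria named in the abstract (quality, length, homopolymers, and ambiguous bases) can be illustrated with a minimal Python sketch. This is not EasyQC's Perl implementation; the function names, the Phred+33 encoding, and all threshold defaults are assumptions chosen only for illustration.

```python
import re

PHRED_OFFSET = 33  # assumption: Sanger / Illumina 1.8+ quality encoding


def mean_quality(qual):
    """Mean Phred score of a FASTQ quality string."""
    if not qual:
        return 0.0
    return sum(ord(c) - PHRED_OFFSET for c in qual) / len(qual)


def trim_3prime(seq, qual, min_q=20):
    """Drop trailing bases whose Phred score is below min_q."""
    end = len(seq)
    while end > 0 and ord(qual[end - 1]) - PHRED_OFFSET < min_q:
        end -= 1
    return seq[:end], qual[:end]


def passes_filters(seq, qual, min_len=30, min_mean_q=20,
                   max_homopolymer=8, max_n=0):
    """Return True if a read survives all four hypothetical filters."""
    seq = seq.upper()
    if len(seq) < min_len:                  # length filter
        return False
    if mean_quality(qual) < min_mean_q:     # quality filter
        return False
    if seq.count("N") > max_n:              # ambiguous-base filter
        return False
    # Homopolymer filter: reject any run longer than max_homopolymer.
    run = re.search(r"([ACGTN])\1{%d,}" % max_homopolymer, seq)
    return run is None


def qc_read(seq, qual):
    """Trim the 3' end, then filter; return the trimmed read or None."""
    seq, qual = trim_3prime(seq, qual)
    return (seq, qual) if passes_filters(seq, qual) else None
```

For example, `qc_read("ACGT" * 10, "I" * 38 + "##")` trims the two low-quality trailing bases and keeps the remaining 38-base read, whereas a read consisting of a single repeated base is rejected by the homopolymer filter.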
