BLASTGrabber: A bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data

Abstract

Background: Advances in sequencing efficiency have vastly increased the sizes of biological sequence databases, which now include many thousands of genome-sequenced species. The BLAST algorithm remains the main search engine for retrieving sequence information, and must consequently handle data on an unprecedented scale. This has been possible due to high-performance computers and parallel processing. However, the raw BLAST output from contemporary searches involving thousands of queries is ill-suited for direct human processing. Few programs attempt to directly visualize and interpret BLAST output; those that do often provide only a basic structuring of BLAST data.

Results: Here we present a bioinformatics application named BLASTGrabber suitable for high-throughput sequencing analysis. BLASTGrabber, being implemented as a Java application, is OS-independent and includes a user-friendly graphical user interface. Text or XML-formatted BLAST output files can be directly imported, displayed and categorized based on BLAST statistics. Query names and FASTA headers can be analysed by text-mining. In addition to visualizing sequence alignments, BLAST data can be ordered as an interactive taxonomy tree. All modes of analysis support selection, export and storage of data. A Java interface-based plugin structure facilitates the addition of customized third-party functionality.

Conclusion: The BLASTGrabber application introduces new ways of visualizing and analysing massive BLAST output data by integrating taxonomy identification, text-mining capabilities and generic multi-dimensional rendering of BLAST hits. The program is aimed at an audience without expert computer skills; the combination of new functionalities makes the program flexible and useful for a broad range of operations.
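
To make the idea of categorizing imported hits by BLAST statistics concrete, the following is a minimal sketch in Python. It is not BLASTGrabber's implementation (the tool itself is a Java application). It assumes the legacy BLAST XML layout (element names such as Iteration, Hit and Hsp_evalue), and the sample data, the function names read_hits and bin_by_evalue, and the e-value cutoffs are all hypothetical choices made for illustration.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hand-written sample in the legacy BLAST XML layout; all values are made up.
SAMPLE_XML = """<BlastOutput>
<BlastOutput_iterations>
<Iteration>
  <Iteration_query-def>query_1</Iteration_query-def>
  <Iteration_hits>
    <Hit>
      <Hit_def>illustrative subject A</Hit_def>
      <Hit_hsps>
        <Hsp><Hsp_bit-score>250.0</Hsp_bit-score><Hsp_evalue>1e-60</Hsp_evalue></Hsp>
        <Hsp><Hsp_bit-score>40.0</Hsp_bit-score><Hsp_evalue>0.002</Hsp_evalue></Hsp>
      </Hit_hsps>
    </Hit>
    <Hit>
      <Hit_def>illustrative subject B</Hit_def>
      <Hit_hsps>
        <Hsp><Hsp_bit-score>80.0</Hsp_bit-score><Hsp_evalue>3e-15</Hsp_evalue></Hsp>
      </Hit_hsps>
    </Hit>
  </Iteration_hits>
</Iteration>
<Iteration>
  <Iteration_query-def>query_2</Iteration_query-def>
  <Iteration_hits>
    <Hit>
      <Hit_def>illustrative subject C</Hit_def>
      <Hit_hsps>
        <Hsp><Hsp_bit-score>30.0</Hsp_bit-score><Hsp_evalue>0.5</Hsp_evalue></Hsp>
      </Hit_hsps>
    </Hit>
  </Iteration_hits>
</Iteration>
</BlastOutput_iterations>
</BlastOutput>"""


def read_hits(xml_text):
    """Return (query, subject, best e-value) for every hit in the XML text."""
    root = ET.fromstring(xml_text)
    hits = []
    for iteration in root.iter("Iteration"):
        query = iteration.findtext("Iteration_query-def")
        for hit in iteration.iter("Hit"):
            # A hit may have several HSPs; keep the lowest (best) e-value.
            evalues = [float(h.findtext("Hsp_evalue")) for h in hit.iter("Hsp")]
            if evalues:
                hits.append((query, hit.findtext("Hit_def"), min(evalues)))
    return hits


def bin_by_evalue(hits, cutoffs=(1e-50, 1e-10, 1e-3)):
    """Group hits under the first cutoff they satisfy (cutoffs are illustrative)."""
    bins = defaultdict(list)
    for query, subject, evalue in hits:
        label = next(
            (f"<= {c:g}" for c in cutoffs if evalue <= c),
            f"> {cutoffs[-1]:g}",  # fallback bin for weak hits
        )
        bins[label].append((query, subject))
    return dict(bins)


HITS = read_hits(SAMPLE_XML)
BINS = bin_by_evalue(HITS)
```

Each bin can then be inspected or exported on its own, which mirrors in a very reduced form the select-and-export workflow the abstract describes. Other statistics, such as bit score, could be binned the same way.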