...
首页> 外文期刊>Journal of computational and theoretical nanoscience >Big Data Analytics Framework for Real-Time Genome Analysis: A Comprehensive Approach
【24h】

Big Data Analytics Framework for Real-Time Genome Analysis: A Comprehensive Approach

机译:实时基因组分析的大数据分析框架:全面的方法

获取原文
获取原文并翻译 | 示例

摘要

Big Data Technologies are well-accepted in the recent years in bio-medical and genome informatics. They are capable to process gigantic and heterogeneous genome information with good precision and recall. With the quick advancements in computation and storage technologies, the costof acquiring and processing the genomic data has decreased significantly. The upcoming sequencing platforms will produce vast amount of data, which will imperatively require high-performance systems for on-demand analysis with time-bound efficiency. Recent bio-informatics tools are capableof utilizing the novel features of Hadoop in a more flexible way. In particular, big data technologies such as MapReduce and Hive are able to provide high-speed computational environment for the analysis of petabyte scale datasets. This has attracted the focus of bio-scientists to use thebig data applications to automate the entire genome analysis. The proposed framework is designed over MapReduce and Java on extended Hadoop platform to achieve the parallelism of Big Data Analysis. It will assist the bioinformatics community by providing a comprehensive solution for Descriptive,Comparative, Exploratory, Inferential, Predictive and Causal Analysis on Genome data. The proposed framework is user-friendly, fully-customizable, scalable and fit for comprehensive real-time genome analysis from data acquisition till predictive sequence analysis.
机译:近年来在生物医学和基因组信息学中近年来,大数据技术在近年来。他们能够以良好的精度和召回地处理巨大和异质基因组信息。随着计算和存储技术的快速进步,获取和处理基因组数据的成本显着下降。即将到来的序列平台将产生大量数据,这将需要具有带有时间束缚效率的按需分析的高性能系统。最近的生物信息工具有能力利用Hadoop的新功能以更灵活的方式。特别是,MapReduce和Hive等大数据技术能够为Petabyte刻度数据集进行高速计算环境。这引起了生物科学家的焦点来利用博格数据应用来自动化整个基因组分析。所提出的框架是在Greated Hadoop平台上的MapReduce和Java设计,以实现大数据分析的并行性。它将通过为基因组数据提供全面的解决方案来帮助生物信息学社区提供全面的解决方案。拟议的框架是用户友好的,完全可定制,可扩展的,可伸缩的,可用于从数据采集到预测序列分析的综合实时基因组分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号