Conference: International Convention on Information and Communication Technology, Electronics and Microelectronics

Cloudflow - A framework for MapReduce pipeline development in Biomedical Research



Abstract

The data-driven parallelization framework Hadoop MapReduce allows large data sets to be analysed in a scalable way. Because developing MapReduce programs can be time-intensive and challenging, the use of Hadoop in biomedical research is still limited. Here we present Cloudflow, a high-level framework that hides the implementation details of Hadoop and provides a set of building blocks for creating biomedical pipelines in a more intuitive way. We demonstrate the benefit of Cloudflow on three different genetic use cases. We also show how the framework can be combined with the Hadoop workflow system Cloudgene and the cloud orchestration platform CloudMan to provide Hadoop pipelines as a service to everyone.
