首页> 外文会议>International Conference on Informatics, Health Technology >BIG-BIO: - big data hadoop-based analytic cluster framework for bioinformatics
【24h】

BIG-BIO: - big data hadoop-based analytic cluster framework for bioinformatics

机译:Big-Bio: - 基于大数据的BioInformatics的分析群框架

获取原文

摘要

Big volume of bioinformatics data needs high processing powers. BIG-BIO is one of the solutions for addressing these challenges. BIG-BIO is a big data analyst MapReduce Hadoop Cluster for Bioinformatics applications. BIG-BIO is tested by implementing the bioinformatics wordcount problem and its applications using MapReduce programming pattern. BIG-BIO counts the number of occurrence of each word in a text and extracts unique words in molecular sequences. The application is characterized by almost low-weight computation on big size data sets. Performance of BIG-BIO is tested. BIG-BIO framework could analyze bioinformatics big data faster and more efficient. It has many bioinformatics applications while maintaining good processing capabilities, scalability and ease of maintenance by using cheap commodity. BIG-BIO reduces the processing time for parallel bioinformatics algorithms compared to legacy serial and MPI based applications. Testing BIG-BIO stated that it scales automatically with size of data. BIG-BIO can be portable on many Hadoop infrastructures without modification, with accelerating data-intensive bioinformatics analysis.
机译:大量的生物信息学数据需要高处理能力。 Big-Bio是解决这些挑战的解决方案之一。 Big-Bio是BioInformatics应用程序的大数据分析师MapReduce Hadoop集群。通过使用MapReduce编程模式实现生物信息化字数问题及其应用来测试Big-Bio。 Big-Bio在文本中计算每个单词的发生次数,并在分子序列中提取独特的单词。该应用的特征在于对大尺寸数据集几乎低重量的计算。测试大生物的表现。 Big-Bio框架可以分析生物信息学的大数据更快,更高效。它具有许多生物信息学应用,同时通过使用廉价的商品保持良好的处理能力,可扩展性和易于维护。与传统串行和MPI的应用相比,Big-BIO减少了并行生物信息学算法的处理时间。测试Big-Bio表示它自动缩放数据大小。 Big-Bio可以在许多Hadoop基础架构上便携,无需修改,加速数据密集型生物信息学分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号