首页> 外文期刊>Journal of supercomputing >Towards a framework for large-scale multimedia data storage and processing on Hadoop platform
【24h】

Towards a framework for large-scale multimedia data storage and processing on Hadoop platform

机译:转向在Hadoop平台上进行大规模多媒体数据存储和处理的框架

获取原文
获取原文并翻译 | 示例
           

摘要

Cloud computing techniques take the form of distributed computing by utilizing multiple computers to execute computing simultaneously on the service side. To process the increasing quantity of multimedia data, numerous large-scale multimedia data storage computing techniques in the cloud computing have been developed. Of all the techniques, Hadoop plays a key role in the cloud computing. Hadoop, a computing cluster formed by low-priced hardware, can conduct the parallel computing of petabytes of multimedia data. Hadoop features high-reliability, high-efficiency, and high-scalability. The numerous large-scale multimedia data computing techniques include not only the key core techniques, Hadoop and MapReduce, but also the data collection techniques, such as File Transfer Protocol and Flume. In addition, distributed system configuration allocation, automatic installation, and monitoring platform building and management techniques are all included. As a result, only with the integration of all the techniques, a reliable large-scale multimedia data platform can be offered. In this paper, we introduce how cloud computing can make a breakthrough by proposing a multimedia social network dataset on Hadoop platform and implementing a prototype version. Detailed specifications and design issues are discussed as well. An important finding of this article is that we can save more time if we conduct the multimedia social networking analysis using Cloud Hadoop Platform rather than using a single computer. The advantages of cloud computing over the traditional data processing practices are fully demonstrated in this article. The applicable framework designs and the tools available for the large-scale data processing are also proposed. We show the experimental multimedia data including data sizes and processing time.
机译:云计算技术通过利用多台计算机在服务端同时执行计算来采取分布式计算的形式。为了处理越来越多的多媒体数据,已经开发了云计算中的许多大规模多媒体数据存储计算技术。在所有技术中,Hadoop在云计算中扮演着关键角色。 Hadoop是由低价硬件组成的计算集群,可以并行处理PB级的多媒体数据。 Hadoop具有高可靠性,高效率和高可扩展性。众多大型多媒体数据计算技术不仅包括关键核心技术Hadoop和MapReduce,还包括数据收集技术,例如文件传输协议和Flume。此外,还包括分布式系统配置分配,自动安装以及监视平台构建和管理技术。结果,只有集成所有技术,才能提供可靠的大规模多媒体数据平台。在本文中,我们介绍了如何通过在Hadoop平台上提出多媒体社交网络数据集并实现原型版本来实现云计算的突破。还讨论了详细的规格和设计问题。本文的一个重要发现是,如果我们使用Cloud Hadoop Platform而不是使用单台计算机进行多媒体社交网络分析,则可以节省更多时间。本文充分展示了云计算相对于传统数据处理实践的优势。还提出了适用于大规模数据处理的适用框架设计和工具。我们展示了实验性多媒体数据,包括数据大小和处理时间。

著录项

  • 来源
    《Journal of supercomputing》 |2014年第1期|488-507|共20页
  • 作者单位

    Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan, ROC;

    Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan, ROC;

    Department of Computer Science and Information Engineering, National llan University, llan, Taiwan, ROC;

    Department of Computer Science and Software Engineering, Monmouth University, W. Long Branch, Monmouth, NJ 07764, USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Cloud computing; Hadoop; MapReduce; BigTable; High performance computing;

    机译:云计算;Hadoop;MapReduce;大表;高性能计算;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号