MPLEX: In-Situ Big Data Processing with Compute-Storage Multiplexing

机译：MPLEX：使用计算 - 存储复用的原位大数据处理

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Cloud-based services are increasingly popular for big data analytics due to the flexibility, scalability, and cost-effectiveness of provisioning elastic resources on-demand. However, data analytics-as-a-service suffers from the overheads of data movement between compute and storage clusters, due to their decoupled architecture in existing cloud infrastructure. In this work, we propose a novel approach of in-situ big data processing on cloud storage by dynamically offloading data-intensive jobs from compute cluster to storage cluster, and improve job throughput. However, it is challenging to achieve this goal since introducing additional workload on the storage cluster can significantly impact interactive web requests that fetch cloud storage data, with strict SLA (service-level agreement) for tail latency. In this work, we present MPLEX, a system that augments data analytics-as-a-service by efficiently multiplexing compute and storage cluster to improve job throughput without violating the SLA of cloud storage service in terms of tail response time. It applies an SLA-aware opportunistic job scheduling technique supported by a machine learning based prediction model to exploit the dynamic workload conditions in the compute, and storage cluster. Performance evaluations on an OpenStack Swift cluster, and an OpenStack based virtual cluster of Hadoop VMs built atop NSFCloud's Chameleon testbed show that MPLEX improves the Hadoop job throughput by up to 1.7X, while maintaining the SLA for cloud storage service requests.

机译：由于灵活性，可伸缩性和供应需求的弹性资源的速度，成本效益，基于云的服务越来越受到大数据分析的流行。然而，由于现有云基础设施的解耦架构，数据分析 - AS-AS-AS-AS-Servers遭受了计算和存储集群之间的数据移动的开销。在这项工作中，通过将计算群集的数据密集型作业动态卸载到存储群集，提出了一种新的云存储原位大数据处理方法，并提高作业吞吐量。但是，实现这一目标是挑战，因为在存储群集中引入额外的工作负载可以显着影响获取云存储数据的交互式Web请求，具有严格的SLA（服务级协议）进行尾延迟。在这项工作中，我们目前通过有效地复用计算和存储群集来增强数据分析 - AS-Service的系统，以改善作业吞吐量，而无需违反尾部响应时间的云存储服务的SLA。它适用于基于机器学习的预测模型支持的SLA感知机会作业调度技术，以利用计算和存储群集中的动态工作负载条件。在OpenStack Swift集群上的性能评估，以及NSFCloud的Chameleon的Chameleon测试机顶op VM的基于OpenStack虚拟群集显示，MPLEX将Hadoop作业吞吐量提高到1.7倍，同时维护云存储服务请求的SLA。

著录项

来源
《IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems》|2017年|264p|共10页
会议地点
作者
Joy Rahman; Palden Lama;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词
Cloud computing; Throughput; Time factors; Computational modeling; Big Data; Multiplexing; Predictive models;

机译：云计算;吞吐量;时间因素;计算建模;大数据;复用;预测模型;

相似文献

外文文献
中文文献
专利

1. SCANRAW: A Database Meta-Operator for Parallel In-Situ Processing and Loading [J] . Cheng Yu, Rusu Florin ACM transactions on database systems . 2015,第3期

机译：SCANRAW：用于并行原位处理和加载的数据库元运算符
2. IN-SITU TEST OF PRESSURE PIPELINE VIBRATION BASED ON DATA ACQUISITION AND SIGNAL PROCESSING [J] . Huimin Hou, Cundong Xu, Hui Liu, International Journal on Smart Sensing and Intelligent Systems . 2015,第1期

机译：基于数据采集和信号处理的压力管道振动原位测试
3. An Extended Kalman filtering-based method of processing reflectometry data for fast in-situ etch rate measurements [J] . Vincent T.L., Khargonekar P.P. IEEE Transactions on Semiconductor Manufacturing . 1997,第1期

机译：基于扩展卡尔曼滤波的处理反射法数据的方法，用于快速原位蚀刻速率测量
4. MPLEX: In-Situ Big Data Processing with Compute-Storage Multiplexing [C] . Joy Rahman, Palden Lama 2017 IEEE 25th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems . 2017

机译：MPLEX：具有计算存储多路复用的现场大数据处理
5. Assessing The Integration and Pre-Processing of Neon Airborne Remote Sensing and in-Situ Data for Optimal Tree Species Classification [D] . Scholl, Victoria. 2019

机译：评估霓虹灯机载遥感和原位数据的集成和预处理，以获得最佳树种分类
6. In-situ synchrotron X-ray diffraction data for the dynamic reaction processes between titanium and air under laser irradiation [O] . Congyuan Zeng, Hao Wen, Hong Yao, 2020

机译：钛和空气在激光辐照下动态反应过程的原位同步加速器X射线衍射数据
7. In-Situ Process Monitoring in Additive Manufacturing Using Statistics and Pre-Process Data [O] . Eva Maria Scheideler, Andrea Huxol 2020

机译：使用统计数据和预处理数据的添加剂制造的原位过程监测
8. Airborne Flight Test Data Multiplexing and Real-Time Processing System~Technicalpub [R] . Brandt, P. 1993

机译：机载飞行试验数据复用和实时处理系统~Technicalpub

MPLEX: In-Situ Big Data Processing with Compute-Storage Multiplexing

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅