首页> 外文学位 >Framework for Large Data Processing under Constrained Resources
【24h】

Framework for Large Data Processing under Constrained Resources

机译:资源受限的大数据处理框架

获取原文
获取原文并翻译 | 示例

摘要

Data processing is used to uncover, transform, and classify information inside of data. Data-intensive research topics, such as environmental parameter prediction and sensor data imputation, require abundant computing power. To process big data efficiently, a server cluster is used for most cases. On one hand, a more powerful server cluster should be better. On the other hand, the powerful cluster will require a greater budget. "How to balance this tradeoff" is a challenge. Another challenge is how to improve communication between different nodes in a server cluster. The communication is usually through network and transportation speed is very slow.;In this thesis, we propose a data processing framework that can provide stable service with a limited budget. "Stable" service means the average waiting time and queue length do not change massively. The key of this framework control strategy is to import budget and local server computing power concepts into the M/M/1/1/infinity/infinity queue model. To tackle the data communication challenge, data is compressed before transportation and decompressed when it arrives at its destination. An improved compression algorithm is proposed for this data transportation workflow, which leverages multiple GPUs and, to the best of our knowledge, is much faster than most other algorithms. Three data processing services that rely on the proposed framework are also presented in detail, to illustrate and prove the capabilities of our solution.
机译:数据处理用于发现,转换和分类数据内部的信息。诸如环境参数预测和传感器数据插补等数据密集型研究主题需要强大的计算能力。为了有效处理大数据,大多数情况下使用服务器群集。一方面,功能更强大的服务器群集应该更好。另一方面,强大的集群将需要更大的预算。 “如何平衡这种权衡”是一个挑战。另一个挑战是如何改善服务器群集中不同节点之间的通信。通信通常是通过网络进行的,运输速度很慢。本文提出了一种可以在预算有限的情况下提供稳定服务的数据处理框架。 “稳定”服务意味着平均等待时间和队列长度不会发生很大变化。该框架控制策略的关键是将预算和本地服务器计算能力概念导入M / M / 1/1 /无限/无限队列模型。为了解决数据通信难题,数据在传输前先进行压缩,然后在到达目的地时进行解压缩。针对此数据传输工作流程,提出了一种改进的压缩算法,该算法利用多个GPU,据我们所知,它比大多数其他算法快得多。还详细介绍了三种依赖于提出的框架的数据处理服务,以说明和证明我们解决方案的功能。

著录项

  • 作者

    Wu, Rui.;

  • 作者单位

    University of Nevada, Reno.;

  • 授予单位 University of Nevada, Reno.;
  • 学科 Computer engineering.;Computer science.
  • 学位 Ph.D.
  • 年度 2018
  • 页码 124 p.
  • 总页数 124
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号