首页> 外文会议>International Conference on Computing and Communications Technologies >Job starvation avoidance with alleviation of data skewness in Big Data infrastructure
【24h】

Job starvation avoidance with alleviation of data skewness in Big Data infrastructure

机译:作业饥饿避免了大数据基础设施中的数据偏差

获取原文

摘要

During the age of rush in the need for big data, Hadoop is a postulate or cloud-based platform that has been heavily encouraged for all solutions in the business world's big data problems. Parallel execution of jobs consists of large data sets is done through map reduce in the hadoop cluster. The completion of job time will depend on the slowest running task in the job. The entire job is extended if one particular job takes longer time to finish and it is done by the delayer. An inequality in the measure of data allocated to each individual task is referred to as Data skewness. An efficient dynamic data splitting approach on Hadoop called the Hybrid scheduler who monitors the samples while running batch jobs and allocates resources to slaves depending on the complexity of data and the time taken for processing. In this paper, the effectiveness of web swarming is showcased using hadoop eliminating Distributed Denial of Service (DDoS) attack detection scenarios in the Web servers. Query processing is done through Map Reduce in traditional Hadoop clusters and is replaced by the proposed Block chain query processing algorithm. Thereby improvise the execution time of the assigned task in the proposed system to mitigate the data skewness. The main aim of this paper is to avoid job starvation thus minimizing the response time efficiently during the process and mitigating data skewness in existing system.
机译:在需要大数据时急于,Hadoop是一个基于假设或基于云的平台,对商业世界的大数据问题的所有解决方案进行了严重鼓励。作业的并行执行由Hadoop集群中的MAP减少完成大数据集。完成作业的完成将取决于作业中最慢的运行任务。如果一个特定作业需要更长时间才能完成,则会扩展整个作业,并且由延迟器完成。分配给每个任务的数据量度的不等式被称为数据偏差。 Hadoop上的高效动态数据拆分方法称为混合调度程序,在运行批处理作业时监视样本并根据数据的复杂性和处理所需的时间,将资源分配给从设备。在本文中,使用Hadoop消除Web服务器中的分布式拒绝服务(DDOS)攻击检测方案的分布式拒绝检测方案来展示Web蜂拥而至的有效性。查询处理通过地图减少在传统的Hadoop集群中完成,并由所提出的块链查询处理算法取代。因此,在所提出的系统中即可提高所分配的任务的执行时间来减轻数据偏差。本文的主要目的是避免作业饥饿,从而在过程和减轻现有系统中的数据偏差期间有效地最小化响应时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号