首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Online Scheduling and Interference Alleviation for Low-Latency, High-Throughput Processing of Data Streams
【24h】

Online Scheduling and Interference Alleviation for Low-Latency, High-Throughput Processing of Data Streams

机译:数据流低延迟,高吞吐量处理的在线调度和干扰缓解

获取原文
获取原文并翻译 | 示例
           

摘要

Data Streams occur naturally in several observational settings and often need to be processed with a low latency. Streams pose unique challenges: they have no preset lifetimes, the traffic on these streams may be bursty, and data arrival rates on these streams can be quite high. Furthermore, stream processing computations are generally stateful where the outcome of processing a data stream packet depends on the state that builds up within the computation over multiple, successive rounds of execution. As the number of streams increases, stream processing computations need to be orchestrated over a collection of machines. Achieving timeliness and high throughput in such settings is a challenge. Optimal scheduling of stream processing computations is an instance of the resource constrained scheduling problem, and depending on the precise formulation of the problem can be characterized as either NP-Complete or NP-Hard. We have designed an algorithm for online scheduling of stream processing computations. Our algorithm focuses on reducing interference that adversely impacts performance of stream processing computations. Our measure of interference is based on stream packet arrivals at a particular machine, the accompanying resource utilization encompassing CPU, memory and network utilization, and the resource utilization at machines comprising the cluster. Our algorithm performs continuous, incremental detection of interference experienced by computations and performing migrations to alleviate them.
机译:数据流在几种观测环境中自然发生,并且通常需要以低延迟进行处理。流带来了独特的挑战:它们没有预设的生存期,这些流上的流量可能很突发,并且这些流上的数据到达率可能很高。此外,流处理计算通常是有状态的,其中处理数据流包的结果取决于在多个连续的执行回合中在计算内建立的状态。随着流数量的增加,需要在一组机器上协调流处理计算。在这样的环境中实现及时性和高吞吐量是一个挑战。流处理计算的最佳调度是资源受限调度问题的一个实例,根据问题的精确表述,可以将其描述为NP-Complete或NP-Hard。我们设计了一种用于在线调度流处理计算的算法。我们的算法着重于减少对流处理计算性能产生不利影响的干扰。我们对干扰的测量基于流数据包到达特定计算机的情况,伴随的资源利用率包括CPU,内存和网络利用率,以及组成集群的计算机的资源利用率。我们的算法对计算遇到的干扰进行连续的增量检测,并进行迁移以减轻干扰。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号