Computing the Throughput of Probabilistic and Replicated Streaming Applications

Anne Benoit; Matthieu Gallet; Bruno Gaujal; Yves Robert

首页> 外文期刊>Algorithmica >Computing the Throughput of Probabilistic and Replicated Streaming Applications

【24h】

Computing the Throughput of Probabilistic and Replicated Streaming Applications

机译：计算概率和复制流应用程序的吞吐量

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we investigate how to compute the throughput of probabilistic and replicated streaming applications. We are given (ⅰ) a streaming application whose dependence graph is a linear chain; (ⅱ) a one-to-many mapping of the application onto a fully heterogeneous target platform, where a processor is assigned at most one application stage, but where a stage can be replicated onto a set of processors; and (ⅲ) a set of random variables modeling the computation and communication times in the mapping. We show how to compute the throughput of the application, i.e., the rate at which data sets can be processed. The problem is easy when application stages are not replicated, i.e., each application stage is assigned to a single processor: in that case the throughput is dictated by the critical hardware resource. However, when stages are replicated, i.e., each application stage may be assigned to several processors, the problem becomes surprisingly complicated: even in the deterministic case, the optimal throughput may be lower than the smallest internal resource throughput. The first contribution of the paper is to provide a general method to compute the throughput when computation and communication times, also called stage parameters, are constant or follow I.I.D. exponential laws. The second contribution is to provide bounds for the throughput when stage parameters form associated random sequences (correlation between communication and processing times of a given data set on the different application stages, i.e., a data set that takes a long time on the first stage is likely to be large, and to take a long time on the next stages), and are N.B.U.E. (New Better than Used in Expectation) variables (if an operation has already been processed for some duration, the remaining time is smaller than the processing time of a fresh operation): the throughput is bounded from below by the exponential case and bounded from above by the deterministic case. An extensive set of simulation allows us to assess the quality of the model, and to observe the actual behavior of several distributions.

机译：在本文中，我们研究了如何计算概率和复制流应用程序的吞吐量。我们给（ⅰ）一个流应用程序，它的依赖图是线性链；（ⅱ）应用程序到完全异构目标平台的一对多映射，在该目标平台上最多为一个应用程序阶段分配一个处理器，但可以将一个阶段复制到一组处理器上；（ⅲ）一组随机变量，用于模拟映射中的计算和通信时间。我们展示了如何计算应用程序的吞吐量，即可以处理数据集的速率。当不复制应用程序阶段时，即将每个应用程序阶段分配给单个处理器时，此问题就很容易：在这种情况下，吞吐量由关键硬件资源决定。但是，当复制阶段时，即每个应用程序阶段可能分配给多个处理器，问题就变得异常复杂：即使在确定性的情况下，最佳吞吐量也可能低于最小的内部资源吞吐量。本文的第一点是提供一种通用的方法来计算吞吐量，当计算和通信时间（也称为阶段参数）恒定或遵循I.I.D.指数定律。第二个贡献是当阶段参数形成关联的随机序列时为吞吐量提供界限（不同应用阶段上给定数据集的通信和处理时间之间的相关性，即在第一阶段花费较长时间的数据集是可能很大，并且在下一阶段需要花费很长时间），并且是NBUE （比预期中使用的新的更好）变量（如果某个操作已经处理了一段时间，则剩余时间小于新操作的处理时间）：吞吐量受指数情况的限制，从下至上根据确定性情况。大量的模拟使我们能够评估模型的质量，并观察几种分布的实际行为。

著录项

来源
《Algorithmica 》 |2014年第4期| 925-957| 共33页
作者
Anne Benoit; Matthieu Gallet; Bruno Gaujal; Yves Robert;
展开▼
作者单位

ENS Lyon, LIP Laboratory and INRIA, CNRS, UCBL, Lyon, France;

ENS Lyon, LIP Laboratory and INRIA, CNRS, UCBL, Lyon, France;

INRIA, Grenoble, France;

ENS Lyon, LIP Laboratory and INRIA, CNRS, UCBL, Lyon, France;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Lr-Stream: Using latency and resource aware scheduling to improve latency and throughput for streaming applications [J] . Dawei Sun, Hanyu He, Hongbin Yan, Future generation computer systems . 2021 ,第Jana期

机译：LR-Stream：使用延迟和资源意识调度来提高流媒体应用程序的延迟和吞吐量
2. High-Throughput Computing Versus High-Performance Computing forn Groundwater Applications [J] . Fienen Michael N., Hunt Randall J. Ground water . 2015 ,第2期

机译：地下水应用中的高通量计算与高性能计算
3. Performance-aware deployment of streaming applications in distributed stream computing systems [J] . Sun Dawei, Gao Shang, Liu Xunyun, International Journal of Bio-Inspired Computation . 2020 ,第1期

机译：分布式流计算系统中流式传输应用程序的性能感知部署
4. Computing the Throughput of Probabilistic and Replicated Streaming Applications [C] . Anne Benoit, Fanny Dufosse, Matthieu Gallet, Proceedings of the Twenty-Second annual symposium on parallelism in algorithms and architectures . 2010

机译：计算概率和复制流应用程序的吞吐量
5. Scheduling Edge and In-Transit Computing Resources for Stream Processing Applications [D] . Zamani Zadeh Najari, Ali Reza. 2019

机译：流处理应用程序的调度边缘和传输计算资源
6. Computing a Synthetic Chronic Psychosocial Stress Measurement in Multiple Datasets and its Application in the Replication of GxE Interactions of the EBF1 Gene [O] . Abanish Singh, Michael A. Babyak, Beverly H. Brummett, -1

机译：计算多个数据集中的慢性慢性心理压力综合测量及其在EBF1基因GxE相互作用复制中的应用
7. Exploiting Loop-Carried Stream Reuse for Scientific Computing Applications on the Stream Processor [O] . Weixia XU, Qiang DOU, Ying ZHANG, 2010

机译：在流处理器上利用循环传输流进行科学计算应用的重用

Computing the Throughput of Probabilistic and Replicated Streaming Applications

摘要

著录项

相似文献

相关主题

期刊订阅