Detecting anomalies in high-performance parallel programs

Abstract

Message passing interface (MPI) is an effective programming technique for implementing parallel programs for distributed computation. As these applications run, a number of different types of irregularities can occur including those that result from intrusions, user misbehavior, corrupted data, deadlocks or failure of cluster components. We perform a comparison of different artificial intelligence (AI) techniques that can be used to implement a lightweight monitoring and detection system for parallel applications on a cluster of Linux workstations. We study the accuracy and performance of deterministic and stochastic algorithms when we observe the flow of function library and OS system calls of parallel programs written with MPI. We demonstrate that monitoring of MPI programs can be achieved with high accuracy and in some cases with a 0% false positive rate in real-time, and we show that the added computational load on each node is small. Finally we demonstrate that simple deterministic methods perform poorly when the program flow grows in size and variety, and that more complex methods are required.
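
As a rough illustration of what a simple deterministic method over a call stream could look like, the sketch below records every fixed-length window of calls seen in normal runs and scores a new trace by the fraction of its windows that were never seen. This is not taken from the paper: the sliding-window lookup, the window length `n`, the call names in the usage example, and the function names `windows`, `train`, and `mismatch_rate` are all assumptions made for illustration.

```python
from typing import Iterable, List, Set, Tuple

Window = Tuple[str, ...]


def windows(trace: List[str], n: int) -> Iterable[Window]:
    """Yield every contiguous run of n calls in the trace."""
    for i in range(len(trace) - n + 1):
        yield tuple(trace[i:i + n])


def train(normal_traces: Iterable[List[str]], n: int) -> Set[Window]:
    """Collect all length-n windows observed in traces of normal runs."""
    seen: Set[Window] = set()
    for trace in normal_traces:
        seen.update(windows(trace, n))
    return seen


def mismatch_rate(trace: List[str], seen: Set[Window], n: int) -> float:
    """Fraction of the trace's windows that never appeared in training."""
    ws = list(windows(trace, n))
    if not ws:
        # Trace shorter than the window: nothing to compare.
        return 0.0
    return sum(1 for w in ws if w not in seen) / len(ws)


if __name__ == "__main__":
    # Hypothetical traces mixing MPI library calls and an OS system call.
    normal = [["MPI_Init", "MPI_Send", "MPI_Recv", "MPI_Finalize"]]
    seen = train(normal, n=2)
    suspect = ["MPI_Init", "MPI_Send", "execve", "MPI_Finalize"]
    print(mismatch_rate(normal[0], seen, n=2))
    print(mismatch_rate(suspect, seen, n=2))
```

A trace would be flagged when its mismatch rate exceeds a chosen threshold. Under this scheme any legitimate window missing from the training traces counts as a mismatch, which is one plausible reason a lookup of this kind could struggle as program flow grows in size and variety.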

Bibliographic details

Source
Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on | 2004 | pp. 30-34 | 5 pages
Authors
Florez G.; Liu Z.; Bridges S.; Vaughn R.; Skjellum A.
Original format: PDF
Chinese Library Classification: Radio electronics, telecommunications technology
Keywords
message passing; parallel programming; workstation clusters; neural nets; hidden Markov models; deterministic algorithms; Unix; artificial intelligence; application program interfaces; system monitoring; anomaly detection; high-performance parallel programs; message passing interface; distributed computation; artificial intelligence techniques; lightweight monitoring detection system; parallel applications; Linux workstation clusters; stochastic algorithms; function library; OS system calls; MPI program monitoring

Similar literature

1. Detection of High-Level Synchronization Anomalies in Parallel Programs [J]. Ali Jannesari. International Journal of Parallel Programming. 2015, No. 4
2. Makespan minimization for two parallel machines scheduling with a periodic availability constraint: Mathematical programming model, average-case analysis, and anomalies [J]. Dehua Xu, Dar-Li Yang. Applied Mathematical Modelling. 2013, No. 14-15
3. Detecting Concurrency Anomalies in Transactional Memory Programs [J]. João Lourenço, Ricardo Dias. Computer Science and Information Systems. 2011, No. 2
4. Detecting anomalies in high-performance parallel programs [C]. Florez G., Liu Z., Bridges S. International Conference on Information Technology Coding and Computing. 2004
5. Easy PRAM-based high-performance parallel programming [D]. Ghanim, Fady Ahmad Abdalrahim. 2016
6. EEGgui: a program used to detect electroencephalogram anomalies after traumatic brain injury [O]. Justin Sick, Eric Bray, Amade Bregy. 2013
7. Easy PRAM-based High-performance Parallel Programming with ICE [O]. Ghanim, Fady; Barua, Rajeev; Vishkin, Uzi. 2016
