Scaling Irregular Applications through Data Aggregation and Software Multithreading

Abstract

Emerging applications in areas such as bioinformatics, data analytics, semantic databases and knowledge discovery employ datasets ranging from tens to hundreds of terabytes. Currently, only distributed memory clusters have enough aggregate space to enable in-memory processing of datasets of this size. However, in addition to their large size, the data structures used by these new application classes are usually characterized by unpredictable, fine-grained accesses; that is, they exhibit irregular behavior. Traditional commodity clusters, by contrast, rely on cache-based processors and high-bandwidth networks optimized for locality, regular computation and bulk communication. For these reasons, irregular applications are inefficient on these systems and require custom, hand-coded optimizations to scale in both performance and size. Two techniques are key to improving the performance of large-scale irregular applications on commodity clusters: lightweight software multithreading, which tolerates data access latencies by overlapping network communication with computation, and aggregation, which reduces overheads and increases bandwidth utilization by coalescing fine-grained network messages. In this paper we describe GMT (Global Memory and Threading), a runtime system library that couples software multithreading and message aggregation with a Partitioned Global Address Space (PGAS) data model to enable higher performance and scaling of irregular applications on multi-node systems. We present the architecture of the runtime, explaining how it is designed around these two critical techniques. We show that irregular applications written using our runtime can outperform, even by orders of magnitude, the corresponding applications written using other programming models that do not exploit these techniques.

Bibliographic details

Source: IEEE International Parallel and Distributed Processing Symposium, 2014, pp. 1126-1135 (10 pages)
Authors: Morari Alessandro; Tumeo Antonino; Chavarria-Miranda Daniel; Villa Oreste; Valero Mateo
Original format: PDF
Keywords: Multithreading; PGAS; aggregation; semantic graph databases

Similar literature

1. Asynchronous and multithreaded communications on irregular applications using vectorized divide and conquer approach [J]. Loïc Thébault, Eric Petit. Journal of Parallel and Distributed Computing. 2018, issue APRa
2. Designing Next-Generation Massively Multithreaded Architectures for Irregular Applications [J]. Tumeo Antonino, Secchi Simone, Villa Oreste. Computer. 2012, issue 8
3. Hoard: A Scalable Memory Allocator for Multithreaded Applications [J]. Emery D. Berger, Kathryn S. McKinley, Robert D. Blumofe. ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages. 2000, issue 11
4. Scaling Irregular Applications through Data Aggregation and Software Multithreading [C]. Morari Alessandro, Tumeo Antonino, Chavarria-Miranda Daniel. IEEE International Parallel and Distributed Processing Symposium. 2014
5. Accelerating Irregular Applications Using Latency Masking Multithreaded Techniques [D]. Budhkar, Prerna. 2018
6. NSeq: a multithreaded Java application for finding positioned nucleosomes from sequencing data [O]. Abhinav Nellore, Konstantin Bobkov, Elizabeth Howe. 2012
7. Scalable Multithreaded Algorithms for Mutable Irregular Data with Application to Anisotropic Mesh Adaptivity [O]. Rokos Georgios. 2015
