Scalable Multithreaded Algorithms for Mutable Irregular Data with Application to Anisotropic Mesh Adaptivity

Abstract

Anisotropic mesh adaptation is a powerful way to directly minimise the computational cost of mesh based simulation. It is particularly important for multi-scale problems where the required number of floating-point operations can be reduced by orders of magnitude relative to more traditional static mesh approaches. Increasingly, finite element/volume codes are being optimised for modern multicore architectures. Inter-node parallelism for mesh adaptivity has been successfully implemented by a number of groups using domain decomposition methods. However, thread-level parallelism using programming models such as OpenMP is significantly more challenging because the underlying data structures are extensively modified during mesh adaptation and a greater degree of parallelism must be realised while keeping the code race-free.

In this thesis we describe a new thread-parallel implementation of four anisotropic mesh adaptation algorithms, namely edge coarsening, element refinement, edge swapping and vertex smoothing. For each of the mesh optimisation phases we describe how safe parallel execution is guaranteed by processing workitems in batches of independent sets and using a deferred-operations strategy to update the mesh data structures in parallel without data contention. Scalable execution is further assisted by creating worklists using atomic operations, which provides a synchronisation-free alternative to reduction-based worklist algorithms. Additionally, we compare graph colouring methods for the creation of independent sets and present an improved version which can run up to 50% faster than existing techniques. Finally, we describe some early work on an interrupt-driven work-sharing for-loop scheduler which is shown to perform better than existing work-stealing schedulers.

Combining all aforementioned novel techniques, which are generally applicable to other unordered irregular problems, we show that despite the complex nature of mesh adaptation and inherent load imbalances, we achieve a parallel efficiency of 60% on an 8-core Intel(R) Xeon(R) Sandy Bridge and 40% using 16 cores on a dual-socket Intel(R) Xeon(R) Sandy Bridge ccNUMA system.
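
The idea of processing workitems in batches of independent sets can be illustrated with a minimal sketch. This is not the colouring method developed in the thesis. It is a plain sequential greedy colouring, and the names `greedy_colouring`, `independent_batches` and the small `mesh` graph are hypothetical, chosen only for illustration. Vertices that receive the same colour share no edge, so each colour class forms one batch that a thread-parallel code could process without neighbouring updates conflicting.

```python
from collections import defaultdict


def greedy_colouring(adjacency):
    """Give each vertex the smallest colour not used by its coloured neighbours."""
    colour = {}
    for v in sorted(adjacency):
        used = {colour[n] for n in adjacency[v] if n in colour}
        c = 0
        while c in used:
            c += 1
        colour[v] = c
    return colour


def independent_batches(adjacency):
    """Group vertices by colour; no two vertices in a batch are adjacent."""
    batches = defaultdict(list)
    for v, c in greedy_colouring(adjacency).items():
        batches[c].append(v)
    return [batches[c] for c in sorted(batches)]


# Hypothetical mesh graph: a square 0-1-2-3 with a centre vertex 4.
mesh = {
    0: [1, 3, 4],
    1: [0, 2, 4],
    2: [1, 3, 4],
    3: [0, 2, 4],
    4: [0, 1, 2, 3],
}

for batch in independent_batches(mesh):
    # Vertices within one batch share no edge, so they could be updated
    # concurrently; the batches themselves run one after another.
    print(batch)
```

For this example graph the batches are `[0, 2]`, `[1, 3]` and `[4]`: opposite corners of the square can be handled together, while the centre vertex, which touches every corner, ends up in a batch of its own.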

Bibliographic details

Author: Rokos Georgios
Year: 2015
Original format: PDF

Similar literature

1. Method for the explicit insertion of microstructure in Cellular Automata Finite Element (CAFE) models based on an irregular tetrahedral Finite Element mesh: Application in a multi-scale Finite Element Microstructure MEshfree framework (FEMME) [J]. Saucedo-Mora Luis, Marrow Thomas James. Finite Elements in Analysis and Design. 2015, issue nova1.

2. Using the FEM Meshes Adaption and Genetic Algorithms for Identification of Permeability in Normal Direction of Anisotropic Sheets [J]. Komeza K., Napieralska Juszczak E., Di Barba P. IEEE Transactions on Magnetics. 2012, issue 2.

3. Performance evaluation of adaptive meshing algorithms for fluorescence diffuse optical tomography using experimental data [J]. Lu Zhou, Birsen Yazici, Angelique B. F. Ale. Optics Letters. 2010, issue 22.

4. Scaling Irregular Applications through Data Aggregation and Software Multithreading [C]. Morari Alessandro, Tumeo Antonino, Chavarria-Miranda Daniel. IEEE International Parallel and Distributed Processing Symposium. 2014.

5. Adaptive locomotion algorithms for hexapod walking machines to autonomously negotiate irregular large-scale obstacles [D]. Kau, Chin-Cheng. 1989.

6. An anisotropic scale-invariant unstructured mesh generator suitable for volumetric imaging data [O]. Andrew P. Kuprat, Daniel R. Einstein.

7. Design and implementation of band rejected antennas using adaptive surface meshing and genetic algorithms methods. Simulation and measurement of microstrip antennas with the ability of harmonic rejection for wireless and mobile applications including the antenna design optimisation using genetic algorithms. [O]. Binmelha Mohammed Saeed. 2013.
