Facilitating High Performance Code Parallelization.

机译：促进高性能代码并行化。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the surge of social media on one hand and the ease of obtaining information due to cheap sensing devices and open source APIs on the other hand, the amount of data that can be processed is as well vastly increasing. In addition, the world of computing has recently been witnessing a growing shift towards massively parallel distributed systems due to the increasing importance of transforming data into knowledge in today's data-driven world. At the core of data analysis for all sorts of applications lies pattern matching. Therefore, parallelizing pattern matching algorithms should be made efficient in order to cater to this ever-increasing abundance of data. We propose a method that automatically detects a user's single threaded function call to search for a pattern using Java's standard regular expression library, and replaces it with our own data parallel implementation using Java bytecode injection. Our approach facilitates parallel processing on different platforms consisting of shared memory systems (using multithreading and NVIDIA GPUs) and distributed systems (using MPI and Hadoop). The major contributions of our implementation consist of reducing the execution time while at the same time being transparent to the user. In addition to that, and in the same spirit of facilitating high performance code parallelization, we present a tool that automatically generates Spark Java code from minimal user-supplied inputs. Spark has emerged as the tool of choice for efficient big data analysis. However, users still have to learn the complicated Spark API in order to write even a simple application. Our tool is easy to use, interactive and offers Spark's native Java API performance. To the best of our knowledge and until the time of this writing, such a tool has not been yet implemented.

机译：一方面由于社交媒体的激增，另一方面由于廉价的传感设备和开源API使得获取信息变得容易，可处理的数据量也大大增加。此外，由于在当今数据驱动的世界中，将数据转换为知识的重要性日益提高，因此计算机世界近来正在向大规模并行分布式系统发展。模式匹配是各种应用程序数据分析的核心。因此，应该使并行化模式匹配算法高效，以适应这种不断增加的数据量。我们提出了一种方法，该方法可以使用Java的标准正则表达式库自动检测用户的单线程函数调用以搜索模式，然后使用Java字节码注入将其替换为我们自己的数据并行实现。我们的方法有助于在由共享内存系统（使用多线程和NVIDIA GPU）和分布式系统（使用MPI和Hadoop）组成的不同平台上进行并行处理。我们实施的主要贡献在于减少了执行时间，同时对用户透明。除此之外，本着促进高性能代码并行化的精神，我们介绍了一种工具，该工具可从最少的用户提供的输入中自动生成Spark Java代码。 Spark已成为高效大数据分析的首选工具。但是，用户仍然必须学习复杂的Spark API才能编写甚至是一个简单的应用程序。我们的工具易于使用，具有交互性，并提供Spark的本机Java API性能。据我们所知，直到撰写本文时，这种工具尚未实现。

著录项

作者
Abi Saad, Maria.;
展开▼
作者单位

Syracuse University.;

展开▼
授予单位 Syracuse University.;
学科 Computer engineering.
学位 Ph.D.
年度 2017
页码 151 p.
总页数 151
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Closed-Loop Decoder Adaptation on Intermediate Time-Scales Facilitates Rapid BMI Performance Improvements Independent of Decoder Initialization Conditions [J] . Orsborn A.L., Dangi S., Moorman H.G., Neural Systems and Rehabilitation Engineering, IEEE Transactions on . 2012,第4期

机译：中间时间尺度上的闭环解码器自适应，有助于快速提高BMI性能，而与解码器初始化条件无关
2. Performance Evaluation of Population Seeding Techniques of Permutation-Coded GA Traveling Salesman Problems Based Assessment: Performance Evaluation of Population Seeding Techniques of Permutation-Coded GA [J] . Victer Paul, Ganeshkumar C, Jayakumar L International journal of geotechnical earthquake engineering . 2019,第2期

机译：基于置换编码GA旅行商问题的种群播种技术性能评估：基于置换编码GA的种群播种技术性能评估
3. Performance Evaluation of Population Seeding Techniques of Permutation-Coded GA Traveling Salesman Problems Based Assessment: Performance Evaluation of Population Seeding Techniques of Permutation-Coded GA [J] . Victer Paul, Ganeshkumar C, Jayakumar L International journal of geotechnical earthquake engineering . 2019,第2期

机译：基于置换编码的GA旅行推销业务问题的人口种子技术性能评估：置换编码GA人口种子技术的性能评价
4. Poster Abstract: A Scalable Coded Computing Framework for Edge-Facilitated Wireless Distributed Computing [C] . Songze Li, Qian Yu, Mohammad Ali Maddah-Ali, The First IEEE/ACM Symposium on Edge Computing . 2016

机译：海报摘要：用于边缘促进型无线分布式计算的可扩展编码计算框架
5. Codes convolutionnels doublement orthogonaux recursifs: Analyse et recherche des nouveaux codes, evaluation des performances [D] . Rouleau, Laurent. 2008

机译：双正交递归卷积码：新码的分析与研究，性能评估
6. Performance of four computer-coded verbal autopsy methods for cause of death assignment compared with physician coding on 24000 deaths in low- and middle-income countries [O] . Nikita Desai, Lukasz Aleksandrowicz, Pierre Miasnikof, 2014

机译：在中低收入国家两种计算机编码的口头尸检方法对死亡原因的诊断性能与医师对24000例死亡的编码相比较
7. Motor and visual codes interact to facilitate visuospatial memory performance [O] . Marvin Chum, Harold Bekkering, Michael D. Dodd, 2007

机译：电机和视觉代码相互作用以促进面板空间内存性能

Facilitating High Performance Code Parallelization.

摘要

著录项

相似文献

相关主题

期刊订阅