首页> 外文会议>2017 19th International Symposium on Computer Architecture and Digital Systems >High performance implementation of 2-D convolution using AVX2

【24h】

High performance implementation of 2-D convolution using AVX2

机译：使用AVX2的二维卷积的高性能实现

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Convolution is the most important and fundamental concept in multimedia processing. The 2-D convolution is used for different filtering operations such as sharpening, smoothing, and edge detection. It performs many mathematical operations on all image pixels. Therefore, it is almost a compute-intensive kernel. In this paper, we use Intrinsic Programming Model (IPM) and AVX2 technology to vectorize this kernel, explicitly. We compare our implementations to Compilers Automatic Vectorization (CAVs), OpenCV library and OpenMP API using ICC, GCC and LLVM compilers, on a single-core. For multi-threading, OpenMP has been used to perform IPM and CAVs implementations on multi-cores. Our experimental results show that the performance of our implementations is much higher than other approaches. In addition, OpenMP improves the performance of our explicit vectorizations significantly using ICC and GCC compilers.

机译：卷积是多媒体处理中最重要和最基本的概念。二维卷积用于不同的滤波操作，例如锐化，平滑和边缘检测。它对所有图像像素执行许多数学运算。因此，它几乎是一个计算密集型内核。在本文中，我们使用内在编程模型（IPM）和AVX2技术来显式矢量化此内核。我们在单核上将我们的实现与使用ICC，GCC和LLVM编译器的编译器自动矢量化（CAV），OpenCV库和OpenMP API进行了比较。对于多线程，OpenMP已用于在多核上执行IPM和CAV实施。我们的实验结果表明，我们的实现的性能远高于其他方法。此外，OpenMP使用ICC和GCC编译器可以显着提高显式矢量化的性能。

著录项

来源
《2017 19th International Symposium on Computer Architecture and Digital Systems 》|2017年|1-4|共4页
会议地点 Kish Island(IR)
作者
Hossein Amiri; Asadollah Shahbahrami;
展开▼
作者单位

Department of Computer Engineering, Faculty of Engineering, University of Guilan, Rasht, Iran;

Department of Computer Engineering, Faculty of Engineering, University of Guilan, Rasht, Iran;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Convolution; Kernel; Programming; Two dimensional displays; Registers; Multimedia communication; Distance measurement;

机译：卷积;内核;编程;二维显示;寄存器;多媒体通信;距离测量;;

相似文献

外文文献
中文文献
专利

1. A High Performance Architecturefor Implementation Of 2-d Convolution With quadrant Symmetric Kernels [J] . M.Z. Zhang, H.T. Ngo, A.R. Livingston, International Journal of Computers & Applications . 2008 ,第4期

机译：用象限对称核实现二维卷积的高性能架构
2. Using AVX2 Instruction Set to Increase Performance of High Performance Computing Code [J] . Gepner Pawel Computing and informatics . 2017 ,第5期

机译：使用AVX2指令集提高高性能计算代码的性能
3. USING AVX2 INSTRUCTION SET TO INCREASE PERFORMANCE OF HIGH PERFORMANCE COMPUTING CODE [J] . Gepner Pawel Computing and informatics . 2017 ,第5期

机译：使用AVX2指令集来提高高性能计算代码的性能
4. High performance implementation of 2-D convolution using AVX2 [C] . Hossein Amiri, Asadollah Shahbahrami CSI International Symposium on Computer Architecture and Digital Systems . 2017

机译：使用AVX2的高性能实现2-D卷积
5. Implementation and performance analysis of 2-D order 16 integer transforms in H.264/AVC and AVS-video for high defenition video coding. [D] . Peringassery Krishnan, Madhu. 2010

机译：H.264 / AVC和AVS视频中二维二维16整数转换的实现和性能分析，用于高清晰度视频编码。
6. DEEPScreen: high performance drug–target interaction prediction with convolutional neural networks using 2-D structural compound representations [O] . Ahmet Sureyya Rifaioglu, Esra Nalbat, Volkan Atalay, 2020

机译：使用2-D结构复合表示深屏幕：高性能药物 - 目标交互预测与卷积神经网络
7. An area-efficient 2-D convolution implementation on FPGA for space applications [O] . Di Carlo, Stefano, Gambardella, Giulio, Indaco, Marco, 2011

机译：在FPGA上用于空间应用的面积有效的二维卷积实现
8. A Systolic 2-D Convolution Chip [R] . Kung, H. T., Song, S. W. 1981

机译：一种收缩的二维卷积芯片

High performance implementation of 2-D convolution using AVX2

摘要

著录项

相似文献

相关主题

期刊订阅