Measurement-based evaluation of data-parallelism for OpenCV feature-detection algorithms

Abstract

We investigate the effects on execution time, shared cache usage and speed-up gains when using data-partitioned parallelism for the feature detection algorithms available in the OpenCV library. We use a data set of three different images, each scaled to six different sizes, to exercise the different cache memories of our test architectures. Our measurements reveal that the algorithms, run with the default settings of OpenCV, behave very differently under data-partitioned parallelism. Our investigation shows that the execution of the algorithms SURF, Dense and MSER correlates with L3-cache usage, and they are therefore not suitable for data-partitioned parallelism on multi-core CPUs. The other algorithms (BRISK, FAST, ORB, HARRIS, GFTT, SimpleBlob and SIFT) do not correlate with L3-cache usage to the same extent, and they are therefore more suitable for data-partitioned parallelism. Furthermore, the SIFT algorithm provides the most stable speed-up, executing between 3 and 3.5 times faster than the original execution for all image sizes. We have also evaluated the hardware resource usage by measuring the algorithm execution time simultaneously with the L3-cache usage. We have used our measurements to conclude which algorithms are suitable for parallelization on hardware with shared resources.
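As a rough illustration of what data-partitioned parallelism means here, the following minimal sketch splits an image into horizontal strips, runs a detector on each strip in its own thread, and merges the results back into whole-image coordinates. Everything in it is an assumption made for illustration: the abstract does not describe how the authors partition images, `bright_pixels` is a hypothetical stand-in rather than an OpenCV detector, and the helper names are invented. A real feature detector inspects a neighbourhood around each pixel, so strips would likely need to overlap at their borders, which this sketch leaves out.

```python
from concurrent.futures import ThreadPoolExecutor


def partition_rows(height, parts):
    """Split row indices 0..height into `parts` contiguous (start, stop) strips."""
    base, extra = divmod(height, parts)
    strips, start = [], 0
    for i in range(parts):
        # The first `extra` strips get one additional row each.
        stop = start + base + (1 if i < extra else 0)
        strips.append((start, stop))
        start = stop
    return strips


def bright_pixels(strip, threshold=200):
    """Hypothetical stand-in detector: (row, col) of pixels above a threshold."""
    return [(r, c) for r, row in enumerate(strip)
            for c, value in enumerate(row) if value > threshold]


def detect_partitioned(image, detector, parts):
    """Run `detector` on each horizontal strip in parallel and merge results."""
    strips = partition_rows(len(image), parts)
    with ThreadPoolExecutor(max_workers=parts) as pool:
        results = list(pool.map(lambda s: detector(image[s[0]:s[1]]), strips))
    # Shift strip-local row indices back to whole-image coordinates.
    merged = []
    for (start, _), points in zip(strips, results):
        merged.extend((r + start, c) for r, c in points)
    return merged


def speed_up(sequential_seconds, parallel_seconds):
    """Speed-up factor: sequential time divided by parallel time."""
    return sequential_seconds / parallel_seconds
```

With a pure-Python stand-in the threads will not produce a real speed-up; the sketch only shows the partition-and-merge structure and how a speed-up figure such as the 3 to 3.5 reported for SIFT would be computed from two measured times.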

Bibliographic details

Source
IEEE Annual Computer Software and Applications Conference | 2018 | pp. 522-1037 | 10 pages
Authors
Jakob Danielsson; Marcus Jägemar; Moris Behnam; Mikael Sjödin; Tiberiu Seceleanu
Original format: PDF
Chinese Library Classification: TP311.5-53
