【24h】

Similar Samples Cleaning in Speculative Multithreading

机译:推测性多线程中的类似样本清洗

获取原文

摘要

Speculative multithreading (SpMT) is a thread-level automatic paral-lelization technique to accelerate sequential programs on multi-core. Too large and too dense samples can not be able to effectively promote the effectiveness of thread partition, parallel thread evaluation, etc. Selection of appropriate samples is of vital importance. The appropriateness reflects in two points. First, redundant samples never exist. Second, similarity between any two samples is not high. We express a sample with one feature vector of fixed length. We extract sample feature vectors using profiler in Prophet during compile time when running programs. Such profiles are created by feature extraction routines which map each program onto a tuple (N_1 N_2, N-3, N_4, N_5, N_6) where N_i is a count of an occurrence of a particular feature. A comparison routine is then invoked which detects similarities amongst tuples. According to the program features, similarity values between samples are calculated to assess the similar degree. In this paper, we introduce a novel way of assessing the similarity of two program samples using Theory of Fuzzy. We firstly calculate the Euclidean Distance of two different program samples as the input, and then assess the overall similarity degrees as well as respective similarity degrees, using corresponding Fuzzy Functions. Based on them, we clean the similar samples. With multidimensional samples generated virtually, we get that average density of samples decreases, so that a more effective collection of samples are created.
机译:推测多线程(SpMT)是一种线程级自动并行化技术,用于加速多核上的顺序程序。太大和太稠密的样本将无法有效地提高线程分配,并行线程评估等的有效性。选择适当的样本至关重要。适当性体现在两点。首先,冗余样本永远不会存在。其次,任何两个样本之间的相似度都不高。我们用一个固定长度的特征向量表示一个样本。我们在运行程序时的编译期间使用Prophet中的事件探查器提取样本特征向量。通过特征提取例程来创建这样的配置文件,该特征提取例程将每个程序映射到元组(N_1 N_2,N-3,N_4,N_5,N_6),其中N_i是特定特征出现的计数。然后调用一个比较例程,该例程检测元组之间的相似性。根据程序特征,计算样本之间的相似度值以评估相似度。在本文中,我们介绍了一种使用模糊理论评估两个程序样本相似性的新颖方法。我们首先计算两个不同程序样本的欧式距离作为输入,然后使用相应的模糊函数评估整体相似度以及各个相似度。基于它们,我们清洗相似的样品。通过虚拟生成多维样本,我们得到样本的平均密度降低,从而创建了更有效的样本集合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号