首页> 外文会议>International Conference on Computational Science >Analysis of the Construction of Similarity Matrices on Multi-core and Many-Core Platforms Using Different Similarity Metrics
【24h】

Analysis of the Construction of Similarity Matrices on Multi-core and Many-Core Platforms Using Different Similarity Metrics

机译:使用不同相似度量的多核和多核平台相似矩阵构建分析

获取原文
获取外文期刊封面目录资料

摘要

Similarity matrices are 2D representations of the degree of similarity between points of a given dataset which are employed in different fields such as data mining, genetics or machine learning. However, their calculation presents quadratic complexity and, thus, it is specially expensive for large datasets. MPICorMat is able to accelerate the construction of these matrices through the use of a hybrid paralleliza-tion strategy based on MPI and OpenMP. The previous version of this tool achieved high performance and scalability, but it only implemented one single similarity metric, the Pearson's correlation. Therefore, it was suitable only for those problems where data are normally distributed and there is a linear relationship between variables. In this work, we present an extension to MPICorMat that incorporates eight additional metrics for similarity so that the users can choose the one that best adapts to their problem. The performance and energy consumption of each metric is measured in two platforms: a multi-core platform with two Intel Xeon Sandy-Bridge processors and a many-core Intel Xeon Phi KNL. Results show that MPICorMat executes faster and consumes less energy on the many-core architecture. The new version of MPICorMat is publicly available to download from its website: https://sourceforge. net/ projects / m picormat /
机译:相似性矩阵是相似的,其在不同的领域,如数据挖掘,遗传学或机器学习中使用的给定的数据集的点之间的程度的2D表示。然而,他们的计算提出了二次复杂性,因此,它是大型数据集特别昂贵。 MPICorMat能够通过使用基于MPI和OpenMP的混合paralleliza,重刑战略,加快这些矩阵的建设。这个工具的以前版本实现了高性能和可扩展性,但它只能实现一个单一的相似性度量,Pearson相关。因此,这是仅适用于其中数据是正态分布的并且存在变量之间的线性关系的那些问题。在这项工作中,我们提出了一个扩展MPICorMat并入相似度八个附加指标,从而使用户可以选择一个最能适应他们的问题。多核平台,采用两枚英特尔至强桑迪Bridge处理器和多核心英特尔至强融核KNL:每项指标的性能和能耗在两个平台上进行测量。结果表明,MPICorMat执行速度更快,能耗更少的多核心架构。 MPICorMat的新版本是公开的,从它的网站下载:https://开头sourceforge上。净/项目/米picormat /

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号