A parallel architecture for meaning comparison


Abstract

In this paper we present a fine-grained parallel architecture that performs meaning comparison using vector cosine similarity (dot product). Meaning comparison assigns a similarity value to two objects (e.g. text documents) based on how similar their meanings, each represented as a vector, are to each other. The novelty of our design is its fine-grained parallelism, which is not exploited in available hardware-based dot product processor designs and cannot be achieved in traditional server-class processors such as the Intel Xeon. We compare the performance of our design against that of available hardware-based dot product processors, as well as against a server-class processor running optimized software code that performs the same computation. We show that our hardware design achieves a speedup of 62,000 times over an available hardware design, and a speedup of 8,866 times, with 33% (1.5 times) less power consumption, over software code running on an Intel Xeon processor for 1024 basis vectors. Our design can significantly reduce the number of servers required for similarity comparison in a distributed search engine. It can thus reduce energy consumption, investment, operational costs and floor area in search engine data centers. The design can also be deployed for other applications that require fast dot product computation.
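As a point of reference for the computation named in the abstract, the following is a minimal sequential sketch of cosine similarity between two meaning vectors. It is not the paper's hardware design, which is not described here. The function name `cosine_similarity`, the four-component example vectors, and the convention of returning 0.0 for an all-zero vector are illustrative assumptions.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same number of components")
    # Dot product: one multiplication per component, then a sum.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # Assumed convention for this sketch: a zero vector matches nothing.
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical meaning vectors over four basis terms (made-up values).
doc_a = [1.0, 2.0, 0.0, 1.0]
doc_b = [2.0, 4.0, 0.0, 2.0]  # same direction as doc_a
doc_c = [0.0, 0.0, 3.0, 0.0]  # shares no basis term with doc_a

similar = cosine_similarity(doc_a, doc_b)    # close to 1.0
unrelated = cosine_similarity(doc_a, doc_c)  # exactly 0.0
```

The per-component multiplications in the dot product do not depend on one another, so they can in principle be evaluated at the same time and then summed. Software on a general-purpose processor typically walks through them in a loop, as this sketch does.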


Bibliographic details

Source
2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS) | 2010 | pp. 1-10 | 10 pages
Conference location: Atlanta, GA (US)
Authors
Mohan Suneil; Biswas Amitava; Tripathy Aalap; Pannigrahy Jagannath; Mahapatra Rabi
Author affiliation

Department of Computer Science and Engineering, Texas A&M University, College Station, Texas, USA

Original format: PDF
Language: English
Chinese Library Classification: TP311.133
Keywords
Meaning comparison; dot product computation; green computing; hardware accelerator; information retrieval


