...
首页> 外文期刊>Journal of Parallel and Distributed Computing >GPU accelerated t-distributed stochastic neighbor embedding
【24h】

GPU accelerated t-distributed stochastic neighbor embedding

机译:GPU加速T分布式随机邻居嵌入

获取原文
获取原文并翻译 | 示例

摘要

Modern datasets and models are notoriously difficult to explore and analyze due to their inherent high dimensionality and massive numbers of samples. Existing visualization methods which employ dimensionality reduction to two or three dimensions are often inefficient and/or ineffective for these datasets. This paper introduces t-SNE-CUDA, a GPU-accelerated implementation of t-Distributed Symmetric Neighbor Embedding (t-SNE) for visualizing datasets and models. t-SNE-CUDA significantly outperforms current implementations with 15-700x speedups on the CIFAR-10 and MNIST datasets. These speedups enable, for the first time, large scale visualizations of modern computer vision datasets such as ImageNet, as well as larger NLP datasets such as GloVe. From these new visualizations, we can draw a number of interesting conclusions. In addition, the performance on machine learning datasets allows us to compute t-SNE embeddings in close to real time, and we explore the applications of such fast embeddings in the domain of importance sampling for neural network training. (C) 2019 Elsevier Inc. All rights reserved.
机译:由于其固有的高度和大量样本,现代数据集和模型令人难以探索和分析。对于这些数据集,采用维度降低到两维的现有可视化方法通常是低效和/或无效。本文介绍了T-SNE-CUDA,用于可视化数据集和模型的T分布式对称邻居嵌入(T-SNE)的GPU加速实现。 T-SNE-CUDA在CIFAR-10和MNIST数据集中具有15-700倍的加速度显着优于电流实现。这些Speedups首次启用现代计算机视觉数据集的大规模可视化,如想象成,以及更大的NLP数据集,如手套。从这些新的可视化,我们可以得出一些有趣的结论。此外,机器学习数据集的性能允许我们在接近实时计算T-SNE嵌入式,并且我们探讨了这种快速嵌入在神经网络培训的重要性采样领域中的应用。 (c)2019 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号