...
首页> 外文期刊>Journal of Parallel and Distributed Computing >GPU accelerated t-distributed stochastic neighbor embedding
【24h】

GPU accelerated t-distributed stochastic neighbor embedding

机译:GPU加速了t分布的随机邻居嵌入

获取原文
获取原文并翻译 | 示例

摘要

Modern datasets and models are notoriously difficult to explore and analyze due to their inherent high dimensionality and massive numbers of samples. Existing visualization methods which employ dimensionality reduction to two or three dimensions are often inefficient and/or ineffective for these datasets. This paper introduces t-SNE-CUDA, a GPU-accelerated implementation of t-Distributed Symmetric Neighbor Embedding (t-SNE) for visualizing datasets and models. t-SNE-CUDA significantly outperforms current implementations with 15-700x speedups on the CIFAR-10 and MNIST datasets. These speedups enable, for the first time, large scale visualizations of modern computer vision datasets such as ImageNet, as well as larger NLP datasets such as GloVe. From these new visualizations, we can draw a number of interesting conclusions. In addition, the performance on machine learning datasets allows us to compute t-SNE embeddings in close to real time, and we explore the applications of such fast embeddings in the domain of importance sampling for neural network training. (C) 2019 Elsevier Inc. All rights reserved.
机译:众所周知,现代数据集和模型因其固有的高维性和大量样本而难以探索和分析。对于这些数据集,采用降维到二维或三维的现有可视化方法通常是无效的和/或无效的。本文介绍了t-SNE-CUDA,这是t分布式对称邻居嵌入(t-SNE)的GPU加速实现,用于可视化数据集和模型。在CIFAR-10和MNIST数据集上,t-SNE-CUDA的性能提高了15-700倍,大大超过了当前的实现。这些提速首次实现了对现代计算机视觉数据集(例如ImageNet)以及较大的NLP数据集(例如GloVe)的大规模可视化。从这些新的可视化中,我们可以得出许多有趣的结论。此外,机器学习数据集的性能使我们能够实时地计算t-SNE嵌入,并且我们探索了这种快速嵌入在神经网络训练的重要性采样领域中的应用。 (C)2019 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号