【24h】

Multipoint Neighbor Embedding

机译:多点邻居嵌入

获取原文

摘要

Dimensionality reduction methods for visualization attempt to preserve in the embedding as much of the original information as possible. However, projection to 2-D or 3-D heavily distorts the data. Instead, we propose a multipoint extension to neighbor embedding methods, which allows to express datapoints from a high-dimensional space as sets of datapoints in a low-dimensional space. Cardinality of those sets is not assumed a priori. Using gradient of the cost function, we derive an expression, which for every datapoint indicates its remote area of attraction. We use it as a heuristic that guides selection and placement of additional datapoints. We demonstrate the approach with multipoint t-SNE, and adapt the O(N log N) approximation for computing the gradient of t-SNE to our setting. Experiments show that the approach brings qualitative and quantitative gains, i.e., it expresses more pairwise similarities and multi-group memberships of individual datapoints, better preserving the local structure of the data.
机译:用于可视化尝试的维度减少方法,以尽可能多地保留嵌入式。但是,投影到2-D或3-D严重扭曲数据。相反,我们向邻居嵌入方法提出了一个多点扩展,这允许从高维空间中表达DataPoints作为低维空间中的数据点集。这些集合的基数不是假设的。使用成本函数的梯度,我们推出了一个表达式,对于每个数据点表示其远程吸引力区域。我们将其用作引导指导选择和放置其他数据点的启发式。我们展示了具有多点T-SNE的方法,并调整O(n log n)近似,以计算T-SNE的梯度。实验表明,该方法带来了定性和定量增益,即,它表达了更多的双向相似性和多组成员的单个数据点,更好地保留了数据的本地结构。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号