...
首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Incremental Isometric Embedding of High-Dimensional Data Using Connected Neighborhood Graphs
【24h】

Incremental Isometric Embedding of High-Dimensional Data Using Connected Neighborhood Graphs

机译:使用连接的邻域图进行高维数据的增量等距嵌入

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Most nonlinear data embedding methods use bottom-up approaches for capturing the underlying structure of data distributed on a manifold in high dimensional space. These methods often share the first step which defines neighbor points of every data point by building a connected neighborhood graph so that all data points can be embedded to a single coordinate system. These methods are required to work incrementally for dimensionality reduction in many applications. Because input data stream may be under-sampled or skewed from time to time, building connected neighborhood graph is crucial to the success of incremental data embedding using these methods. This paper presents algorithms for updating $k$-edge-connected and $k$-connected neighborhood graphs after a new data point is added or an old data point is deleted. It further utilizes a simple algorithm for updating all-pair shortest distances on the neighborhood graph. Together with incremental classical multidimensional scaling using iterative subspace approximation, this paper devises an incremental version of Isomap with enhancements to deal with under-sampled or unevenly distributed data. Experiments on both synthetic and real-world data sets show that the algorithm is efficient and maintains low dimensional configurations of high dimensional data under various data distributions.
机译:大多数非线性数据嵌入方法使用自下而上的方法来捕获高维空间中分布在流形上的数据的底层结构。这些方法通常共享第一步,即通过建立连接的邻域图来定义每个数据点的相邻点,以便所有数据点都可以嵌入到单个坐标系中。在许多应用中,这些方法需要逐步工作以降低尺寸。由于输入数据流可能会不时采样或偏斜,因此建立连接的邻域图对于使用这些方法成功嵌入增量数据至关重要。本文提出了在添加新数据点或删除旧数据点后更新$ k $ -edge-connected和$ k $ -connected邻域图的算法。它进一步利用一种简单的算法来更新邻域图上的所有对最短距离。结合使用迭代子空间逼近的增量经典多维缩放,本文设计了具有增强功能的Isomap增量版本,以处理欠采样或分布不均的数据。在合成和真实数据集上的实验表明,该算法是有效的,并且在各种数据分布下都可以保持高维数据的低维配置。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号