【24h】

Steerable, Progressive Multidimensional Scaling

机译:可控,渐进式多维缩放

获取原文

摘要

Current implementations of multidimensional scaling (MDS), an approach that attempts to best represent data point similarity in a low-dimensional representation, are not suited for many of today's large-scale datasets. We propose an extension to the spring model approach that allows the user to interactively explore datasets that are far beyond the scale of previous implementations of MDS. We present MDSteer, a steerable MDS computation engine and visualization tool that progressively computes an MDS layout and handles datasets of over one million points. Our technique employs hierarchical data structures and progressive layouts to allow the user to steer the computation of the algorithm to the interesting areas of the dataset. The algorithm iteratively alternates between a layout stage in which a subselection of points are added to the set of active points affected by the MDS iteration, and a binning stage which increases the depth of the bin hierarchy and organizes the currently unplaced points into separate spatial regions. This binning strategy allows the user to select onscreen regions of the layout to focus the MDS computation into the areas of the dataset that are assigned to the selected bins. We show both real and common synthetic benchmark datasets with dimensionalities ranging from 3 to 300 and cardinalities of over one million points
机译:多维缩放(MDS)的当前实现是一种尝试以低维表示形式最好地表示数据点相似性的方法,因此不适用于当今的许多大型数据集。我们提出了对弹簧模型方法的扩展,该模型允许用户以交互方式探索远远超出MDS先前实现范围的数据集。我们介绍了MDSteer,这是一种可操纵的MDS计算引擎和可视化工具,可逐步计算MDS布局并处理超过一百万点的数据集。我们的技术采用分层数据结构和渐进式布局,以允许用户将算法的计算引导到数据集的有趣区域。该算法迭代地在布局阶段(其中将点的子选择添加到受MDS迭代影响的活动点的集合)与装箱阶段之间进行交替,该装箱阶段增加了箱层次结构的深度并将当前未放置的点组织到单独的空间区域中。这种分箱策略允许用户选择布局的屏幕上区域,以将MDS计算集中到分配给所选分箱的数据集区域中。我们展示了真实和通用的综合基准数据集,其维度范围从3到300,并且基数超过一百万点

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号