...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Finding representative landmarks of data on manifolds
【24h】

Finding representative landmarks of data on manifolds

机译:在流形上找到代表性的数据界标

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Data-driven non-parametric models, such as manifold learning algorithms, are promising data analysis tools. However, to fit an off-training-set data point in a learned model, one must first "locate" the point in the training set. This query has a time cost proportional to the problem size, which limits the model's scalability. In this paper, we address the problem of selecting a subset of data points as the landmarks helping locate the novel points on the data manifolds. We propose a new category of landmarks defined with the following property: the way the landmarks represent the data in the ambient Euclidean space should resemble the way they represent the data on the manifold, Given the data points and the subset of landmarks, we provide procedures to test whether the proposed property presents for the choice of landmarks. If the data points are organized with a neighbourhood graph, as it is often conducted in practice, we interpret the proposed property in terms of the graph topology. We also discuss the extent to which the topology is preserved for landmark set passing our test procedure. Another contribution of this work is to develop an optimization based scheme to adjust an existing landmark set, which call improve the reliability for representing the manifold data. Experiments on the synthetic data and the natural data have been done. The results support the proposed propel-ties and algorithms.
机译:数据驱动的非参数模型,例如流形学习算法,是很有前途的数据分析工具。但是,要在学习模型中拟合非训练集数据点,必须首先在训练集中“定位”该点。该查询的时间成本与问题的大小成正比,这限制了模型的可伸缩性。在本文中,我们解决了选择数据点子集作为界标的问题,以帮助在数据流形上找到新点。我们提出了一种具有以下属性的新地标类别:地标表示环境欧几里得空间中数据的方式应类似于它们在流形上表示数据的方式。给定数据点和地标子集,我们提供了以下步骤:测试提议的属性是否适合选择地标。如果数据点是用邻域图组织的(通常在实践中进行),则我们将根据图拓扑来解释建议的属性。我们还将讨论通过测试过程的地标集保留拓扑的程度。这项工作的另一项贡献是开发了一种基于优化的方案来调整现有地标集,从而提高了代表流形数据的可靠性。已经对合成数据和自然数据进行了实验。结果支持提出的属性和算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号