...
首页> 外文期刊>Applied and Computational Harmonic Analysis >Landmark diffusion maps (L-dMaps): Accelerated manifold learning out-of-sample extension
【24h】

Landmark diffusion maps (L-dMaps): Accelerated manifold learning out-of-sample extension

机译:地标扩散图(L-dMaps):加速流形学习样本外扩展

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Diffusion maps are a nonlinear manifold learning technique based on harmonic analysis of a diffusion process over the data. Out-of-sample extensions with computational complexity (9(N), where N is the number of points comprising the manifold, frustrate applications to online learning applications requiring rapid embedding of high-dimensional data streams. We propose landmark diffusion maps (L-dMaps) to reduce the complexity to O(M), where M N is the number of landmark points selected using pruned spanning trees or k-rnedoids. Offering (N/M) speedups in out-of-sample extension, L-dMaps enable the application of diffusion maps to high-volume and/or high-velocity streaming data. We illustrate our approach on three datasets: the Swiss roll, molecular simulations of a C24H50 polymer chain, and biomolecular simulations of alanine dipeptide. We demonstrate up to 50-fold speedups in out-of-sample extension for the molecular systems with less than 4% errors in manifold reconstruction fidelity relative to calculations over the full dataset. (C) 2017 Elsevier Inc. All rights reserved.
机译:扩散图是基于数据扩散过程的谐波分析的非线性流形学习技术。样本外扩展具有计算复杂度(9(N),其中N是包含流形,令人沮丧的应用程序的点数,这些应用程序需要快速嵌入高维数据流的在线学习应用程序。我们提出了界标扩散图(L- dMaps),以将复杂度降低至O(M),其中M N是使用修剪的生成树或k神经形选择的界标点的数量。提供(N / M)的样本外扩展加速,L- dMaps可以将扩散图应用于大量和/或高速流数据,我们在三个数据集上说明了我们的方法:Swiss roll,C24H50聚合物链的分子模拟以及丙氨酸二肽的生物分子模拟。相对于整个数据集的计算,分子系统的样本外扩展速度提高50倍,并且歧管重构保真度的误差小于4%(C)2017 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号