首页> 外文会议>IEEE Interntional Conference on Data Mining Workshops >Using Betweenness Centrality to Identify Manifold Shortcuts
【24h】

Using Betweenness Centrality to Identify Manifold Shortcuts

机译:使用之间的中心性来识别流形快捷键

获取原文

摘要

High-dimensional data presents a significant challenge to a broad spectrum of pattern recognition and machine-learning applications. Dimensionality reduction (DR) methods serve to remove unwanted variance and make such problems tractable. Several nonlinear DR methods, such as the well known ISOMAP algorithm, rely on a neighborhood graph to compute geodesic distances between data points. These graphs may sometimes contain unwanted edges which connect disparate regions of one or more manifolds. This topological sensitivity is well known [1, 9, 13], yet managing high-dimensional, noisy data in the absence of a priori knowledge, remains an open and difficult problem. This manuscript introduces a divisive, edge-removal method based on graph betweenness centrality which can robustly identify manifold-shorting edges. The problem of graph construction in high dimensions is discussed and the proposed algorithm is inserted into the ISOMAP workflow. ROC analysis is performed and the performance is tested on both synthetic and real datasets.
机译:高维数据对广泛的模式识别和机器学习应用具有重大挑战。减少维度减少(DR)方法用于消除不需要的方差并使此类问题进行易行。几种非线性DR方法,例如众所周知的ISOMAP算法,依赖于邻域图来计算数据点之间的测地距离。这些曲线图有时可能包含不需要的边缘,该边缘连接一个或多个歧管的不同区域。这种拓扑敏感性是众所周知的[1,9,13],但在没有先验的知识的情况下管理高维嘈杂的数据,仍然是一个开放和难题的问题。该稿件介绍了基于植物之间的分裂,边缘去除方法,该中心是能够鲁棒地识别歧管短路边缘的中心性。讨论了高维度的图形结构问题,并将所提出的算法插入ISOMAP工作流程中。执行ROC分析,在合成和实时数据集上测试性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号