首页> 外文会议>Data Mining Workshops, ICDMW, 2008 IEEE International Conference on >Using Betweenness Centrality to Identify Manifold Shortcuts
【24h】

Using Betweenness Centrality to Identify Manifold Shortcuts

机译:使用中间性中心来确定歧管快捷方式

获取原文

摘要

High-dimensional data presents a significant challenge to a broad spectrum of pattern recognition and machine-learning applications. Dimensionality reduction (DR) methods serve to remove unwanted variance and make such problems tractable. Several nonlinear DR methods, such as the well known ISOMAP algorithm, rely on a neighborhood graph to compute geodesic distances between data points. These graphs may sometimes contain unwanted edges which connect disparate regions of one or more manifolds. This topological sensitivity is well known, yet managing high-dimensional, noisy data in the absence of a priori knowledge, remains an open and difficult problem. This manuscript introduces a divisive, edge-removal method based on graph betweenness centrality which can robustly identify manifold-shorting edges. The problem of graph construction in high dimensions is discussed and the proposed algorithm is inserted into the ISOMAP workflow. ROC analysis is performed and the performance is tested on both synthetic and real datasets.
机译:高维数据对广泛的模式识别和机器学习应用提出了重大挑战。降维(DR)方法用于消除不必要的差异并使此类问题易于解决。几种非线性DR方法(例如众所周知的ISOMAP算法)依赖于邻域图来计算数据点之间的测地距离。这些图有时可能包含不想要的边缘,这些边缘连接一个或多个歧管的不同区域。这种拓扑敏感性是众所周知的,但是在没有先验知识的情况下管理高维,嘈杂的数据仍然是一个开放而困难的问题。该手稿介绍了一种基于图之间居中性的分割式边缘去除方法,该方法可以可靠地识别流形短路边缘。讨论了高维图形构建的问题,并将所提出的算法插入ISOMAP工作流程中。进行了ROC分析,并在综合数据集和实际数据集上测试了性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号