...
首页> 外文期刊>Medical Physics >Exploring nonlinear feature space dimension reduction and data representation in breast Cadx with Laplacian eigenmaps and t-SNE.
【24h】

Exploring nonlinear feature space dimension reduction and data representation in breast Cadx with Laplacian eigenmaps and t-SNE.

机译:利用Laplacian特征图和t-SNE探索乳房Cadx中的非线性特征空间尺寸缩减和数据表示。

获取原文
获取原文并翻译 | 示例
           

摘要

PURPOSE: In this preliminary study, recently developed unsupervised nonlinear dimension reduction (DR) and data representation techniques were applied to computer-extracted breast lesion feature spaces across three separate imaging modalities: Ultrasound (U.S.) with 1126 cases, dynamic contrast enhanced magnetic resonance imaging with 356 cases, and full-field digital mammography with 245 cases. Two methods for nonlinear DR were explored: Laplacian eigenmaps [M. Belkin and P. Niyogi, "Laplacian eigenmaps for dimensionality reduction and data representation," Neural Comput. 15, 1373-1396 (2003)] and t-distributed stochastic neighbor embedding (t-SNE) [L. van der Maaten and G. Hinton, Visualizing data using t-SNE, METHODS: These methods attempt to map originally high dimensional feature spaces to more human interpretable lower dimensional spaces while preserving both local and global information. The properties of these methods as applied to breast computer-aided diagnosis (CADx) were evaluated in the context of malignancy classification performance as well as in the visual inspection of the sparseness within the two-dimensional and three-dimensional mappings. Classification performance was estimated by using the reduced dimension mapped feature output as input into both linear and nonlinear classifiers: Markov chain Monte Carlo based Bayesian artificial neural network (MCMC-BANN) and linear discriminant analysis. The new techniques were compared to previously developed breast CADx methodologies, including automatic relevance determination and linear stepwise (LSW) feature selection, as well as a linear DR method based on principal component analysis. Using ROC analysis and 0.632+bootstrap validation, 95% empirical confidence intervals were computed for the each classifier's AUC performance. RESULTS: In the large U.S. data set, sample high performance results include, AUC0.632+ = 0.88 with 95% empirical bootstrap interval [0.787;0.895] for 13 ARD selected features and AUC0.632+ = 0.87 with interval [0.817;0.906] for four LSW selected features compared to 4D t-SNE mapping (from the original 81D feature space) giving AUC0.632+ = 0.90 with interval [0.847;0.919], all using the MCMC-BANN. CONCLUSIONS: Preliminary results appear to indicate capability for the new methods to match or exceed classification performance of current advanced breast lesion CADx algorithms. While not appropriate as a complete replacement of feature selection in CADx problems, DR techniques offer a complementary approach, which can aid elucidation of additional properties associated with the data. Specifically, the new techniques were shown to possess the added benefit of delivering sparse lower dimensional representations for visual interpretation, revealing intricate data structure of the feature space.
机译:目的:在这项初步研究中,最近开发的无监督非线性降维(DR)和数据表示技术已应用于计算机提取的乳腺病变特征空间,其跨三个独立的成像方式:超声(US),1126例,动态对比增强磁共振成像356例,全视野数字化乳腺摄影245例。探索了两种非线性DR的方法:Laplacian特征图[M. Belkin和P. Niyogi,“拉普拉斯特征图用于降维和数据表示”,神经计算。 15,1373-1396(2003)]和t分布随机邻居嵌入(t-SNE)[L. van der Maaten和G. Hinton,使用t-SNE可视化数据,方法:这些方法试图将最初的高维特征空间映射到更多人类可解释的低维空间,同时保留本地和全局信息。这些方法应用于乳腺计算机辅助诊断(CADx)的属性是在恶性分类性能以及在二维和三维映射中的稀疏性的目视检查中进行评估的。通过使用降维映射特征输出作为线性和非线性分类器的输入来估计分类性能:基于马尔可夫链基于蒙特卡洛的贝叶斯人工神经网络(MCMC-BANN)和线性判别分析。将该新技术与以前开发的乳腺CADx方法进行了比较,包括自动相关性确定和线性逐步(LSW)特征选择,以及基于主成分分析的线性DR方法。使用ROC分析和0.632+引导程序验证,对于每个分类器的AUC性能,计算出95%的经验置信区间。结果:在大型美国数据集中,样本高性能结果包括针对13种ARD选择特征的AUC0.632 + = 0.88,经验自举间隔为95%[0.787; 0.895],AUC0.632 + = 0.87,间隔为[0.817; 0.906 ]与4D t-SNE映射(来自原始81D特征空间)相比,四个LSW选定特征给出的AUC0.632 + = 0.90,间隔[0.847; 0.919],全部使用MCMC-BANN。结论:初步结果似乎表明该新方法具有匹配或超过当前先进乳腺病变CADx算法分类性能的能力。虽然不适合完全替代CADx问题中的特征选择,但DR技术提供了一种补充方法,可以帮助阐明与数据相关的其他属性。具体而言,新技术被证明具有传递稀疏的较低维表示以进行视觉解释的附加好处,从而揭示了特征空间的复杂数据结构。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号