...
首页> 外文期刊>Journal of Neuroscience Methods >Visualization and unsupervised predictive clustering of high-dimensional multimodal neuroimaging data
【24h】

Visualization and unsupervised predictive clustering of high-dimensional multimodal neuroimaging data

机译:高维多模态神经影像数据的可视化和无监督预测聚类

获取原文
获取原文并翻译 | 示例
           

摘要

Background: Neuroimaging machine learning studies have largely utilized supervised algorithms - meaning they require both neuroimaging scan data and corresponding target variables (e.g. healthy vs. diseased) to be successfully 'trained' for a prediction task. Noticeably, this approach may not be optimal or possible when the global structure of the data is not well known and the researcher does not have an a priori model to fit the data.New method: We set out to investigate the utility of an unsupervised machine learning technique; t-distributed stochastic neighbour embedding (t-SNE) in identifying 'unseen' sample population patterns that may exist in high-dimensional neuroimaging data. Multimodal neuroimaging scans from 92 healthy subjects were pre-processed using atlas-based methods, integrated and input into the t-SNE algorithm. Patterns and clusters discovered by the algorithm were visualized using a 2D scatter plot and further analyzed using the K-means clustering algorithm.Comparison with existing methods: t-SNE was evaluated against classical principal component analysis. Conclusion: Remarkably, based on unlabelled multimodal scan data, t-SNE separated study subjects into two very distinct clusters which corresponded to subjects' gender labels (cluster silhouette index value = 0.79). The resulting clusters were used to develop an unsupervised minimum distance clustering model which identified 93.5% of subjects' gender. Notably, from a neuropsychiatric perspective this method may allow discovery of data-driven disease phenotypes or sub-types of treatment responders.
机译:背景技术:神经影像机器学习研究已大量使用监督算法-这意味着它们需要神经影像扫描数据和相应的目标变量(例如健康vs.患病)才能成功地进行``训练''以完成预测任务。值得注意的是,当数据的全局结构不为人所知并且研究人员没有先验模型来拟合数据时,这种方法可能不是最佳方法或可行方法。新方法:我们着手研究无监督机器的效用学习技巧; t分布随机邻居嵌入(t-SNE)用于识别可能存在于高维神经影像数据中的“看不见的”样本种群模式。使用基于图谱的方法对来自92位健康受试者的多模式神经影像扫描进行了预处理,并将其整合并输入到t-SNE算法中。使用2D散点图对通过算法发现的模式和聚类进行可视化,并使用K-means聚类算法进行进一步分析。与现有方法的比较:针对经典主成分分析对t-SNE进行了评估。结论:值得注意的是,基于未标记的多模态扫描数据,t-SNE将研究对象分为两个非常不同的群集,分别对应于受试者的性别标签(群集轮廓指数值= 0.79)。所得的聚类用于建立无监督的最小距离聚类模型,该模型可识别出93.5%的受试者性别。值得注意的是,从神经精神病学的角度来看,该方法可以允许发现数据驱动的疾病表型或治疗反应者的亚型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号