首页> 外文会议>Imaging Informatics for Healthcare, Research, and Applications >Correlative hierarchical clustering-based low-rank dimensionality reduction of radiomics-driven phenotype in non-small cell lung cancer
【24h】

Correlative hierarchical clustering-based low-rank dimensionality reduction of radiomics-driven phenotype in non-small cell lung cancer

机译:非小细胞肺癌中的基于相关分层聚类的低秩维数减少了非小细胞肺癌的辐射瘤

获取原文

摘要

Background: Lung cancer is one of the most common cancers in the United States and the most fatal, with 142,670 deaths in 2019. Accurately determining tumor response is critical to clinical treatment decisions, ultimately impacting patient survival. To better differentiate between non-small cell lung cancer (NSCLC) responders and non-responders to therapy, radiomic analysis is emerging as a promising approach to identify associated imaging features undetectable by the human eye. However, the plethora of variables extracted from an image may actually undermine the performance of computer-aided prognostic assessment, known as the curse of dimensionality. In the present study, we show that correlative-driven hierarchical clustering improves high-dimensional radiomics-based feature selection and dimensionality reduction, ultimately predicting overall survival in NSCLC patients. Methods: To select features for high-dimensional radiomics data, a correlation-incorporated hierarchical clustering algorithm automatically categorizes features into several groups. The truncation distance in the resulting dendrogram graph is used to control the categorization of the features, initiating low-rank dimensionality reduction in each cluster, and providing descriptive features for Cox proportional hazards (CPH)-based survival analysis. Using a publicly available non- NSCLC radiogenomic dataset of 204 patients' CT images, 429 established radiomics features were extracted. Low-rank dimensionality reduction via principal component analysis (PCA) was employed (k = 1, n < 1) to find the representative components of each cluster of features and calculate cluster robustness using the relative weighted consistency metric. Results: Hierarchical clustering categorized radiomic features into several groups without primary initialization of cluster numbers using the correlation distance metric (as a function) to truncate the resulting dendrogram into different distances. The dimensionality was reduced from 429 to 67 features (for truncation distance of 0.1). The robustness within the features in clusters was varied from -1.12 to -30.02 for truncation distances of 0.1 to 1.8, respectively, which indicated that the robustness decreases with increasing truncation distance when smaller number of feature classes (i.e., clusters) are selected. The best multivariate CPH survival model had a C-statistic of 0.71 for truncation distance of 0.1, outperforming conventional PCA approaches by 0.04, even when the same number of principal components was considered for feature dimensionality. Conclusions: Correlative hierarchical clustering algorithm truncation distance is directly associated with robustness of the clusters of features selected and can effectively reduce feature dimensionality while improving outcome prediction.
机译:背景:肺癌是美国最常见的癌症和最致命的之一,142670人死亡2019年准确地确定肿瘤的反应是临床治疗决策,最终影响患者的生存是至关重要的。对治疗非小细胞肺癌(NSCLC)应答​​者和非应答者之间更好地区分,radiomic分析正在成为一个有前途的方法,以确定相关的成像设有由人眼不可检测的。然而,变量从图像中提取的大量可能实际上破坏计算机辅助预后评估的性能,被称为维度的诅咒。在本研究中,我们表明,相关的驱动层次聚类提高高维基于radiomics的特征选择和降维,最终预测非小细胞肺癌患者的总生存。方法:为了选择高维数据radiomics特征,相关并入分级聚类算法自动分类特征为若干组。在所得到的树状图图的截断距离被用于控制的特征的分类,发起在每个集群低等级降维,以及用于Cox比例风险(CPH)系存活分析提供的描述性特征。使用的204名患者的CT图像的可公开获得的非NSCLC radiogenomic数据集,提取429种建立radiomics功能。通过主成分分析(PCA)低秩降维被采用(K = 1,N <1)找到使用相对加权一致性度量的特征每个群集,计算群集的鲁棒性的代表性组件。结果:分级聚类分类radiomic特征分成若干组,而不使用相关距离度量(作为函数)来截断所得树形图分割至不同的距离簇编号的主初始化。维度从429降至67个特征(对于截断距离为0.1)。在簇中的特征中的鲁棒性,从-1.12变化到-30.02分别为0.1截断距离至1.8,这表明鲁棒性,当选择要素类(即,簇)的更小数量随着截断距离减小。最好多元CPH存活模型具有0.71 C-统计量的0.1截断距离,表现优于常规PCA 0.04接近,即使当被认为是特征维数相同数目的主成分分析。结论:相关层次聚类算法截断距离直接与所选择的特征的簇的稳健性相关联,并且可以有效地减少特征维数的同时提高结果的预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号