首页> 外文期刊>Biocybernetics and biomedical engineering / >Diagnosis of Parkinson's disease based on SHAP value feature selection
【24h】

Diagnosis of Parkinson's disease based on SHAP value feature selection

机译:基于SHAP值特征选择的帕金森病诊断

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

To address the problem of high feature dimensionality of Parkinson's disease medical data, this paper introduces SHapley Additive exPlanations (SHAP) value for feature selection of Parkinson's disease medical dataset. This paper combines SHAP value with four classifiers, namely deep forest (gcForest), extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM) and random forest (RF), respectively. Then this paper applies them to Parkinson's disease diagnosis. First, the classifier is used to calculate the magnitude of con-tribution of SHAP value to the features, then the features with significant contribution in the classification task are selected, and then the data after feature selection is used as input to classify the Parkinson's disease dataset for diagnosis using the classifier. The experimental results show that compared to Fscore, analysis of variance (Anova-F) and mutual informa-tion (MI) feature selection methods, the four models based on SHAP-value feature selection achieved good classification results. The SHAP-gcForest model combined with gcForest achieves classification accuracy of 91.78 and F1-score of 0.945 when 150 features are selected. The SHAP-LightGBM model combined with LightGBM achieves classification accu-racy and F1-score of 91.62 and 0.945 when 50 features are selected, respectively. The clas-sification effectiveness is second only to the SHAP-gcForest model, but the SHAP-LightGBM model is more computationally efficient than the SHAP-gcForest model. Finally, the effec-tiveness of the proposed method is verified by comparing it with the results of existing lit-erature. The findings demonstrate that machine learning with SHAP value feature selection method has good classification performance in the diagnosis of Parkinson's disease, and provides a reference for physicians in the diagnosis and prevention of Parkinson's disease.(c) 2022 Nalecz Institute of Biocybernetics and Biomedical Engineering of the Polish Academy of Sciences. Published by Elsevier B.V. All rights reserved.
机译:针对帕金森病医学数据集特征维数高的问题,该文引入SHapley加法解释(SHAP)值对帕金森病医学数据集进行特征选择。该文将SHAP值与4个分类器相结合,分别是深森林(gcForest)、极端梯度提升(XGBoost)、轻度梯度提升机(LightGBM)和随机森林(RF)。然后,本文将其应用于帕金森病的诊断。首先,利用分类器计算SHAP值对特征的贡献幅度,然后选择在分类任务中贡献显著的特征,然后将特征选择后的数据作为输入,利用分类器对帕金森病数据集进行分类诊断。实验结果表明,与Fscore、方差分析(Anova-F)和互信息(MI)特征选择方法相比,基于SHAP值特征选择的4种模型取得了较好的分类效果。SHAP-gcForest模型结合gcForest,在选择150个特征时,分类准确率为91.78%,F1得分为0.945。SHAP-LightGBM模型结合LightGBM在选取50个特征时,分类精度和F1得分分别为91.62%和0.945。分类效率仅次于SHAP-gcForest模型,但SHAP-LightGBM模型的计算效率高于SHAP-gcForest模型。最后,通过与现有照明结果的对比,验证了所提方法的有效性。结果表明,基于SHAP值特征选择方法的机器学习在帕金森病的诊断中具有良好的分类性能,为医生诊断和预防帕金森病提供了参考。(c) 2022 年波兰科学院纳莱茨生物控制论和生物医学工程研究所。由以下开发商制作:Elsevier B.V.保留所有权利。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号