首页> 外文会议>International Conference on Computational Science and Its Applications >Comparison of Efficiency, Stability and Interpretability of Feature Selection Methods for Multiclassification Task on Medical Tabular Data
【24h】

Comparison of Efficiency, Stability and Interpretability of Feature Selection Methods for Multiclassification Task on Medical Tabular Data

机译:效率,稳定性和衡量特征选择方法对医疗表格数据的多分类任务

获取原文

摘要

Feature selection is an important step of machine learning pipeline. Certain models may select features intrinsically without human interactions or additional algorithms applied. Such algorithms usually belong to neural networks class. Others require help of a researcher or feature selection algorithms. However, it is hard to know beforehand which variables contain the most relevant information and which may cause difficulties for a model to learn the correct relations. In that respect, researchers have been developing feature selection algorithms. To understand what methods perform better on tabular medical data, we have conducted a set of experiments to measure accuracy, stability and compare interpretation capacities of different feature selection approaches. Moreover, we propose an application of Bayesian Inference to the task of feature selection that may provide more interpretable and robust solution. We believe that high stability and interpretability are as important as classification accuracy especially in predictive tasks in medicine.
机译:特征选择是机器学习管道的重要步骤。某些模型可以本质上选择特征,没有人机交互或应用的附加算法。这种算法通常属于神经网络类。其他人需要研究人员或特征选择算法的帮助。然而,事先难以知道哪些变量包含最相关的信息,并且可能导致模型造成困难以学习正确的关系。在这方面,研究人员一直在开发特征选择算法。要了解如何在表格医疗数据上表现更好,我们已经进行了一组实验来测量不同特征选择方法的准确性,稳定性和比较解释能力。此外,我们建议将贝叶斯推理应用于特征选择的任务,这些任务可以提供更可解释和强大的解决方案。我们认为,高稳定性和可解释性与分类准确性一样重要,特别是在药物中的预测任务中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号