首页> 外文会议>IT in Medicine and Education, 2008 IEEE International Symposium on >Application of improved random forest variables importance measure to traditional Chinese chronic gastritis diagnosis
【24h】

Application of improved random forest variables importance measure to traditional Chinese chronic gastritis diagnosis

机译:改进后的随机森林变量重要性测度在中国慢性胃炎诊断中的应用

获取原文

摘要

Many machine learning approaches have been proposed to establish the chronic gastritis diagnostic models. But till now, most of the machine-learning classifiers do not give any insight as to which features play key roles with respect to the derived classifier as well as the individual class. Recently, the variables importance measure yielded by random forest (RF) has been proposed in many applications. However, in multi-label classifications RF attempts to yield a common feature ranking for all classes, which fail in identifying the distinct predictive structures for individual class. This paper developed an improved random forest variables importance measure to evaluate the importance of features according to each individual class in multi-classification problem, and then applied a wrapper method for feature selection to construct the key features sets referring to each subtype of the chronic gastritis. Experiment results show that, compared with the previous studies, the selected features are more close to expert knowledge and contribute to better understanding of the underlying process that characterize the chronic gastritis.
机译:已经提出了许多机器学习方法来建立慢性胃炎诊断模型。但是直到现在,大多数机器学习分类器还没有提供关于哪个功能在派生分类器和各个类中起关键作用的见解。最近,在许多应用中已经提出了由随机森林(RF)产生的变量重要性度量。然而,在多标签分类中,RF试图对所有类别产生一个共同的特征等级,这未能识别出各个类别的不同预测结构。本文提出了一种改进的随机森林变量重要性评估方法,以评估在多分类问题中每个类别的特征的重要性,然后应用包装方法进行特征选择,以构造参照慢性胃炎每个亚型的关键特征集。实验结果表明,与以前的研究相比,所选特征与专家知识更加接近,有助于更好地理解慢性胃炎的特征性潜在过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号