首页> 外文期刊>Biomedical signal processing and control >A novel rhinitis prediction method for class imbalance
【24h】

A novel rhinitis prediction method for class imbalance

机译:一种新型鼻炎预测方法,适用于类别不平衡

获取原文
获取原文并翻译 | 示例

摘要

Rhinitis is a prevalent respiratory disease. Clinical rhinitis instances are characterized by multi-label and class imbalance, which is difficult to be accurately classified by typical machine learning methods. We propose a cascaded under-sampling ensemble learning method (CUEL) to construct multiple batch classifiers, each of which is composed of a few base classifiers with different structures. Through batch-by-batch under-sampling, the correctly classified instances of majority class are removed, and the samples that are difficult to classify are kept to gradually reach the equalization of class imbalance. We assign different weights to each of the batch classifiers to construct the final integrated classifier. Cross validation was performed on 2231 clinical rhinitis instances from Shanghai Tongji Hospital Affiliated to Tongji University. The experiment showed that the average accuracy, true positive rate, and G-mean of the CUEL model were 90.71 %, 87.44 %, and 88.18 %, respectively. Compared to typical classifiers, the CUEL model has higher accuracy, true positive rate and lower missed diagnosis rate, and has stronger generalization performance. It can make full use of all rhinitis instances and effectively reduce the prediction deviation caused by class imbalance. Therefore, it has a good auxiliary effect for the prevention and diagnosis of clinical rhinitis. In addition, we calculate the feature importance for rhinitis features on the grounds of the purity of nodes in decision-making tree inside Random Forest and study the correlation between rhinitis features and classifications.
机译:鼻炎是一种普遍的呼吸道疾病。临床鼻炎实例的特点是多标签和类别不平衡,这难以通过典型的机器学习方法准确分类。我们提出了一种级联的底层抽样集合学习方法(CUEUE)来构建多个批量分类器,每个分类器由具有不同结构的一些基本分类器组成。通过批次批次抽样,移除了大多数类的正确分类实例,并且难以分类的样本逐渐达到类别不平衡的均衡。我们为每个批量分类器分配不同的权重,以构造最终集成分类器。对同济大学附属的上海同济医院的2231家临床鼻炎实例进行了交叉验证。实验表明,加油模型的平均准确性,真正的阳性率和G均值分别为90.71%,87.44%和88.18%。与典型分类器相比,Cuel模型具有更高的准确度,真正的阳性率和低迷的诊断率,具有更强的泛化性能。它可以充分利用所有鼻炎,并有效地减少由类别不平衡引起的预测偏差。因此,它对预防和诊断临床鼻炎具有良好的辅助效果。此外,我们计算随机森林内决策树中节点纯度的鼻炎特征的特征重要性,并研究鼻炎特征和分类之间的相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号