...
首页> 外文期刊>Machine Learning >Joint maximization of accuracy and information for learning the structure of a Bayesian network classifier
【24h】

Joint maximization of accuracy and information for learning the structure of a Bayesian network classifier

机译:学习贝叶斯网络分类器结构的准确性和信息的联合最大化

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Although recent studies have shown that a Bayesian network classifier (BNC) that maximizes the classification accuracy (i.e., minimizes the 0/1 loss function) is a powerful tool in both knowledge representation and classification, this classifier: (1) focuses on the majority class and, therefore, misclassifies minority classes; (2) is usually uninformative about the distribution of misclassifications; and (3) is insensitive to error severity (making no distinction between misclassification types). In this study, we propose to learn the structure of a BNC using an information measure (IM) that jointly maximizes the classification accuracy and information, motivate this measure theoretically, and evaluate it compared with six common measures using various datasets. Using synthesized confusion matrices, twenty-three artificial datasets, seventeen UCI datasets, and different performance measures, we show that an IM-based BNC is superior to BNCs learned using the other measures-especially for ordinal classification (for which accounting for the error severity is important) and/or imbalanced problems (which are most real-life classification problems)-and that it does not fall behind state-of-the-art classifiers with respect to accuracy and amount of information provided. To further demonstrate its ability, we tested the IM-based BNC in predicting the severity of motorcycle accidents of young drivers and the disease state of ALS patients-two class-imbalance ordinal classification problems-and show that the IM-based BNC is accurate also for the minority classes (fatal accidents and severe patients) and not only for the majority class (mild accidents and mild patients) as are other classifiers, providing more informative and practical classification results. Based on the many experiments we report on here, we expect these advantages to exist for other problems in which both accuracy and information should be maximized, the data is imbalanced, and/or the problem is ordinal, whether the classifier is a BNC or not. Our code, datasets, and results are publicly available http://www.ee.bgu.ac.il/similar to boaz/software.
机译:尽管最近的研究表明,贝叶斯网络分类器(BNC)最大化分类准确性(即,最小化0/1损耗函数)是知识表示和分类中的强大工具,此分类器:(1)专注于大多数班级,因此,错误分类少数阶级; (2)通常是对错误分类分配的不可形式化; (3)对错误严重性不敏感(在错误分类类型之间没有区别)。在这项研究中,我们建议使用信息测量(IM)学习BNC的结构,该信息测量(IM)在理论上联合最大化分类精度和信息,激励这一措施,并使用各种数据集进行六种常见措施进行评估。使用合成的混乱矩阵,二十三个人工数据集,17个UCI数据集和不同的性能措施,我们表明,基于IM的BNC优于使用其他措施的BNC - 特别是对于序数分类(用于错误严重性的计费是重要的)和/或不平衡问题(这是大多数现实生活分类问题) - 并不落在最先进的分类器后面,了解所提供的准确性和信息量。为了进一步展示其能力,我们测试了基于IM的BNC,预测了年轻司机摩托车事故的严重程度以及ALS患者的疾病状态 - 两个类别不平衡序数分类问题 - 并且表明IM为基础的BNC也是准确的对于少数民族课程(致命事故和严重患者)而且不仅适用于多数阶级(轻度事故和轻度患者),也是其他分类机,提供更具信息丰富和实​​践的分类结果。基于我们在此报告的许多实验,我们预计将存在这些优点,其中对两种准确性和信息都应该最大化,数据是不平衡的,并且/或问题是序号,是否分类器是一个bnc 。我们的代码,数据集和结果是公开使用http://www.ee.bu.ac.il/similar与boaz / software。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号