首页> 外文会议>Asian conference on intelligent information and database systems >Is a Data-Driven Approach Still Better Than Random Choice with Naive Bayes Classifiers?
【24h】

Is a Data-Driven Approach Still Better Than Random Choice with Naive Bayes Classifiers?

机译:与朴素贝叶斯分类器相比,数据驱动的方法是否仍比随机选择更好?

获取原文

摘要

We study the performance of data-driven, a priori and random approaches to label space partitioning for multi-label classification with a Gaussian Naive Bayes classifier. Experiments were performed on 12 benchmark data sets and evaluated on 5 established measures of classification quality: micro and macro averaged F1 score, subset accuracy and Hamming loss. Data-driven methods are significantly better than an average run of the random baseline. In case of F1 scores and Subset Accuracy - data driven approaches were more likely to perform better than random approaches than otherwise in the worst case. There always exists a method that performs better than a priori methods in the worst case. The advantage of data-driven methods against a priori methods with a weak classifier is lesser than when tree classifiers are used.
机译:我们使用高斯朴素朴素贝叶斯分类器研究数据驱动,先验和随机方法进行标签空间分区以进行多标签分类的性能。对12个基准数据集进行了实验,并根据5种已建立的分类质量度量进行了评估:微观和宏观平均F1得分,子集准确性和汉明损失。数据驱动的方法明显优于随机基线的平均运行。在F1分数和子集准确性的情况下-在最坏的情况下,数据驱动的方法比随机方法更有可能比其他方法具有更好的性能。在最坏的情况下,总有一种方法比先验方法执行得更好。与使用树分类器相比,数据驱动方法相对于具有弱分类器的先验方法的优势要小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号