【24h】

Possibilistic classifiers for numerical data

机译:数值数据的可能分类器

获取原文
获取原文并翻译 | 示例
           

摘要

Naive Bayesian Classifiers, which rely on independence hypotheses, together with a normality assumption to estimate densities for numerical data, are known for their simplicity and their effectiveness. However, estimating densities, even under the normality assumption, may be problematic in case of poor data. In such a situation, possibility distributions may provide a more faithful representation of these data. Naive Possibilistic Classifiers (NPC), based on possibility theory, have been recently proposed as a counterpart of Bayesian classifiers to deal with classification tasks. There are only few works that treat possibilistic classification and most of existing NPC deal only with categorical attributes. This work focuses on the estimation of possibility distributions for continuous data. In this paper we investigate two kinds of possibilistic classifiers. The first one is derived from classical or flexible Bayesian classifiers by applying a probability-possibility transformation to Gaussian distributions, which introduces some further tolerance in the description of classes. The second one is based on a direct interpretation of data in possibilistic formats that exploit an idea of proximity between data values in different ways, which provides a less constrained representation of them. We show that possibilistic classifiers have a better capability to detect new instances for which the classification is ambiguous than Bayesian classifiers, where probabilities may be poorly estimated and illusorily precise. Moreover, we propose, in this case, an hybrid possibilistic classification approach based on a nearest-neighbour heuristics to improve the accuracy of the proposed possibilistic classifiers when the available information is insufficient to choose between classes. Possibilistic classifiers are compared with classical or flexible Bayesian classifiers on a collection of benchmarks databases. The experiments reported show the interest of possibilistic classifiers. In particular, flexible possibilistic classifiers perform well for data agreeing with the normality assumption, while proximity-based possibilistic classifiers outperform others in the other cases. The hybrid possibilistic classification exhibits a good ability for improving accuracy.
机译:朴素贝叶斯分类器依赖于独立性假设以及正态性假设来估计数字数据的密度,以其简单性和有效性而著称。然而,即使在正常假设下,估计密度也可能在数据不佳的情况下出现问题。在这种情况下,可能性分布可以更忠实地表示这些数据。最近,基于可能性理论的朴素可能性分类器(NPC)被提出作为贝叶斯分类器的对应物来处理分类任务。只有很少的作品处理可能的分类,而现有的NPC大多只处理分类属性。这项工作着重于估计连续数据的可能性分布。在本文中,我们研究了两种可能的分类器。第一个是通过对高斯分布应用概率-可能性变换而从经典或灵活的贝叶斯分类器派生而来的,这在类的描述中引入了更多的容忍度。第二种是基于对可能格式的数据的直接解释,该格式以不同的方式利用了数据值之间的接近度的思想,这为数据值提供了较少受约束的表示形式。我们表明,与贝叶斯分类器相比,可能性分类器具有更好的检测分类歧义的新实例的能力,在贝叶斯分类器中,概率可能估算得很差,而且错综复杂。此外,在这种情况下,我们提出了一种基于最近邻启发式的混合可能性分类方法,以在可用信息不足以在类别之间进行选择时提高提议的可能性分类器的准确性。在一组基准数据库上,将可能的分类器与经典或灵活的贝叶斯分类器进行比较。报道的实验表明可能分类器的兴趣。特别地,灵活的可能性分类器在与正态性假设相符的数据上表现良好,而在其他情况下,基于接近度的可能性分类器则优于其他分类器。混合可能性分类法具有提高准确性的良好能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号