首页> 外文会议>Iranian Conference on Fuzzy Systems >Two layer algorithm for data classification based on rough set and Bayesian network classifiers
【24h】

Two layer algorithm for data classification based on rough set and Bayesian network classifiers

机译:基于粗糙集和贝叶斯网络分类器的数据分类两层算法

获取原文

摘要

Data classification, especially text classification has been one of the key subjects in intelligent information processing due to the enormous growth of digital content available on-line. Owing to the high feature space dimensions in most of data types, reduction of feature space and improving classification accuracy is important and difficult problem. A rough set theory is a powerful tool to deal with uncertainty, so it is a good tool for feature reduction. Bayesian networks are also one of the most powerful tools in design of expert systems located in an uncertainty framework. In this paper, we proposed an algorithm for data classification that first, it uses rough set theory and conditional entropy for feature selection and then through rough membership degree concept, it classifies objects with high membership degree, certainly. For classification of other objects, we use Bayesian network classifiers for instance Tree Augmented Naïve Bayes and general Bayesian classifier with two different search approaches. Finally, proposed algorithm is evaluated on Reuters-21578 collection and 4 UCI data sets.
机译:数据分类,特别是文本分类是智能信息处理中的关键主题之一,这是由于在线上可用的数字内容的巨大增长。由于大多数数据类型中的高特征空间尺寸,特征空间的减少和提高分类准确性是重要且难题的问题。粗糙集理论是一种处理不确定性的强大工具,因此它是一个很好的功能减少工具。贝叶斯网络也是位于不确定性框架的专家系统中最强大的工具之一。在本文中,我们提出了一种用于数据分类的算法,首先,它使用粗糙集理论和特征选择的条件熵,然后通过粗略的成员资格概念,它肯定地将具有高隶属度的对象分类。对于其他对象的分类,我们使用Bayesian网络分类器例如树增强Naïve贝内斯和普通贝叶斯分类器,具有两个不同的搜索方法。最后,在REUTERS-21578集合和4个UCI数据集上评估了所提出的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号