首页> 外文期刊>International journal of organizational and collective intelligence >An Improved Naive Bayes Classifier on Imbalanced Attributes
【24h】

An Improved Naive Bayes Classifier on Imbalanced Attributes

机译:属性不平衡的改进朴素贝叶斯分类器

获取原文
获取原文并翻译 | 示例
           

摘要

Data plays a major and prominent role in this modern information era. Classification is a data mining task to discover the hidden information from large amounts of data stored in the repository. This process becomes extremely challenging in case of highly imbalanced dataset. Prediction from imbalanced attributes cannot be done accurately in the following case: During the training phase, the categorical variable is not observed but the test phase encounters the categorical variable and hence it assigns zero probability which leads to false prediction. To overcome this scenario, this article proposes a novel smoothing technique called optimized laplace smoothing estimation. This technique adds a bias value function to improve the accuracy of imbalanced attributes. For example, a child dataset has more attributes and the classification model is used to predict the child weight. Some of the attribute values may not be present in the child dataset due to which Naive Bayes assigns a zero for incomplete and an empty attribute. This leads to inaccurate prediction. In such cases, Naive Bayes can be further tuned by adding some new parameters as well as altering the existing optimization method. Experimental analysis shows that this novel smoothing technique enhances the classification accuracy by means of accurate predictions for imbalanced attributes.
机译:在这个现代信息时代,数据扮演着重要的角色。分类是一项数据挖掘任务,旨在从存储在存储库中的大量数据中发现隐藏的信息。在高度不平衡的数据集的情况下,此过程变得极具挑战性。在以下情况下,无法准确地从不平衡属性进行预测:在训练阶段,未观察到分类变量,但是测试阶段遇到了分类变量,因此将其分配为零概率,这导致了错误的预测。为了克服这种情况,本文提出了一种称为优化拉普拉斯平滑估计的新颖平滑技术。此技术添加了一个偏倚值函数来提高不平衡属性的准确性。例如,子数据集具有更多属性,并且分类模型用于预测子体重。某些属性值可能不会出现在子数据集中,因为朴素贝叶斯为不完整属性和空属性分配了零。这导致不准确的预测。在这种情况下,可以通过添加一些新参数以及更改现有的优化方法来进一步调整朴素贝叶斯。实验分析表明,这种新颖的平滑技术通过对不平衡属性的准确预测来提高分类精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号