首页> 外文期刊>International journal of organizational and collective intelligence >An Improved Naive Bayes Classifier on Imbalanced Attributes
【24h】

An Improved Naive Bayes Classifier on Imbalanced Attributes

机译:在不平衡属性上改进的朴素贝叶斯分类器

获取原文
获取原文并翻译 | 示例
           

摘要

Data plays a major and prominent role in this modern information era. Classification is a data mining task to discover the hidden information from large amounts of data stored in the repository. This process becomes extremely challenging in case of highly imbalanced dataset. Prediction from imbalanced attributes cannot be done accurately in the following case: During the training phase, the categorical variable is not observed but the test phase encounters the categorical variable and hence it assigns zero probability which leads to false prediction. To overcome this scenario, this article proposes a novel smoothing technique called optimized laplace smoothing estimation. This technique adds a bias value function to improve the accuracy of imbalanced attributes. For example, a child dataset has more attributes and the classification model is used to predict the child weight. Some of the attribute values may not be present in the child dataset due to which Naive Bayes assigns a zero for incomplete and an empty attribute. This leads to inaccurate prediction. In such cases, Naive Bayes can be further tuned by adding some new parameters as well as altering the existing optimization method. Experimental analysis shows that this novel smoothing technique enhances the classification accuracy by means of accurate predictions for imbalanced attributes.
机译:数据在这个现代信息时代扮演着一个重大和突出的角色。分类是数据挖掘任务,用于从存储库中存储的大量数据发现隐藏的信息。在高度不平衡数据集的情况下,此过程变得非常具有挑战性。在以下情况下无法准确地完成来自不平衡属性的预测:在训练阶段期间,未观察到分类变量,但测试阶段遇到分类变量,因此它分配零概率,从而导致错误预测。为了克服这种情况,本文提出了一种新颖的平滑技术,称为优化的LAPLACE平滑估计。该技术增加了偏置值函数以提高不平衡属性的准确性。例如,子数据集具有更多属性,并且分类模型用于预测子权重。由于哪个天真贝内斯为不完整和空属性为其分配零而导致的某些属性值可能不存在于子数据集中。这导致预测不准确。在这种情况下,通过添加一些新参数以及改变现有的优化方法,可以进一步调整幼稚贝叶斯。实验分析表明,这种新颖的平滑技术通过对不平衡属性的准确预测来增强分类精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号