首页> 外文会议>Conference on Digital Information Processing, Data Mining, and Wireless Communications >A generalized probabilistic approach for managing inconsistency to improve classifier accuracy
【24h】

A generalized probabilistic approach for managing inconsistency to improve classifier accuracy

机译:一种管理不一致提高分类器精度的通用概率方法

获取原文

摘要

Removal of inconsistency from a data set contributes significantly in improving classification accuracy. Inconsistency occurs when attributes of objects have same value but they belong to different classes. Inconsistency is either inherent in the data set or appear during different data preprocessing steps, like discretization, dimensionality reduction and missing value prediction. The aim of the paper is to develop a generalized inconsistency handling scheme based on probability distribution unlike the previous methods which are context dependent. We propose two algorithms to remove inconsistency by assigning class labels to the objects afresh based on the statistical properties of the training data set. The ultimate goal of this research work is to generate consistent data which provide superior classification accuracy compare to the original data set. The proposed methods are verified with real life intrusion domain NSL-KDD data set for establishing our claim.
机译:从数据集中删除不一致显着提高分类准确性。当对象的属性具有相同值但它们属于不同类时,会发生不一致。不一致的是数据集中的固有或在不同的数据预处理步骤期间出现,如离散化,维度降低和缺少值预测。本文的目的是基于概率分布来开发一个广义不一致处理方案,与上文相关的方法不同。我们提出了两种算法,通过将类标签重新分配给对象,基于训练数据集的统计属性来删除不一致。本研究工作的最终目标是生成一致的数据,该数据提供与原始数据集的卓越分类精度。提出的方法是用真实的入侵域NSL-KDD数据集进行验证,用于建立我们的索赔。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号