首页> 外文期刊>International Journal of Applied Engineering Research >An Improved Instance Based K-Nearest Neighbor (IIBK) Classification of Imbalanced Datasets with Enhanced Preprocessing
【24h】

An Improved Instance Based K-Nearest Neighbor (IIBK) Classification of Imbalanced Datasets with Enhanced Preprocessing

机译:基于实例的基于实例的基于基于邻居(IIBK)分类,具有增强的预处理的不平衡数据集

获取原文
获取原文并翻译 | 示例
           

摘要

The presence of data with skewed class distributions is a problem common to a variety of fields, including Bioinformatics, Computer science, Text classification, Remote-sensing, and Manufacturing industries. In Bioinformatics applications, the numbers of non-interacting proteins (majority class) are greater than number of interacting proteins (minority class) in predicting the protein-protein interactions. These results to imbalance dataset problem. Standard classification learning algorithms are often biased towards the majority class and therefore there is a higher misclassification rate for the minority class instances. In this work we focused on the minority class, for solving the imbalanced dataset problem. This research work consists of two stages-preprocessing and classification of imbalanced datasets. The imbalanced datasets were preprocessed using the SMOTE technique and it has been improved to show better results. Then the resultant dataset is classified using the existing J48 decision tree, Instance Based K-nearest neighbor (IBK), Support Vector Machine (SVM) and the proposed (IIBK) algorithm. Four datasets have been used for evaluation of the proposed algorithms and our results outperformed the other methods under the statistical comparative analysis.
机译:具有偏斜类分布的数据存在是各种领域的问题,包括生物信息学,计算机科学,文本分类,遥感和制造业。在生物信息学应用中,非相互作用蛋白(大多数类)的数量大于预测蛋白质 - 蛋白质相互作用的相互作用蛋白(少数类)的数量。这些结果是不平衡数据集问题。标准分类学习算法通常偏向于多数类,因此少数阶级实例的错误分类率较高。在这项工作中,我们专注于少数群体,解决了不平衡的数据集问题。该研究工作包括两个阶段 - 预处理和分类的不平衡数据集。使用Smote技术预处理的不平衡数据集并已得到改进以显示更好的结果。然后使用现有的J48决策树,实例基于k最近邻(IBK),支持向量机(SVM)和所提出的(IIBK)算法进行分类。四个数据集已用于评估所提出的算法,我们的结果在统计比较分析下表现了其他方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号