首页> 外文会议>International Conference on Information Technology in Medicine and Education >Prediction of Postpartum Hemorrhage by Adaptive K-Nearest Neighbor Based on Influence Factors
【24h】

Prediction of Postpartum Hemorrhage by Adaptive K-Nearest Neighbor Based on Influence Factors

机译:基于影响因素的自适应k靠邻居预测产后出血

获取原文

摘要

For the binary classification problems that the available values for all feature attributes in the dataset are discrete conventional, K-Nearest Neighbor (KNN) considers that the distances between different feature attribute values are the same when calculating the distance between the predicted sample and the training sample, which is not consistent with the actual situation. In this paper, we proposed an algorithm named Adaptive K-Nearest Neighbor Based on Influence Factors (AKNN-IF) which solved the disadvantages of KNN algorithm in calculating the distance of discrete conventional feature attributes. Moreover, as the size of the dataset grows, the influence factors will reflect the degree of the influence more realistically of different feature attribute values on label attributes and it's more effective to use influence factors to calculate the distance of different feature attribute values. Meanwhile, the impact of imbalanced data on the classification effect is weakened. As the influence factor is obtained through self-learning of the data, no human intervention is required, so that the algorithm has a certain self-adaptability. We tested some data which contained 1829 samples and proved that the classification effect of the AKNN-IF algorithm is superior to the KNN algorithm, SVM, and C4.5 for the binary classification problems in which all the feature attributes are discrete conventional. AKNN-IF can contribute to the prevention and control of postpartum hemorrhage.
机译:对于二进制分类问题,即数据集中的所有特征属性的可用值是离散的传统,k最近邻(knn)认为在计算预测样本和培训之间的距离时不同特征属性值之间的距离是相同的样品,这与实际情况不一致。在本文中,我们提出了一种基于影响因素(AKNN-IF)的名为Adaptive K-Collect邻居的算法,该算法解决了KNN算法在计算离散传统特征属性的距离时的缺点。此外,随着数据集的大小增长,影响因素将更加现实地反映标签属性的不同特征属性值的影响程度,并且使用影响因素来计算不同特征属性值的距离更有效。同时,减弱了数据对分类效果的不平衡数据的影响。由于影响因素是通过数据的自学获得的,因此不需要人类干预,因此该算法具有一定的自适应性。我们测试了一些包含1829个样本的数据,并证明了AKNN-IF算法的分类效果优于KNN算法,SVM和C4.5,用于二进制分类问题,其中所有特征属性都是离散的传统问题。 AKNN - 如果可以有助于预防和控制产后出血。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号