首页> 外文会议>IEEE Latin American Conference on Computational Intelligence >Evaluation of the Impact of Initial Positions obtained by Clustering Algorithms on the Straight Line Segments Classifier
【24h】

Evaluation of the Impact of Initial Positions obtained by Clustering Algorithms on the Straight Line Segments Classifier

机译:聚类算法获得的初始位置对直线段分类器的影响评估

获取原文
获取外文期刊封面目录资料

摘要

Supervised learning is an important component of several applications, such as speech recognition, handwritten symbol recognition, data mining, among others. Supervised classification algorithms aim at producing a learning model from a labeled training set. Different methods and approaches have been proposed to overcome the two-class classification problem. Among the existing techniques in literature, the classifier based on Straight Line Segments (SLS Classifier) is worthy of note. This technique is based on distances between points and two sets of straight line segments, whose initial positions are obtained by applying the K-Means algorithm. Then, the gradient descent method finds its optimal positions that minimize the Mean Squared Error. This paper aims to study the impact of the initial positions on the classifier accuracy. For this purpose, we performed two experiments to demonstrate the stability of the classifier performance when the initial positions are not optimal (close to the samples): (i) random initial positions and; (ii) k-means positions displaced by adding Gaussian and uniform noises. In addition, we perform a comparison with positions obtained using different clustering algorithms. As expected, the results suggest that with an increased noise level, the classification rate decreases, however, such reduction was not significant as compared when using the random initial positions. It is worth mentioning that in most of the experiments, the classification rate of the SLS and the Bayes classifier are comparable.
机译:监督学习是多种应用程序的重要组成部分,例如语音识别,手写符号识别,数据挖掘等。监督分类算法旨在从标记的训练集中产生学习模型。已经提出了不同的方法和方法来克服两类​​分类问题。在文献中的现有技术中,基于直线段的分类器(SLS分类器)值得一提。该技术基于点与两组直线段之间的距离,而直线段的初始位置是通过应用K-Means算法获得的。然后,梯度下降法找到了最小化均方误差的最佳位置。本文旨在研究初始位置对分类器准确性的影响。为此,我们进行了两个实验,以证明初始位置不是最佳位置(接近样本)时分类器性能的稳定性:(i)随机初始位置;以及(ii)通过添加高斯噪声和均匀噪声使k均值位置偏移。此外,我们对使用不同聚类算法获得的位置进行了比较。如预期的那样,结果表明,随着噪声水平的提高,分类率会降低,但是,与使用随机初始位置相比,这种降低并不明显。值得一提的是,在大多数实验中,SLS和贝叶斯分类器的分类率是可比的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号