首页> 外文会议>International Conference on Developments in eSystems Engineering >Infant Cry Classification Using Semi-supervised K-Nearest Neighbor Approach
【24h】

Infant Cry Classification Using Semi-supervised K-Nearest Neighbor Approach

机译:使用半监控k最近邻近的婴儿哭泣分类

获取原文

摘要

Infants cry for many different reasons. Understanding the infant’s language is a critical challenge that many parents suffer from, thus, it is hard to know precisely for what reason infants are crying. The purpose of our study is to determine whether the infant cry is due to hunger or not, using semi-supervised machine learning techniques. There are two commonly used datasets in the literature, the Dunstan Baby Language and Baby Chillanto database. The total length of each of the datasets is only between 8 and 32 minutes, which is very short. For this reason, we proposed a semi-supervised learning approach (also known as self-training), which can increase the dataset by classifying the unlabeled data from Google AudioSet. We have chosen the k-nearest neighbors (KNN) classifier to determine whether the cry is due to hunger or not. The KNN is known to produce low-performance results if trained with limited data. Thus, we proposed our semi-supervised k-nearest neighbor (SSKNN) that can benefit from unlabeled data to increase the training set. As for feature extraction, we chose Mel Frequency Cepstral Coefficient. To evaluate the performance of the semi-supervised approach, we used the supervised KNN as our baseline model and compared the accuracy between the two approaches. The SSKNN yields better accuracy, which is 94% compared to the supervised KNN which has only an accuracy of 87%.
机译:婴儿出于许多不同的原因哭泣。了解婴儿的语言是许多父母遭受的危急挑战,因此,很难完全了解婴儿在哭的原因。我们的研究目的是确定婴儿哭是否是由于饥饿而不是饥饿,使用半监控机器学习技术。文献中有两个常用的数据集,Dunstan Baby语言和婴儿Chillanto数据库。每个数据集的总长度仅在8到32分钟之间,这非常短。因此,我们提出了一个半监督的学习方法(也称为自我训练),可以通过将未标记的数据从Google audioset分类来增加数据集。我们选择了K-Collect邻居(KNN)分类器来确定哭是否是由于饥饿而不是饥饿。已知KNN如果培训有限的数据训练,则会产生低性能结果。因此,我们提出了我们的半监督K-最近邻(SSKNN),可以从未标记的数据中受益,以增加培训集。至于特征提取,我们选择MEL频率抗搏酸系数。为了评估半监督方法的性能,我们使用监督knn作为我们的基线模型,并比较了两种方法之间的准确性。 SSKNN与监督KNN相比,SSKNN产生更好的精度,这是仅具有87%的准确性的94%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号