首页> 外文会议>IEEE International Conference on Bioinformatics and Biomedicine >Assessing impacts of data volume and data set balance in using deep learning approach to human activity recognition
【24h】

Assessing impacts of data volume and data set balance in using deep learning approach to human activity recognition

机译:在使用深度学习方法进行人类活动识别时评估数据量和数据集平衡的影响

获取原文

摘要

Over the past decade, deep learning developed rapidly and had significant impact on a variety of application domains. It has been applied to the field of human activity recognition to substitute for well-established analysis techniques that rely on handcrafted feature extraction and classification methods in recent years. However, less attentions have been paid to the influence of training data on recognition accuracy. In this paper, we assessed the influence factors of data volume and data balance in human activity recognition when using deep learning approaches. We evaluated the relationship between data volumes of training dataset and predict accuracy of deep learning algorithms. Given the impact of the data balance between activity categories on the recognition accuracy, we modified the SMOTE algorithm so that it can be applied to human activity recognition. Results show that when the data volume is small (<;4M), the recognition accuracy increased quickly with the increase of the quantity of training data. However, the growth trend of recognition accuracy slows down when the data quantity reaches 4 million. Further increase the data volume does not significantly improve the activity recognition performance. So we can conclude that 4 million data volume can ensure a sufficient accuracy for human activity recognition. Meanwhile, the data set balance operation can not only improve the recognition accuracy of minority categories, but also helps to increase the overall accuracy.
机译:在过去的十年中,深度学习发展迅速,并对各种应用程序领域产生了重大影响。近年来,它已被应用于人类活动识别领域,以替代依靠手工特征提取和分类方法的成熟分析技术。但是,很少将注意力放在训练数据对识别准确性的影响上。在本文中,我们评估了使用深度学习方法时数据量和数据平衡在人类活动识别中的影响因素。我们评估了训练数据集的数据量之间的关系,并预测了深度学习算法的准确性。考虑到活动类别之间的数据平衡对识别准确性的影响,我们修改了SMOTE算法,使其可以应用于人类活动识别。结果表明,当数据量较小(<; 4M)时,随着训练数据量的增加,识别精度迅速提高。但是,当数据量达到400万时,识别精度的增长趋势变慢。进一步增加数据量不会显着提高活动识别性能。因此我们可以得出结论,400万个数据量可以确保足够的准确性以进行人类活动识别。同时,数据集平衡运算不仅可以提高少数群体类别的识别准确度,而且有助于提高整体准确度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号