Assessing impacts of data volume and data set balance in using deep learning approach to human activity recognition

Abstract

Over the past decade, deep learning has developed rapidly and has had a significant impact on a variety of application domains. In recent years it has been applied to human activity recognition as a substitute for well-established analysis techniques that rely on handcrafted feature extraction and classification methods. However, less attention has been paid to the influence of training data on recognition accuracy. In this paper, we assessed the influence of data volume and data balance on human activity recognition when using deep learning approaches. We evaluated the relationship between the data volume of the training dataset and the prediction accuracy of deep learning algorithms. Given the impact of the data balance between activity categories on recognition accuracy, we modified the SMOTE algorithm so that it can be applied to human activity recognition. Results show that when the data volume is small (<4M), recognition accuracy increases quickly as the quantity of training data grows. However, the growth in recognition accuracy slows once the data quantity reaches 4 million, and further increasing the data volume does not significantly improve activity recognition performance. We therefore conclude that a data volume of 4 million can ensure sufficient accuracy for human activity recognition. Meanwhile, balancing the data set not only improves the recognition accuracy of minority categories but also helps to increase the overall accuracy.
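
The abstract mentions a modified SMOTE algorithm but does not describe the modification. For orientation only, the sketch below shows the standard SMOTE idea (synthesizing a minority-class sample by interpolating between a sample and one of its nearest minority-class neighbours). It is not the authors' variant; the function name `smote`, its parameters, and the toy data are assumptions made for illustration.

```python
import random
from typing import List


def squared_distance(a: List[float], b: List[float]) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def smote(minority: List[List[float]], n_synthetic: int,
          k: int = 5, seed: int = 0) -> List[List[float]]:
    """Generate n_synthetic samples with plain SMOTE (hypothetical helper).

    Each synthetic sample lies on the line segment between a randomly
    chosen minority sample and one of its k nearest minority neighbours.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbours than samples
    synthetic = []
    for _ in range(n_synthetic):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Indices of the k nearest neighbours of `base`, excluding itself.
        neighbours = sorted(
            (j for j in range(len(minority)) if j != i),
            key=lambda j: squared_distance(base, minority[j]),
        )[:k]
        other = minority[rng.choice(neighbours)]
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


# Toy minority class: four points on the corners of the unit square.
minority = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
new_points = smote(minority, n_synthetic=6, k=2)
```

Because every synthetic point is a convex combination of two existing minority samples, the new points stay inside the region already covered by that class. How the paper adapts this step to activity-recognition data is not stated in the abstract.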

Bibliographic record

Source
IEEE International Conference on Bioinformatics and Biomedicine | 2017 | pp. 1160-1165 | 6 pages
Authors
Haipeng Chen; Fuhai Xiong; Dihong Wu; Lingxiang Zheng; Ao Peng; Xuemin Hong; Biyu Tang; Hai Lu; Haibin Shi; Huiru Zheng
Original format
PDF
Keywords
Activity recognition; Machine learning; Temperature sensors; Intelligent sensors; Acceleration; Neurons
Date indexed
2022-08-26 15:25:04

Similar literature

1. ERA: A Data Set and Deep Learning Benchmark for Event Recognition in Aerial Videos [Software and Data Sets] [J]. Lichao Mou, Yuansheng Hua, Pu Jin, Geoscience and Remote Sensing. 2020, No. 4
2. An end-to-end deep learning model for human activity recognition from highly sparse body sensor data in Internet of Medical Things environment [J]. Hassan Mohammad Mehedi, Ullah Sana, Hossain M. Shamim, Journal of supercomputing. 2021, No. 3
3. Human Activity Recognition from Body Sensor Data using Deep Learning [J]. Hassan Mohammad Mehedi, Huda Shamsul, Uddin Md Zia, Journal of medical systems. 2018, No. 6
4. Assessing impacts of data volume and data set balance in using deep learning approach to human activity recognition [C]. Haipeng Chen, Fuhai Xiong, Dihong Wu, IEEE International Conference on Bioinformatics and Biomedicine. 2017
5. Human Activity Recognition: A Data-driven Approach. [D]. Luong, Thi Bich Thuy. 2015
6. Evaluating the Impact of a Two-Stage Multivariate Data Cleansing Approach to Improve to the Performance of Machine Learning Classifiers: A Case Study in Human Activity Recognition [O]. Dionicio Neira-Rodado, Chris Nugent, Ian Cleland, 2020
7. ERA: A Data Set and Deep Learning Benchmark for Event Recognition in Aerial Videos Software and Data Sets [O]. Lichao Mou, Yuansheng Hua, Pu Jin, 2020
