首页> 外文期刊>Frontiers in Bioengineering and Biotechnology >Systematic Comparison of the Influence of Different Data Preprocessing Methods on the Performance of Gait Classifications Using Machine Learning
【24h】

Systematic Comparison of the Influence of Different Data Preprocessing Methods on the Performance of Gait Classifications Using Machine Learning

机译:系统预处理方法对使用机器学习的步态分类性能影响的系统性比较

获取原文
           

摘要

Human movements are characterized by highly non-linear and multi-dimensional interactions within the motor system. Therefore, the future of human movement analysis requires procedures that enhance the classification of movement patterns into relevant groups and support practitioners in their decisions. In this regard, the use of data-driven techniques seems to be particularly suitable to generate classification models. Recently, an increasing emphasis on machine-learning applications has led to a significant contribution, e.g., in increasing the classification performance. In order to ensure the generalizability of the machine-learning models, different data preprocessing steps are usually carried out to process the measured raw data before the classifications. In the past, various methods have been used for each of these preprocessing steps. However, there are hardly any standard procedures or rather systematic comparisons of these different methods and their impact on the classification performance. Therefore, the aim of this analysis is to compare different combinations of commonly applied data preprocessing steps and test their effects on the classification performance of gait patterns. A publicly available dataset on intra-individual changes of gait patterns was used for this analysis. Forty-two healthy participants performed 6 sessions of 15 gait trials for one day. For each trial, two force plates recorded the three-dimensional ground reaction forces (GRFs). The data was preprocessed with the following steps: GRF filtering, time derivative, time normalization, data reduction, weight normalization and data scaling. Subsequently, combinations of all methods from each preprocessing step were analyzed by comparing their prediction performance in a six-session classification using Support Vector Machines, Random Forest Classifiers, Multi-Layer Perceptrons and Convolutional Neural Networks. The results indicate that filtering GRF data and a supervised data reduction (e.g., using Principal Components Analysis) lead to increased prediction performance of the machine-learning classifiers. Interestingly, the weight normalization and the number of data points (above a certain minimum) in the time normalization does not have a substantial effect. In conclusion, the present results provide first domain-specific recommendations for commonly applied data preprocessing methods and might help to build more comparable and more robust classification models based on machine learning that are suitable for a practical application.
机译:人类运动的特征在于电动机系统内的高度非线性和多维相互作用。因此,人类运动分析的未来需要将运动模式分类转变为相关群体以及支持从业者的决定。在这方面,使用数据驱动技术似乎特别适合生成分类模型。最近,增加了对机器学习应用的强调导致了显着的贡献,例如,在增加分类性能时。为了确保机器学习模型的概括性,通常执行不同的数据预处理步骤以在分类之前处理测量的原始数据。过去,已经针对这些预处理步骤中的每一个使用了各种方法。然而,几乎没有任何标准程序或相当系统地比较这些不同的方法及其对分类性能的影响。因此,该分析的目的是比较常用数据预处理步骤的不同组合,并测试它们对步态模式的分类性能的影响。该分析使用了一个关于个人步态模式的个人变化的公共数据集。四十二个健康的参与者每天进行6个步态审判的课程。对于每次试验,两个力板记录了三维地面反作用力(GRF)。数据被预处理以下步骤:GRF滤波,时间衍生,时间归一化,数据减少,权重标准化和数据缩放。随后,通过使用支持向量机,随机森林分类器,多层感知和卷积神经网络将其预处理性能与六会议分类中的预测性能进行比较,分析来自每个预处理步骤的所有方法的组合。结果表明,过滤GRF数据和监督数据减少(例如,使用主成分分析)导致机器学习分类器的预测性能提高。有趣的是,在时间归一化中的重量归一化和数据点数(高于一定最小值)没有显着效果。总之,目前的结果为常用数据预处理方法提供了第一域的特定建议,并可能有助于基于适合实际应用的机器学习构建更具可比和更强大的分类模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号