Systematic Comparison of the Influence of Different Data Preprocessing Methods on the Performance of Gait Classifications Using Machine Learning

Burdack Johannes; Horst Fabian; Giesselbach Sven; Hassan Ibrahim; Daffner Sabrina; Sch?llhorn Wolfgang I.

首页> 外文期刊>Frontiers in Bioengineering and Biotechnology >Systematic Comparison of the Influence of Different Data Preprocessing Methods on the Performance of Gait Classifications Using Machine Learning

【24h】

Systematic Comparison of the Influence of Different Data Preprocessing Methods on the Performance of Gait Classifications Using Machine Learning

机译：系统预处理方法对使用机器学习的步态分类性能影响的系统性比较

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Human movements are characterized by highly non-linear and multi-dimensional interactions within the motor system. Therefore, the future of human movement analysis requires procedures that enhance the classification of movement patterns into relevant groups and support practitioners in their decisions. In this regard, the use of data-driven techniques seems to be particularly suitable to generate classification models. Recently, an increasing emphasis on machine-learning applications has led to a significant contribution, e.g., in increasing the classification performance. In order to ensure the generalizability of the machine-learning models, different data preprocessing steps are usually carried out to process the measured raw data before the classifications. In the past, various methods have been used for each of these preprocessing steps. However, there are hardly any standard procedures or rather systematic comparisons of these different methods and their impact on the classification performance. Therefore, the aim of this analysis is to compare different combinations of commonly applied data preprocessing steps and test their effects on the classification performance of gait patterns. A publicly available dataset on intra-individual changes of gait patterns was used for this analysis. Forty-two healthy participants performed 6 sessions of 15 gait trials for one day. For each trial, two force plates recorded the three-dimensional ground reaction forces (GRFs). The data was preprocessed with the following steps: GRF filtering, time derivative, time normalization, data reduction, weight normalization and data scaling. Subsequently, combinations of all methods from each preprocessing step were analyzed by comparing their prediction performance in a six-session classification using Support Vector Machines, Random Forest Classifiers, Multi-Layer Perceptrons and Convolutional Neural Networks. The results indicate that filtering GRF data and a supervised data reduction (e.g., using Principal Components Analysis) lead to increased prediction performance of the machine-learning classifiers. Interestingly, the weight normalization and the number of data points (above a certain minimum) in the time normalization does not have a substantial effect. In conclusion, the present results provide first domain-specific recommendations for commonly applied data preprocessing methods and might help to build more comparable and more robust classification models based on machine learning that are suitable for a practical application.

机译：人类运动的特征在于电动机系统内的高度非线性和多维相互作用。因此，人类运动分析的未来需要将运动模式分类转变为相关群体以及支持从业者的决定。在这方面，使用数据驱动技术似乎特别适合生成分类模型。最近，增加了对机器学习应用的强调导致了显着的贡献，例如，在增加分类性能时。为了确保机器学习模型的概括性，通常执行不同的数据预处理步骤以在分类之前处理测量的原始数据。过去，已经针对这些预处理步骤中的每一个使用了各种方法。然而，几乎没有任何标准程序或相当系统地比较这些不同的方法及其对分类性能的影响。因此，该分析的目的是比较常用数据预处理步骤的不同组合，并测试它们对步态模式的分类性能的影响。该分析使用了一个关于个人步态模式的个人变化的公共数据集。四十二个健康的参与者每天进行6个步态审判的课程。对于每次试验，两个力板记录了三维地面反作用力（GRF）。数据被预处理以下步骤：GRF滤波，时间衍生，时间归一化，数据减少，权重标准化和数据缩放。随后，通过使用支持向量机，随机森林分类器，多层感知和卷积神经网络将其预处理性能与六会议分类中的预测性能进行比较，分析来自每个预处理步骤的所有方法的组合。结果表明，过滤GRF数据和监督数据减少（例如，使用主成分分析）导致机器学习分类器的预测性能提高。有趣的是，在时间归一化中的重量归一化和数据点数（高于一定最小值）没有显着效果。总之，目前的结果为常用数据预处理方法提供了第一域的特定建议，并可能有助于基于适合实际应用的机器学习构建更具可比和更强大的分类模型。

著录项

来源
《Frontiers in Bioengineering and Biotechnology》 |2020年第1期|共12页
作者
Burdack Johannes; Horst Fabian; Giesselbach Sven; Hassan Ibrahim; Daffner Sabrina; Sch?llhorn Wolfgang I.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词
Gait classificationData Selectiondata processingground reaction force (GRF)Multi-layer perceptron (MLP)support vector machine (SVM)random forest classifier (RFC)Convolutional neural network (CNN);

机译：步态分类Data SelectionData处理地图反作用力（GRF）多层Perceptron（MLP）支持向量机（SVM）随机森林分类器（RFC）卷积神经网络（CNN）;

相似文献

外文文献
中文文献
专利

1. Machine learning, waveform preprocessing and feature extraction methods for classification of acoustic startle waveforms [J] . Timothy J. Fawcett, Chad S. Cooper, Ryan J. Longenecker, MethodsX . 2021,第a期

机译：机器学习，波形预处理和声学震动波形分类的特征提取方法
2. Systematic comparison of five machine-learning models in classification and interpolation of soil particle size fractions using different transformed data [J] . Zhang Mo, Shi Wenjiao, Xu Ziwei Hydrology and Earth System Sciences Discussions . 2020,第5期

机译：不同转化数据的土壤粒度分数分类和插值中的五种机器学习模型的系统比较
3. Systematic comparison of five machine-learning models in classification and interpolation of soil particle size fractions using different transformed data [J] . Mo Zhang, Wenjiao Shi, Ziwei Xu Hydrology and Earth System Sciences . 2020,第5期

机译：不同转化数据的土壤粒度分数分类和插值中的五种机器学习模型的系统比较
4. Machine Learning Methods Based Preprocessing to Improve Categorical Data Classification [C] . Zoila Ruiz-Chavez, Jaime Salvador-Meneses, Jose Garcia-Rodriguez International conference on intelligent data engineering and automated learning . 2018

机译：基于机器学习方法的预处理以改进分类数据分类
5. Application of Engineering Principles with a Comparison of Machine Learning Classification Methods to Predict Treatment Outcomes in Head and Neck Cancer Patients. [D] . Hernandez, Alberto Miranda. 2016

机译：工程原理与机器学习分类方法的比较在预测头颈癌患者治疗结果中的应用。
6. Systematic Comparison of the Influence of Different Data Preprocessing Methods on the Performance of Gait Classifications Using Machine Learning [O] . Johannes Burdack, Fabian Horst, Sven Giesselbach, 2020

机译：使用机器学习系统比较不同数据预处理方法对步态分类性能的影响
7. Supplementary material to quot;Systematic comparison of five machine-learning methods in classification and interpolation of soil particle size fractions using different transformed dataquot; [O] . Mo Zhang, Wenjiao Shi 2020

机译：补充材料“使用不同转化数据进行分类和插值中的五种机器学习方法的系统比较”
8. Data Processing And Machine Learning Methods For Multi-Modal Operator State Classification Systems. [R] . Hearn, T. A. 2015

机译：多模态算子状态分类系统的数据处理和机器学习方法。

Systematic Comparison of the Influence of Different Data Preprocessing Methods on the Performance of Gait Classifications Using Machine Learning

摘要

著录项

相似文献

相关主题

期刊订阅