Implementing data processing toward the edge of the Internet of Things (IoT) is an important requirement in order to produce real-time feedback according to some decision-based approach. The authors aim to experimentally prove that "simplicity is best." Their experiment-based paper considers three platforms (Notebook i7-7500U, Raspberry PI 3 model B+, Raspberry Zero W) and seven machine learning algorithms for classification (logistic regression, random forest, k-nearest neighbors, naive Bayes, linear support vector machine, multilayer perceptron, and decision trees). They use the Scikit-learn Python library and a dataset for ADL recognition with wrist-worn accelerometer measurements with 839 instances and 14 classes (available on the UCI repository). All results were obtained by training with over 80 percent of data along a five-step cross-validation process. The applied methodology deals with the signals generated by a three-axial inertial sensor (X,Y,Z). After preprocessing, for every component of the signal, five segments are considered. Nine features are computed for each segment, as well for the entire sequence: min, max, average, median, kurtosis, skewness, standard deviation, variance, and mean absolute deviation. Dimension reduction processes, based on chi scores, are used to select first the top ten features and then the top three features.
展开▼
机译:实现朝向物联网边缘的数据处理(IOT)是根据基于决策方法产生实时反馈的重要要求。作者的目标是通过实验证明“简单是最好的”。他们的实验纸张考虑了三个平台(笔记本电脑I7-7500U,Raspberry PI 3型号B +,Raspberry Zero W)和七种机器学习算法(Logistic回归,随机森林,K最近邻居,Naive Bayes,线性支持向量机,多层的感知者和决策树)。他们使用Scikit-Learn Python库和DataSet进行ADL识别,使用839实例和14个类(在UCI存储库上提供)。所有结果都是通过培训沿着五步交叉验证过程超过80%的数据。所应用的方法涉及由三轴惯性传感器(x,y,z)产生的信号。在预处理之后,对于信号的每个组件,考虑五个段。每个段计算九个特征,也适用于整个序列:最小,最大,平均,中值,峰,偏差,标准偏差,方差和平均绝对偏差。基于CHI分数的维度减少过程用于选择前十个特征,然后是前三个特征。
展开▼