A Gaussian Data Augmentation Technique on Highly Dimensional, Limited Labeled Data for Multiclass Classification Using Deep Learning

机译：使用深度学习对高维，受限标签数据进行多类分类的高斯数据增强技术

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In recent years, using oceans of data and virtually infinite cloud-based computation power, deep learning models leverage the current state-of-the-art classification to reach expert level performance. Researchers continue to explore applications of deep machine learning models ranging from face-, text- and voice-recognition to signal and information processing. With the continuously increasing data collection capabilities, datasets are becoming larger and more dimensional. However, manually labeled data points cannot keep up. It is this disparity between the high number of features and the low number of labeled samples what motivates a new approach to integrate feature reduction and sample augmentation to deep learning classifiers. This paper explores the performance of such approach on three deep learning classifiers: MLP, CNN, and LSTM. First, we establish a baseline using the original dataset. Second, we preprocess the dataset using principal component analysis (PCA). Third, we preprocess the dataset with PCA followed by our Gaussian data augmentation (GDA) technique. To estimate performance, we add k-fold cross-validation to our experiments and compile our results in a numerical and graphical using the confusion matrix and a classification report that includes accuracy, recall, f-score and support. Our experiments suggest superior classification accuracy of all three classifiers in the presence of our PCA+GDA approach.

机译：近年来，深度学习模型利用数据的海洋和几乎无限的基于云的计算能力，利用当前的最新分类来达到专家级的性能。研究人员继续探索深度机器学习模型的应用，范围从面部，文本和语音识别到信号和信息处理。随着数据收集能力的不断提高，数据集正在变得越来越大，维度越来越大。但是，手动标记的数据点无法跟上。正是由于大量特征和少量标记样本之间的这种差异，才促使人们采用一种新的方法来将特征缩减和样本扩充集成到深度学习分类器中。本文探讨了这种方法在三个深度学习分类器上的性能：MLP，CNN和LSTM。首先，我们使用原始数据集建立基线。其次，我们使用主成分分析（PCA）对数据集进行预处理。第三，我们先使用PCA预处理数据集，然后再使用高斯数据扩充（GDA）技术。为了评估性能，我们在实验中添加了k倍交叉验证，并使用混淆矩阵和包括准确性，召回率，f评分和支持度的分类报告以数字和图形的形式汇总了我们的结果。我们的实验表明，在使用PCA + GDA方法的情况下，所有三个分类器的分类准确度都很高。

著录项

来源
《International Conference on Intelligent Control and Information Processing》|2019年|145-151|共7页
会议地点
作者
Juan F. Ramirez Rochac; Lily Liang; Nian Zhang; Timothy Oladunni;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Machine learning; Principal component analysis; Signal to noise ratio; Hyperspectral imaging; Computational modeling; Data models; Quantum cascade lasers;

机译：机器学习;主成分分析;信噪比;高光谱成像;计算建模;数据模型;量子级联激光器;

相似文献

外文文献
中文文献
专利

1. Deep bacteria: robust deep learning data augmentation design for limited bacterial colony dataset [J] . Nour Eldeen Mahmoud Khalifa, Mohamed Hamed N. Taha, Aboul Ella Hassanien, International journal of reasoning-based intelligent systems . 2019,第3期

机译：深层细菌：适用于有限细菌菌落数据集的强大的深度学习数据增强设计
2. Deep learning enabled multi-wavelength spatial coherence microscope for the classification of malaria-infected stages with limited labelled data size [J] . Singla Neeru, Srivastava Vishal Optics & Laser Technology . 2020,第1期

机译：深度学习使多波长空间相干显微镜具有有限标记数据尺寸的疟疾感染阶段的分类
3. Transfer Learning with Deep Convolutional Neural Network for SAR Target Classification with Limited Labeled Data [J] . Zhongling Huang, Zongxu Pan, Bin Lei Remote Sensing . 2017,第9期

机译：深度卷积神经网络的转移学习用于有限标签数据的SAR目标分类
4. A Gaussian Data Augmentation Technique on Highly Dimensional, Limited Labeled Data for Multiclass Classification Using Deep Learning [C] . Juan F. Ramirez Rochac, Lily Liang, Nian Zhang, International Conference on Intelligent Control and Information Processing . 2019

机译：高斯数据增强技术对高度尺寸，有限标记数据进行深层学习的多款分类数据
5. Evaluation of Synthetic Training Data and Training-Data-Augmentation Techniques for Object Detection in Ground-Penetrating Radar Data using Deep-Learning Models [D] . Ruggiero, Jean. 2021

机译：使用深度学习模型评估用于地面穿透雷达数据的对象检测的综合训练数据和训练数据增强技术
6. Comparison between Statistical Models and Machine Learning Methods on Classification for Highly Imbalanced Multiclass Kidney Data [O] . Bomi Jeong, Hyunjeong Cho, Jieun Kim, 2020

机译：高度不平衡的多类肾脏数据分类的统计模型与机器学习方法的比较
7. Transfer Learning with Deep Convolutional Neural Network for SAR Target Classification with Limited Labeled Data [O] . Zhongling Huang, Zongxu Pan, Bin Lei 2017

机译：用Liment Liment标记数据与SAR目标分类进行深度卷积神经网络的转移学习

A Gaussian Data Augmentation Technique on Highly Dimensional, Limited Labeled Data for Multiclass Classification Using Deep Learning

摘要

著录项

相似文献

相关主题

期刊订阅