首页> 外文期刊>Journal of Information and Telecommunication >An analysis of rotation matrix and colour constancy data augmentation in classifying images of animals
【24h】

An analysis of rotation matrix and colour constancy data augmentation in classifying images of animals

机译:对动物图像进行分类的旋转矩阵和颜色恒定数据增强分析

获取原文
           

摘要

ABSTRACT In this paper, we examine a novel data augmentation (DA) method that transforms an image into a new image containing multiple rotated copies of the original image. The DA method creates a grid of cells, in which each cell contains a different randomly rotated image and introduces a natural background in the newly created image. We investigate the use of deep learning to assess the classification performance on the rotation matrix or original dataset with colour constancy versions of the datasets. For the colour constancy methods, we use two well-known retinex techniques: the multi-scale retinex and the multi-scale retinex with colour restoration for enhancing both original (ORIG) and rotation matrix (ROT) images. We perform experiments on three datasets containing images of animals, from which the first dataset is collected by us and contains aerial images of cows or non-cow backgrounds. To classify the Aerial UAV images, we use a convolutional neural network (CNN) architecture and compare two loss functions (hinge loss and cross-entropy loss). Additionally, we compare the CNN to classical feature-based techniques combined with a k -nearest neighbour classifier or a support vector machine. The best approach is then used to examine the colour constancy DA variants, ORIG and ROT-DA alone for three datasets (Aerial UAV, Bird-600 and Croatia fish). The results show that the rotation matrix data augmentation is very helpful for the Aerial UAV dataset. Furthermore, the colour constancy data augmentation is helpful for the Bird-600 dataset. Finally, the results show that the fine-tuned CNNs significantly outperform the CNNs trained from scratch on the Croatia fish and the Bird-600 datasets, and obtain very high accuracies on the Aerial UAV and Bird-600 datasets.
机译:摘要在本文中,我们研究了一种新颖的数据增强(DA)方法,该方法可将图像转换为包含原始图像的多个旋转副本的新图像。 DA方法创建一个单元格网格,其中每个单元格包含一个不同的随机旋转图像,并在新创建的图像中引入自然背景。我们调查了深度学习的使用,以评估旋转矩阵或原始数据集的分类性能,这些数据集具有颜色恒定版本。对于颜色恒定性方法,我们使用两种众所周知的retinex技术:多尺度retinex和具有色彩还原的多尺度retinex,以增强原始(ORIG)和旋转矩阵(ROT)图像。我们在包含动物图像的三个数据集上进行实验,我们从中收集了第一个数据集,其中包含牛或非牛背景的航空图像。为了对航空无人机图像进行分类,我们使用了卷积神经网络(CNN)架构,并比较了两种损失函数(铰链损失和交叉熵损失)。此外,我们将CNN与结合k近邻分类器或支持向量机的经典基于特征的技术进行了比较。然后使用最佳方法来检查三个数据集(空中无人机,Bird-600和克罗地亚鱼)的颜色恒定DA变体,ORIG和ROT-DA。结果表明,旋转矩阵数据的增加对航空无人机数据集非常有帮助。此外,颜色恒定性数据增强对Bird-600数据集很有帮助。最后,结果表明,经过微调的CNN明显优于在克罗地亚鱼类和Bird-600数据集上从头训练的CNN,并且在空中无人机和Bird-600数据集上获得了很高的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号