首页> 外文会议>Conference on Counterterrorism, Crime Fighting, Forensics, and Surveillance Technologies >Deep learning-based fine-grained car make/model classification for visual surveillance
【24h】

Deep learning-based fine-grained car make/model classification for visual surveillance

机译:基于深度学习的细粒度汽车制作/模型分类,可视监测

获取原文

摘要

Fine-grained object recognition is a potential computer vision problem that has been recently addressed by utilizing deep Convolutional Neural Networks (CNNs). Nevertheless, the main disadvantage of classification methods relying on deep CNN models is the need for considerably large amount of data. In addition, there exists relatively less amount of annotated data for a real world application, such as the recognition of car models in a traffic surveillance system. To this end, we mainly concentrate on the classification of fine-grained car make and/or models for visual scenarios by the help of two different domains. First, a large-scale dataset including approximately 900K images is constructed from a website which includes fine-grained car models. According to their labels, a state-of-the-art CNN model is trained on the constructed dataset. The second domain that is dealt with is the set of images collected from a camera integrated to a traffic surveillance system. These images, which are over 260K, are gathered by a special license plate detection method on top of a motion detection algorithm. An appropriately selected size of the image is cropped from the region of interest provided by the detected license plate location. These sets of images and their provided labels for more than 30 classes are employed to fine-tune the CNN model which is already trained on the large scale dataset described above. To fine-tune the network, the last two fully-connected layers are randomly initialized and the remaining layers are fine-tuned in the second dataset. In this work, the transfer of a learned model on a large dataset to a smaller one has been successfully performed by utilizing both the limited annotated data of the traffic field and a large scale dataset with available annotations. Our experimental results both in the validation dataset and the real field show that the proposed methodology performs favorably against the training of the CNN model from scratch.
机译:细粒度的物体识别是最近通过利用深卷积神经网络(CNNS)来解决的潜在计算机视觉问题。尽管如此,依赖于深度CNN模型的分类方法的主要缺点是需要大量的数据。此外,存在相对较少量的用于现实世界应用的注释数据,例如在交通监测系统中识别汽车模型。为此,我们主要集中在两个不同的域通过两个不同的域通过两个不同的域来分类微粒汽车制作和/或模型的视觉方案。首先,包括大约900K图像的大规模数据集由包括细粒型汽车模型的网站构建。根据其标签,在构造的数据集上培训了最先进的CNN模型。处理的第二个域是从集成到流量监控系统的摄像机收集的一组图像。这些图像超过260k,通过特殊的牌照检测方法在运动检测算法的顶部收集。 An appropriately selected size of the image is cropped from the region of interest provided by the detected license plate location.这些图像和它们提供的标签超过30个类别用于微调已经在上述大规模数据集上培训的CNN模型。为了微调网络,最后两个完全连接的图层是随机初始化的,并且剩余的图层在第二个数据集中进行微调。在这项工作中,通过利用流量字段的有限注释数据和具有可用注释的大规模数据集来成功地执行将学习模型转移到较小的数据集。我们在验证数据集和真实领域的实验结果表明,所提出的方法对来自划痕的CNN模型的培训表现有利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号