首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >Ensembles of Deep Learning Models and Transfer Learning for Ear Recognition
【2h】

Ensembles of Deep Learning Models and Transfer Learning for Ear Recognition

机译:深度学习模型和用于耳识别的转移学习的集成

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The recognition performance of visual recognition systems is highly dependent on extracting and representing the discriminative characteristics of image data. Convolutional neural networks (CNNs) have shown unprecedented success in a variety of visual recognition tasks due to their capability to provide in-depth representations exploiting visual image features of appearance, color, and texture. This paper presents a novel system for ear recognition based on ensembles of deep CNN-based models and more specifically the Visual Geometry Group (VGG)-like network architectures for extracting discriminative deep features from ear images. We began by training different networks of increasing depth on ear images with random weight initialization. Then, we examined pretrained models as feature extractors as well as fine-tuning them on ear images. After that, we built ensembles of the best models to further improve the recognition performance. We evaluated the proposed ensembles through identification experiments using ear images acquired under controlled and uncontrolled conditions from mathematical analysis of images (AMI), AMI cropped (AMIC) (introduced here), and West Pomeranian University of Technology (WPUT) ear datasets. The experimental results indicate that our ensembles of models yield the best performance with significant improvements over the recently published results. Moreover, we provide visual explanations of the learned features by highlighting the relevant image regions utilized by the models for making decisions or predictions.
机译:视觉识别系统的识别性能高度依赖于提取和表示图像数据的区别特征。卷积神经网络(CNN)由于具有利用外观,颜色和纹理的视觉图像特征提供深度表示的能力,因此已在各种视觉识别任务中取得了空前的成功。本文提出了一种基于深度CNN的模型,更具体地说是类似于视觉几何组(VGG)的网络体系结构的人耳识别系统,用于从耳朵图像中提取具有区别的深层特征。我们首先通过随机权重初始化训练不同深度耳朵图像网络。然后,我们检查了作为特征提取器的预训练模型,并在耳部图像上对其进行了微调。之后,我们构建了最佳模型的集合,以进一步提高识别性能。我们通过识别实验评估了拟议的合奏,该实验使用在受控和非受控条件下从图像的数学分析(AMI),AMI裁剪(AMIC)(在此处引入)和西波美拉尼亚科技大学(WPUT)的耳朵数据集中获取的耳朵图像进行。实验结果表明,我们的模型集合产生了最佳性能,并且与最近发布的结果相比有了显着改进。此外,我们通过突出显示模型用于做出决策或预测的相关图像区域,来提供对所学功能的直观解释。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号