首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops >Co-Compressing and Unifying Deep CNN Models for Efficient Human Face and Speaker Recognition
【24h】

Co-Compressing and Unifying Deep CNN Models for Efficient Human Face and Speaker Recognition

机译:共压缩和统一深度CNN模型以实现有效的人脸和说话人识别

获取原文

摘要

Deep CNN models have become state-of-the-art techniques in many application, e.g., face recognition, speaker recognition, and image classification. Although many studies address on speedup or compression of individual models, very few studies focus on co-compressing and unifying models from different modalities. In this work, to joint and compress face and speaker recognition models, a shared-codebook approach is adopted to reduce the redundancy of the combined model. Despite the modality of the inputs of these two CNN models are quite different, the shared codebook can support two CNN models of sound and image for speaker and face recognition. Experiments show the promising results of unified and co-compressing heterogeneous models for efficient inference.
机译:深度CNN模型已成为许多应用程序中的最新技术,例如人脸识别,说话者识别和图像分类。尽管许多研究着眼于单个模型的加速或压缩,但是很少有研究着重于共压缩和统一来自不同模式的模型。在这项工作中,为了联合和压缩面部和说话者识别模型,采用了共享码本方法来减少组合模型的冗余度。尽管这两个CNN模型的输入形式有很大不同,但共享代码簿可以支持声音和图像的两个CNN模型以用于说话者和面部识别。实验表明,将统一模型和共压缩异构模型用于有效推理的结果令人鼓舞。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号