首页> 外文会议>International conference on discovery science >Deep Learning Does Not Generalize Well to Recognizing Cats and Dogs in Chinese Paintings
【24h】

Deep Learning Does Not Generalize Well to Recognizing Cats and Dogs in Chinese Paintings

机译:深度学习不能很好地概括中国画中识别猫和狗的能力

获取原文

摘要

Although Deep Learning (DL) image analysis has made recent rapid advances, it still has limitations that indicate that its app-roacn differs significantly from human vision, e.g. the requirement for large training sets, and adversarial attacks. Here we show that DL also differs in failing to generalize well to Traditional Chinese Paintings (TCPs). We developed a new DL object detection method A-RPN (Assembled Region Proposal Network), which concatenates low-level visual information, and high-level semantic knowledge to reduce coarseness in region-based object detection. A-RPN significantly outperforms Y0LO2 and Faster R-CNN on natural images (P < 0.02). We applied YOLO2, Faster R-CNN and A-RPN to TCPs with a 12.9%, 13.2% and 13.4% drop in mAP compared to natural images. There was little or no difference in recognizing humans, but a large drop in mAP for cats and dogs (27% & 31%), and very large drop for horses (35.9%). The abstract nature of TCPs may be responsible for DL poor performance.
机译:尽管深度学习(DL)图像分析在近期取得了长足的进步,但它仍然存在局限性,表明其应用范围与人类视觉有很大不同,例如需要大量训练和对抗性攻击。在这里,我们显示DL在未能很好地概括到中国传统绘画(TCP)方面也有所不同。我们开发了一种新的DL对象检测方法A-RPN(组合区域提议网络),该方法将低级视觉信息和高级语义知识连接在一起,以减少基于区域的对象检测中的粗糙性。在自然图像上,A-RPN明显优于Y0LO2和Faster R-CNN(P <0.02)。我们将YOLO2,Faster R-CNN和A-RPN应用于TCP,与自然图像相比,mAP分别下降了12.9%,13.2%和13.4%。识别人类的差异很小或没有差异,但是猫和狗的mAP下降幅度很大(27%和31%),而马的mAP下降幅度很大(35.9%)。 TCP的抽象性质可能是DL性能不佳的原因。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号