首页> 外文会议>International Conference on Discovery Science >Deep Learning Does Not Generalize Well to Recognizing Cats and Dogs in Chinese Paintings
【24h】

Deep Learning Does Not Generalize Well to Recognizing Cats and Dogs in Chinese Paintings

机译:深入学习并未概括为识别中国绘画中的猫和狗

获取原文

摘要

Although Deep Learning (DL) image analysis has made recent rapid advances, it still has limitations that indicate that its approach differs significantly from human vision, e.g. the requirement for large training sets, and adversarial attacks. Here we show that DL also differs in failing to generalize well to Traditional Chinese Paintings (TCPs). We developed a new DL object detection method A-RPN (Assembled Region Proposal Network), which concatenates low-level visual information, and high-level semantic knowledge to reduce coarseness in region-based object detection. A-RPN significantly outperforms YOLO2 and Faster R-CNN on natural images (P < 0.02). We applied YOLO2, Faster R-CNN and A-RPN to TCPs with a 12.9%, 13.2% and 13.4%) drop in mAP compared to natural images. There was little or no difference in recognizing humans, but a large drop in mAP for cats and dogs (27%) & 31%), and very large drop for horses (35.9%). The abstract nature of TCPs may be responsible for DL poor performance.
机译:虽然深度学习(DL)图像分析已经实现了最近的快速进步,但它仍然有局限性表明其方法与人类视觉有显着不同,例如,大型训练集的要求和对抗攻击。在这里,我们表明DL也与传统的中国绘画(TCPS)概括不足。我们开发了一种新的DL对象检测方法A-RPN(组装区域提议网络),它串联低级视觉信息和高级语义知识,以减少基于区域的物体检测中的粗糙度。 A-RPN在自然图像上显着优于YOLO2和更快的R-CNN(P <0.02)。与自然图像相比,我们应用Yolo2,更快的R-CNN和A-RPN到TCP,在地图中下降12.9%,13.2%和13.4%。识别人类很少或没有差异,但猫狗和31%的猫和狗的地图大幅下降,而且马匹(35.9%)也很大。 TCP的抽象性质可能对DL性能不佳负责。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号