首页> 外文会议>International conference on multimedia modeling >CNN-Based Classification of Illustrator Style in Graphic Novels: Which Features Contribute Most?
【24h】

CNN-Based Classification of Illustrator Style in Graphic Novels: Which Features Contribute Most?

机译:图形小说中基于CNN的插画风格分类:哪些功能贡献最大?

获取原文

摘要

Can classification of graphic novel illustrators be achieved by convolutional neural network (CNN) features evolved for classifying concepts on photographs? Assuming that basic features at lower network levels generically represent invariants of our environment, they should be reusable. However, features at what level of abstraction are characteristic of illustrator style? We tested transfer learning by classifying roughly 50,000 digitized pages from about 200 comic books of the Graphic Narrative Corpus (GNC, [6]) by illustrator. For comparison, we also classified Mangal09 [18] by book. We tested the predictability of visual fear tures by experimentally varying which of the mixed layers of Inception V3 [29] was used to train classifiers. Overall, the top-1 test-set classification accuracy in the artist attribution analysis increased from 92% for mixed-layer 0 to over 97% when adding mixed-layers higher in the hierarchy. Above mixed-layer 5, there were signs of overfitting, suggesting that texture-like mid-level vision features were sufficient. Experiments varying input material show that page layout and coloring scheme are important contributors. Thus, stylistic classification of comics artists is possible re-using pre-trained CNN features, given only a limited amount of additional training material. We propose that CNN features are general enough to provide the foundation of a visual stylometry, potentially useful for comparative art history.
机译:卷积神经网络(CNN)功能是否可以通过发展用于对照片概念进行分类的卷积神经网络来实现对图形小说插图画家的分类?假设较低网络级别的基本功能通常代表我们环境的不变性,则它们应该是可重用的。但是,插画家风格的特征是什么抽象水平的?我们通过对插画家对来自大约200本书的图形叙事语料库(GNC,[6])漫画中的大约50,000个数字化页面进行分类来测试迁移学习。为了进行比较,我们还按书对Mangal09 [18]进行了分类。我们通过实验改变Inception V3的哪些混合层[29]来训练分类器,从而测试了视觉恐惧的可预测性。总体而言,在艺术家归因分析中,排名前1的测试集分类准确性从混合层0的92%增加到了在层次结构中添加更高的混合层时的97%以上。在混合层5上方,有过度拟合的迹象,表明类似纹理的中层视觉特征就足够了。各种输入材料的实验表明,页面布局和着色方案是重要的贡献者。因此,在仅提供有限数量的额外培训材料的情况下,可以重新使用预先训练的CNN功能,对漫画艺术家进行风格分类。我们建议CNN功能足够通用,可以提供视觉测绘法的基础,这可能对比较艺术史有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号