首页> 外文期刊>Neurocomputing >Sketch-specific data augmentation for freehand sketch recognition
【24h】

Sketch-specific data augmentation for freehand sketch recognition

机译:写手素描识别的素描特定数据增强

获取原文
获取原文并翻译 | 示例
       

摘要

Sketch recognition remains a significant challenge due to the limited training data and the substantial intra-class variance of freehand sketches for the same object. Conventional methods for this task often rely on the availability of the temporal order of sketch strokes, additional cues acquired from different modalities and supervised augmentation of sketch datasets with real images, which also limit the applicability and feasibility of these methods in real scenarios. In this paper, we propose a novel sketch-specific data augmentation (SSDA) method that leverages the quantity and quality of the sketches automatically. From the aspect of quantity, we introduce a Bezier pivot based deformation (BPD) strategy to enrich the training data. Towards quality improvement, we present a mean stroke reconstruction (MSR) approach to generate a set of novel types of sketches with smaller intra-class variances. Both of these solutions are unrestricted from any multi-source data and temporal cues of sketches. Furthermore, we show that some recent deep convolutional neural network models that are trained on generic classes of real images can be better choices than most of the elaborate architectures that are designed explicitly for sketch recognition. As SSDA can be integrated with any convolutional neural networks, it has a distinct advantage over the existing methods. Our extensive experimental evaluations demonstrate that the proposed method achieves the state-of-the-art results (84.27%) on the TU-Berlin dataset, outperforming the human performance by a remarkable 11.17% increase. Finally, more experiments show the practical value of our approach for the task of sketch-based image retrieval. (c) 2021 Elsevier B.V. All rights reserved.
机译:由于培训数据有限,并且对同一对象的徒手草图的大量内部方差,草图识别仍然是一个重大挑战。此任务的常规方法往往依赖于素描笔画的时间顺序的可用性,从不同的方式获取的额外提示以及使用真实图像监控草图数据集的增强,这也限制了这些方法在真实方案中的适用性和可行性。在本文中,我们提出了一种新的草图特定的数据增强(SSDA)方法,可以自动利用草图的数量和质量。从数量方面来看,我们引入了基于贝塞尔的枢轴变形(BPD)策略来丰富培训数据。为了质量改进,我们介绍了一个平均中风重建(MSR)方法,以产生一组具有较小类别的差异的一组新颖的草图。这两种解决方案都没有从草图的任何多源数据和时间线索中不受限制。此外,我们表明,近期在普通的真实图像上培训的深度卷积神经网络模型可以是比大多数设计用于草图识别的大多数精心设计的艺术架构。随着SSDA可以与任何卷积神经网络集成,它对现有方法具有明显的优势。我们广泛的实验评估表明,该方法在Tu-Berlin数据集上实现了最先进的结果(84.27%),以卓越的11.17%的增加表现优于人类性能。最后,更多的实验表明了我们对基于草图的图像检索任务的方法的实际价值。 (c)2021 elestvier b.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号