首页> 外文会议>IAPR International Workshop on Document Analysis Systems >Exploring Critical Aspects of CNN-based Keyword Spotting. A PHOCNet Study
【24h】

Exploring Critical Aspects of CNN-based Keyword Spotting. A PHOCNet Study

机译:探索基于CNN的关键字发现的关键方面。 PHOCNet研究

获取原文

摘要

Deep convolutional neural networks are today the new baseline for a wide range of machine vision tasks. The problem of keyword spotting is no exception to this rule. Many successful network architectures and learning strategies have been adapted from other vision tasks to create successful keyword spotting systems. In this paper, we argue that various details concerning this adaptation could be re-examined, to the end of building stronger spotting models. In particular, we examine the usefulness of a pyramidal spatial pooling layer versus a simpler approach, and show that a zoning strategy combined with fixed-size inputs can be just as effective while less computationally expensive. We also examine the usefulness of augmentation, class balancing and ensemble learning strategies and propose an improved network. Our hypotheses are tested with numerical experiments on the IAM document collection, where the proposed network outperforms all other existing models.
机译:如今,深度卷积神经网络已成为各种机器视觉任务的新基准。关键字发现的问题也不例外。许多成功的网络体系结构和学习策略已从其他视觉任务改编成成功的关键词发现系统。在本文中,我们认为可以重新检查有关这种适应性的各种细节,直到建立更强大的发现模型为止。特别是,我们研究了金字塔形空间池化层与较简单方法相比的有用性,并显示了结合固定大小输入的分区策略可以在降低计算成本的同时实现同样的效果。我们还研究了增强,类平衡和合奏学习策略的有用性,并提出了一个改进的网络。我们的假设通过IAM文件收集的数值实验进行了检验,其中建议的网络优于所有其他现有模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号