首页> 外文会议>International Computer Science and Engineering Conference >Word Recognition in Captured Images by CNN Trained with Synthetic Images
【24h】

Word Recognition in Captured Images by CNN Trained with Synthetic Images

机译:CNN用合成图像训练的CNN捕获图像中的字识别

获取原文

摘要

Problems like robotic navigation and automatic geocoding of businesses require artificial agents to perform rapid and accurate word recognition in natural images. We set out to develop a deep learning method to recognize words from different languages in captured images, with high accuracy and with small number of captured samples. Our experiments reveal three main findings. First, we feed images of words as inputs to the neural network directly, omitting segmentation and postprocessing step to avoid compound errors. We found this method to work well for our samples. Second, we are able to train machine learning models to recognize words using purely synthetic training samples by applying feature extractions to both training and testing datasets prior to passing them through deep networks. This achievement allows us to train neural network cheaply on synthetic data and transfer knowledge to recognize words in real data. Third, we set up experiments to compare model performances when using Canny edge detection and Chu's 3D thinning algorithm as preprocessing methods. We found that Canny edge detection performs better in most cases.
机译:机器人导航和企业自动地理编码等问题需要人工代理在自然图像中进行快速准确的词识别。我们开始开发深入学习方法,以识别来自捕获图像中不同语言的单词,具有高精度和少量捕获的样本。我们的实验显示了三种主要结果。首先,我们将单词图像作为输入到神经网络的输入,省略分段和后处理步骤以避免复合误差。我们发现这种方法适用于我们的样本。其次,我们能够培训机器学习模型,通过在通过深网络传递之前应用于训练和测试数据集来识别使用纯合成训练样本的单词。这一成就允许我们廉价地培训神经网络,以综合数据和转移知识来识别实际数据中的单词。第三,我们在使用Canny Edge检测和CHU的3D变薄算法作为预处理方法时,设置实验以比较模型性能。我们发现在大多数情况下,Canny Edge检测更好地表现更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号