首页> 外文会议>International Conference on Document Analysis and Recognition >Can One Deep Learning Model Learn Script-Independent Multilingual Word-Spotting?
【24h】

Can One Deep Learning Model Learn Script-Independent Multilingual Word-Spotting?

机译:一个深度学习模型可以学习独立于脚本的多语言单词发现吗?

获取原文

摘要

Word spotting has gained increased attention lately as it can be used to extract textual information from handwritten documents and scene-text images. Current word spotting approaches are designed to work on a single language and/or script. Building intelligent models that learn script-independent multilingual word-spotting is challenging due to the large variability of multilingual alphabets and symbols. We used ResNet-152 and the Pyramidal Histogram of Characters (PHOC) embedding to build a one-model script-independent multilingual word-spotting and we tested it on Latin, Arabic, and Bangla (Indian) languages. The one-model we propose performs on par with the multi-model language-specific word-spotting system, and thus, reduces the number of models needed for each script and/or language.
机译:Word Spotting最近获得了更多的注意,因为它可以用于从手写文档和场景文本图像中提取文本信息。目前的单词发现方法旨在处理单个语言和/或脚本。由于多语言字母和符号的巨大变化,建立学习脚本无关的多语言单词斑点的智能模型是具有挑战性的。我们使用了Reset-152和字符的金字塔直方图嵌入以构建一个型号脚本独立的多语言单词,我们在拉丁语,阿拉伯语和孟加拉(印度)语言上测试了它。我们提出的单型模型与多模型语言特定的字样系统进行执行,因此,减少每个脚本和/或语言所需的模型数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号