首页> 外文会议>International Conference on Document Analysis and Recognition >An Improved Component Tree Based Approach to User-Intention Guided Text Extraction from Natural Scene Images
【24h】

An Improved Component Tree Based Approach to User-Intention Guided Text Extraction from Natural Scene Images

机译:一种改进的基于组件树的自然场景图像中用户意图指导文本提取方法

获取原文

摘要

We have proposed previously a component-tree based approach to user-intention guided text extraction from natural scene images. In this paper, in addition to improving the performance of text extraction algorithm for "swipe" gesture, the algorithm has also been extended to support a new mode of using "tap" gesture to indicate the intended text. Given a grayscale image, two component-trees are built and pre-pruned first by using a so-called contrasting extremal region (CER) criterion and simple rules of geometric features. The remaining nodes are enhanced by using color information in a perceptual color space. Then, a pre-trained neural network is used to classify a selected set of enhanced nodes as single-character or non-text objects. The remaining nodes are grouped into candidate text lines, where possible outliers are pruned in individual lines. Finally, the text line "swiped" or "tapped" by a user is selected as the target line and the intended text is extracted accordingly. The proposed algorithm has been evaluated on ICDAR-2003 benchmark dataset and a superior performance is achieved against the previous methods.
机译:我们先前已经提出了一种基于组件树的方法,用于从自然场景图像中提取用户意图指导的文本。在本文中,除了改善“抽签”手势的文本提取算法的性能外,该算法还得到了扩展,以支持使用“轻击”手势指示目标文本的新模式。给定灰度图像,首先使用所谓的对比度极值区域(CER)准则和简单的几何特征规则来构建和预修剪两个分量树。通过在感知颜色空间中使用颜色信息来增强其余节点。然后,使用预训练的神经网络将一组选定的增强节点分类为单字符或非文本对象。其余节点被分组为候选文本行,其中可能的异常值在单个行中被修剪。最后,选择用户“滑动”或“轻击”的文本行作为目标行,并相应地提取预期的文本。该算法已在ICDAR-2003基准数据集上进行了评估,与以前的方法相比,具有更高的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号