首页> 外文期刊>Pattern recognition letters >Delaunay triangulation based text detection from multi-view images of natural scene
【24h】

Delaunay triangulation based text detection from multi-view images of natural scene

机译:从自然场景的多视图图像中基于Delaunay三角剖分的文本检测

获取原文
获取原文并翻译 | 示例
           

摘要

Text detection in the wild is still considered as a challenging issue to the researchers because of its several real time applications like forensic application, where CCTV camera captures images at different angles of the same scene. Unlike the existing methods that consider a single view captured orthogonally for text detection, this paper considers multi-view (view-1 and view-2 of the same spot) of the same scene captured at different angles or different height distances for text detection. For each pair of the same scene, the proposed method extracts features that describe characteristics of text components based on Delaunay Triangulation (DT), namely corner points, area and cavity of the DT. The features of corresponding DT in view-1 and view-2 are compared through cosine distance measure to estimate the similarity between two components of respective view-1 and view-2. If the pair satisfies the similarity condition, the components are considered as Candidate Text Components (CTC). In other words, these are the common components for view-1 and view-2 that satisfy the similarity condition. From each CTC of view-1 and view-2, the proposed method finds nearest neighbor components to restore the components of the same text line based on estimating degree of similarly between CTC and neighbor components using Chi-square and cosine distance measures. Furthermore, the proposed method uses a recognition step to detect correct texts by comparing recognition results of view-1 and view-2. The same recognition step is used for removing false positives to improve the performance of the proposed method. Experimental results on our own dataset, which contains pair of images of different situations, and the standard datasets, namely, ICDAR 2013, MSRATD-500, CTW1500, Total-text, ICDAR 2017 MLT and COCO-text, show that the proposed method outperforms the existing methods. (C) 2019 Published by Elsevier B.V.
机译:野外文本检测仍被认为是研究人员面临的挑战性问题,因为它具有多种实时应用,例如取证应用,其中CCTV摄像机可在同一场景的不同角度捕获图像。与考虑将单个视图正交捕获以进行文本检测的现有方法不同,本文考虑了以不同角度或不同高度距离捕获的同一场景的多视图(同一点的view-1和view-2)用于文本检测。对于同一场景的每一对,所提出的方法基于Delaunay三角剖分(DT)提取描述文本成分特征的特征,即DT的角点,面积和空腔。通过余弦距离测量比较视图1和视图2中相应DT的特征,以估计相应视图1和视图2的两个组件之间的相似性。如果该对满足相似性条件,则将这些组件视为候选文本组件(CTC)。换句话说,它们是满足相似条件的view-1和view-2的通用组件。该方法从视图1和视图2的每个CTC中,使用卡方和余弦距离量度,基于CTC和邻居分量之间的相似程度估计,找到最接近的邻居分量以恢复同一文本行的分量。此外,所提出的方法使用识别步骤,通过比较视图1和视图2的识别结果来检测正确的文本。相同的识别步骤用于消除误报,以提高所提出方法的性能。在我们自己的数据集上的实验结果包含了不同情况的图像对,而标准数据集即ICDAR 2013,MSRATD-500,CTW1500,Total-text,ICDAR 2017 MLT和COCO-text,表明该方法的性能优于现有方法。 (C)2019由Elsevier B.V.发布

著录项

  • 来源
    《Pattern recognition letters》 |2020年第1期|92-100|共9页
  • 作者

  • 作者单位

    Heritage Inst Technol Comp Sci & Engn Kolkata India;

    Univ Malaya Fac Comp Sci & Informat Technol Kuala Lumpur Malaysia;

    Indian Stat Inst Comp Vis & Pattern Recognit Unit Kolkata India;

    Nanjing Univ Natl Key Lab Novel Software Technol Nanjing Jiangsu Peoples R China;

    Univ Mysore Dept Studies Comp Sci Mysore Karnataka India;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号