首页> 外文期刊>Image Processing, IET >Adaptive scene-text binarisation on images captured by smartphones
【24h】

Adaptive scene-text binarisation on images captured by smartphones

机译:智能手机捕获的图像的自适应场景文本二值化

获取原文
获取原文并翻译 | 示例

摘要

The authors address, in this study, a new adaptive binarisation method on images captured by smartphones. This work is part of an application for visually impaired people assistance, which aims at making text information accessible to people who cannot read it. The main advantage of the proposed method is that the windows underlying the local thresholding process are automatically adapted to the image content. This avoids the problematic parameter setting of local thresholding approaches, difficult to adapt to a heterogeneous database. The adaptive windows are extracted based on ultimate opening (a morphological operator) and then used as thresholding windows to perform a local Otsu's algorithm. The authors' method is evaluated and compared with the Niblack, Sauvola, Wolf, toggle mapping morphological segmentation (TMMS) and maximally stable extremal regions methods on a new challenging database introduced by them. Their database is acquired by visually impaired people in real conditions. It contains 4000 annotated characters (available online for research purposes). Experiments show that the proposed method outperforms classical binarisation methods for degraded images such as low-contrasted or blurred images, very common in their application.
机译:在这项研究中,作者针对智能手机捕获的图像提出了一种新的自适应二值化方法。这项工作是为视障人士提供帮助的应用程序的一部分,该应用程序旨在使无法阅读的人可以访问文本信息。所提出的方法的主要优点是,局部阈值处理的基础窗口自动适应了图像内容。这避免了难以适应异构数据库的局部阈值方法的参数设置问题。自适应窗口是根据最终开口(形态运算符)提取的,然后用作阈值窗口以执行本地Otsu算法。作者的方法经过了评估,并与Niblack,Sauvola,Wolf,切换映射形态学分割(TMMS)和最大稳定的极值区域方法进行了比较,并采用了他们提出的新的具有挑战性的数据库。他们的数据库是由视障人士在实际条件下获取的。它包含4000个带注释的字符(可在线用于研究目的)。实验表明,对于退化图像(例如低对比度或模糊图像),该方法在应用中非常普遍,优于传统的二值化方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号