Adaptive scene-text binarisation on images captured by smartphones

Amira Belhedi; Beatriz Marcotegui

首页> 外文期刊>Image Processing, IET >Adaptive scene-text binarisation on images captured by smartphones

【24h】

Adaptive scene-text binarisation on images captured by smartphones

机译：智能手机捕获的图像的自适应场景文本二值化

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The authors address, in this study, a new adaptive binarisation method on images captured by smartphones. This work is part of an application for visually impaired people assistance, which aims at making text information accessible to people who cannot read it. The main advantage of the proposed method is that the windows underlying the local thresholding process are automatically adapted to the image content. This avoids the problematic parameter setting of local thresholding approaches, difficult to adapt to a heterogeneous database. The adaptive windows are extracted based on ultimate opening (a morphological operator) and then used as thresholding windows to perform a local Otsu's algorithm. The authors' method is evaluated and compared with the Niblack, Sauvola, Wolf, toggle mapping morphological segmentation (TMMS) and maximally stable extremal regions methods on a new challenging database introduced by them. Their database is acquired by visually impaired people in real conditions. It contains 4000 annotated characters (available online for research purposes). Experiments show that the proposed method outperforms classical binarisation methods for degraded images such as low-contrasted or blurred images, very common in their application.

机译：在这项研究中，作者针对智能手机捕获的图像提出了一种新的自适应二值化方法。这项工作是为视障人士提供帮助的应用程序的一部分，该应用程序旨在使无法阅读的人可以访问文本信息。所提出的方法的主要优点是，局部阈值处理的基础窗口自动适应了图像内容。这避免了难以适应异构数据库的局部阈值方法的参数设置问题。自适应窗口是根据最终开口（形态运算符）提取的，然后用作阈值窗口以执行本地Otsu算法。作者的方法经过了评估，并与Niblack，Sauvola，Wolf，切换映射形态学分割（TMMS）和最大稳定的极值区域方法进行了比较，并采用了他们提出的新的具有挑战性的数据库。他们的数据库是由视障人士在实际条件下获取的。它包含4000个带注释的字符（可在线用于研究目的）。实验表明，对于退化图像（例如低对比度或模糊图像），该方法在应用中非常普遍，优于传统的二值化方法。

著录项

来源
《Image Processing, IET》 |2016年第7期|515-523|共9页
作者
Amira Belhedi; Beatriz Marcotegui;
展开▼
作者单位

PSL Research University, CMM – Centre for Mathematical Morphology, France;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Markov random field regularisation models for adaptive binarisation of nonuniform images [J] . Shen D., Ip H.H.S. IEE Proceedings. Part K . 1998,第5期

机译：马尔可夫随机场正则化模型用于非均匀图像的自适应二值化
2. Markov random field regularisation models for adaptive binarisation of nonuniform images [J] . D. Shen, H.H.S. Ip IEE Proceedings. Part K . 1998,第5期

机译：马尔可夫随机场正则化模型用于非均匀图像的自适应二值化
3. Assessment of the usability of slit lamp adapters in conjunction with smartphones to capture anterior segment images [J] . Hudson Audrey Investigative ophthalmology & visual science . 2017,第8期

机译：与智能手机结合捕获前段图像的缝隙灯适配器的可用性评估
4. CMOS Image Sensors: More than capturing good images on your smartphones - (PPT) [C] . Charles Chong Sensors Expo and Conference . 2016

机译：CMOS图像传感器：超过智能手机上的好图像 - （PPT）
5. A New Image Processing Algorithm for Geological Structure Identification of Rock Slopes Based on Drone-Captured Images [D] . Zhao, Haochen. 2018

机译：基于无人机捕获图像的岩质边坡地质结构识别新图像处理算法
6. Artificial Intelligence-Based Grading Quality of Bovine Blastocyst Digital Images: Direct Capture with Juxtaposed Lenses of Smartphone Camera and Stereomicroscope Ocular Lens [O] . Marcelo Fábio Gouveia Nogueira, Vitória Bertogna Guilherme, Micheli Pronunciate, 2018

机译：牛胚囊数字图像基于人工智能的分级质量：智能手机相机和立体显微镜目镜并置镜头的直接捕获
7. Deep Learning-Based Fake-Banknote Detection for the Visually Impaired People Using Visible-Light Images Captured by Smartphone Cameras [O] . Tuyen Danh Pham, Chanhum Park, Dat Tien Nguyen, 2020

机译：使用智能手机相机捕获的可见光图像的视觉受损人员深度学习的假纸币检测

Adaptive scene-text binarisation on images captured by smartphones

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅