Portable Camera-Based Assistive Text and Product Label Reading From Hand-Held Objects for Blind Persons

Yi C.; Tian Y.; Arditi A.


Abstract

We propose a camera-based assistive text reading framework to help blind persons read text labels and product packaging from hand-held objects in their daily lives. To isolate the object from cluttered backgrounds or other surrounding objects in the camera view, we first propose an efficient and effective motion-based method to define a region of interest (ROI) in the video by asking the user to shake the object. This method extracts the moving object region using mixture-of-Gaussians-based background subtraction. Within the extracted ROI, text localization and recognition are performed to acquire text information. To automatically localize the text regions in the object ROI, we propose a novel text localization algorithm that learns gradient features of stroke orientations and distributions of edge pixels in an Adaboost model. Text characters in the localized text regions are then binarized and recognized by off-the-shelf optical character recognition (OCR) software. The recognized text codes are output to blind users as speech. The performance of the proposed text localization algorithm is quantitatively evaluated on the ICDAR-2003 and ICDAR-2011 Robust Reading Datasets. Experimental results demonstrate that our algorithm achieves state-of-the-art performance. The proof-of-concept prototype is also evaluated on a dataset collected with ten blind persons to assess the effectiveness of the system's hardware. We explore user interface issues and assess the robustness of the algorithm in extracting and reading text from different objects with complex backgrounds.
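
The motion-based ROI step described in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: it swaps the mixture-of-Gaussians model for a single running Gaussian per pixel to stay short, and every name and parameter here (`shake_roi`, `warmup`, `alpha`, `k`, `min_std`) is a hypothetical choice made for illustration. Frames are assumed to be grayscale images stored as nested lists of intensities.

```python
# Simplified illustration: ONE running Gaussian per pixel, not the
# mixture-of-Gaussians model named in the abstract. All names and
# default values are assumptions, not taken from the paper.

def update_background(mean, var, frame, alpha=0.05):
    """Blend a grayscale frame into the per-pixel running mean and variance."""
    for y, row in enumerate(frame):
        for x, value in enumerate(row):
            diff = value - mean[y][x]
            mean[y][x] += alpha * diff
            var[y][x] = (1 - alpha) * var[y][x] + alpha * diff * diff


def foreground_mask(mean, var, frame, k=2.5, min_std=2.0):
    """Mark pixels lying more than k standard deviations from the background."""
    mask = []
    for y, row in enumerate(frame):
        mask_row = []
        for x, value in enumerate(row):
            std = max(var[y][x] ** 0.5, min_std)
            mask_row.append(abs(value - mean[y][x]) > k * std)
        mask.append(mask_row)
    return mask


def bounding_box(mask):
    """Return (x_min, y_min, x_max, y_max) of the foreground pixels, or None."""
    points = [(x, y) for y, row in enumerate(mask)
              for x, hit in enumerate(row) if hit]
    if not points:
        return None
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return (min(xs), min(ys), max(xs), max(ys))


def shake_roi(frames, warmup=5):
    """Learn the background from the first `warmup` frames, then merge the
    foreground boxes of the remaining frames into one region of interest."""
    width = len(frames[0][0])
    mean = [[float(v) for v in row] for row in frames[0]]
    var = [[25.0] * width for _ in frames[0]]
    roi = None
    for index, frame in enumerate(frames[1:], start=1):
        if index < warmup:
            # The background is updated only during warm-up in this sketch,
            # so the shaken object never leaks into the background model.
            update_background(mean, var, frame)
            continue
        box = bounding_box(foreground_mask(mean, var, frame))
        if box is not None:
            roi = box if roi is None else (
                min(roi[0], box[0]), min(roi[1], box[1]),
                max(roi[2], box[2]), max(roi[3], box[3]))
    return roi
```

Calling `shake_roi(frames)` on a short clip returns one bounding box that covers every position the moving object occupied, or `None` if nothing moved. A full mixture-of-Gaussians model would keep several weighted Gaussians per pixel and keep adapting over time, which copes better with flickering or multi-modal backgrounds than this single-Gaussian simplification.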

Bibliographic details

Source
Mechatronics, IEEE/ASME Transactions on | 2014, Issue 3 | pp. 808-817 | 10 pages
Authors
Yi C.; Tian Y.; Arditi A.
Author affiliation

Graduate Center, The City University of New York, New York, USA

Original format: PDF
Language of text: English
Keywords
Assistive devices; blindness; distribution of edge pixels; hand-held objects; optical character recognition (OCR); stroke orientation; text reading; text region localization

Similar literature

1. Portable Camera-Based Product Label Reading For Blind People [J]. Rajkumar N, Anand M.G, Barathiraja N. International Journal of Engineering Trends and Technology. 2014, Issue 11
2. Portable Camera Based Text Reading of Objects for Blind Persons [J]. Zhihua Niu. International journal of chemistry and applications. 2019, Issue 1
3. Portable Camera Based Text Reading of Objects for Blind Persons [J]. Sonal I. Shirke, Swati V. Patil. International Journal of Applied Engineering Research. 2018, Issue 17aPta1
4. Improved text-detection methods for a camera-based text reading system for blind persons [C]. Ezaki, N., Kiyota, . 2005
5. Portable reading device for the blind. [D]. Dixit, Anish. 2003
6. HoloSelecta dataset: 10’035 GTIN-labelled product instances in vending machines for object detection of packaged products in retail environments [O]. K. Fuchs, T. Grundmann, M. Haldimann, 2020
7. Portable Camera-Based Product Label Reading For Blind People [O]. N, Rajkumar, G, Anand M., N, Barathiraja 2014
