Scene to Text Conversion and Pronunciation for Visually Impaired People

机译：视障人士的场景到文本转换和发音

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The recent technological advancements are focusing on developing smart systems to improve the quality of life. Machine learning algorithms and artificial intelligence are becoming elementary tools, which are used in the establishment of modern smart systems across the globe. In this context, an effective approach is suggested for automated text detection and recognition for the natural scenes. The incoming image is firstly enhanced by employing Contrast Limited Adaptive Histogram Equalization (CLAHE). Afterward, the text regions of the enhanced image are detected by employing the Maximally Stable External Regions (MSER) feature detector. The non-text MSERs are removed by employing appropriate filters. The remaining MSERs are grouped into words. The text recognition is performed by employing an Optical Character Recognition (OCR) function. The extracted text is pronounced by using a suitable speech synthesizer. The proposed system prototype is realized. The system functionality is verified with the help of an experimental setup. Results prove the concept and working principle of the devised system. It shows the potential of employing the suggested method for the development of modern devices for visually impaired people.

机译：最近的技术进步集中在开发智能系统以改善生活质量上。机器学习算法和人工智能正在成为基本工具，已在全球范围内用于建立现代智能系统。在这种情况下，建议一种有效的方法来自动检测和识别自然场景。首先通过使用对比度受限的自适应直方图均衡化（CLAHE）增强输入图像。然后，通过使用最大稳定外部区域（MSER）特征检测器来检测增强图像的文本区域。通过使用适当的过滤器可以删除非文本MSER。其余的MSER分为单词。通过使用光学字符识别（OCR）功能来执行文本识别。通过使用合适的语音合成器来发音提取的文本。所提出的系统原型得以实现。通过实验设置验证了系统功能。结果证明了所设计系统的概念和工作原理。它显示了采用建议的方法开发视力障碍者现代设备的潜力。

著录项

来源
《Advances in Science and Engineering Technology International Conferences》|2019年|1-4|共4页
会议地点
作者
Saeed Mian Qaisar; Raviha Khan; Noofa Hammad;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Optical character recognition software; Cameras; Text recognition; Histograms; Speech recognition; Synthesizers; Visualization;

机译：光学字符识别软件;相机;文本识别;直方图;语音识别;合成器;可视化;

相似文献

外文文献
中文文献
专利

1. Image Acquisition and Text To Speech Conversion for Visually Impaired People [J] . Priyanka Rathod, Suvarna Nandyal International Journal of Engineering and Technology . 2016,第8期

机译：视障人士的图像采集和文本到语音的转换
2. Image Acquisition and Text To Speech Conversion for Visually Impaired People [J] . Priyanka Rathod, Suvarna Nandyal International Journal of Engineering and Technology . 2016,第8期

机译：视障人士的图像采集和文本到语音的转换
3. Text to Speech Conversion using Optical character Recognition for Visually Impaired Persons [J] . Prince saini, Rajesh Mehra International Journal of Computer Trends and Technology . 2015,第2期

机译：视障人士使用光学字符识别的文本到语音的转换
4. Scene to Text Conversion and Pronunciation for Visually Impaired People [C] . Saeed Mian Qaisar, Raviha Khan, Noofa Hammad Advances in Science and Engineering Technology International Conferences . 2019

机译：文本转换的场景，发音视障人士
5. Machine Vision Navigation System for Visually Impaired People [D] . Yang, Guojun. 2021

机译：用于视力受损人员的机器视觉导航系统
6. A Comparative Study in Real-Time Scene Sonification for Visually Impaired People [O] . Weijian Hu, Kaiwei Wang, Kailun Yang, 2020

机译：视障人士实时场景超声的比较研究
7. Design and Implementation of Text To Speech Conversion for Visually Impaired People [O] . Itunuoluwa Isewon, Jelili Oyelade, Olufunke Oladipupo 2015

机译：视障人士语文转换的设计与实现

Scene to Text Conversion and Pronunciation for Visually Impaired People

摘要

著录项

相似文献

相关主题

期刊订阅