Assistance For Visually Impaired Using Finger-Tip Text Reader Using Machine Learning

Abstract

Visually impaired people report many difficulties in their day-to-day lives, and one of the most important is reading text. Using recent technology, we aim to ease this difficulty with a device that assists them in everyday activities and in their studies, improving their access to reading and learning material. Such a device can play a vital part in a visually impaired person's life, as it assists them with almost everything they come across in a typical day. The device captures an image when the user points at a target and locates the text present in the image. The text is then extracted from the image and converted into audio, giving the user a clear result. The device can be used on any printed text on paper and also on handwritten text, providing output effectively and in a timely manner. This project helps us identify the difficulties an average visually impaired person faces in detecting and recognizing text in real time, and propose solutions to them. In our approach, we use a fully convolutional neural network for text-level predictions and then apply non-maximum suppression (NMS) to obtain the bounding-box geometry of all text in the image. For recognition, we pass the detected text to Tesseract OCR to obtain the extracted text, which we then convert to speech as the final output. The main motivation behind our project is to help visually impaired people better recognize the text in front of them and to help them live their day-to-day lives like anyone else.
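
The abstract does not say which NMS variant the authors use or how their boxes are parameterized. As a rough illustration only, the following is a minimal sketch of plain greedy NMS over axis-aligned boxes, assuming each detection is a hypothetical `(x1, y1, x2, y2)` tuple with a confidence score; the names `Box`, `iou`, `nms`, and the `iou_threshold` default are assumptions made for this example, not taken from the paper.

```python
from typing import List, Tuple

# Assumed box format (not from the paper): axis-aligned (x1, y1, x2, y2).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: List[Box], scores: List[float],
        iou_threshold: float = 0.5) -> List[int]:
    """Greedy NMS: return indices of kept boxes, highest score first."""
    # Visit candidates from most to least confident.
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        # Keep a box only if it does not overlap too much with any kept box.
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep


if __name__ == "__main__":
    candidates = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
    confidences = [0.9, 0.8, 0.7]
    print(nms(candidates, confidences))
```

In a pipeline like the one described, the surviving boxes would presumably be cropped from the image and handed to the OCR step; a detector that predicts rotated boxes would need a polygon-based overlap measure in place of `iou`.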

Bibliographic details

Source
International Conference on Advanced Computing | 2019 | pp. 7-12 | 6 pages
Authors
S. Kowshik; V.R Gautam; K. Suganthi
Original format: PDF
Keywords
audio signal processing; convolutional neural nets; handicapped aids; handwritten character recognition; image capture; learning (artificial intelligence); natural language processing; optical character recognition; speech processing; text analysis

Similar literature

1. Text formats and web design for visually impaired and dyslexic readers: Clear Text for All [J]. Lindsay Evett, David Brown. Interacting with Computers. 2005, No. 4
2. DLWAVIP - Deep Learning Based Web Assistance for Visually Impaired People [J]. C.Santhosh Kumar, Yasaswini.P, Dr.R.Vijayabhasker. International Journal of Engineering Research and Applications. 2021, No. 5
3. iSeePlus: A cost effective smart assistance archetype based on deep learning model for visually impaired [J]. Virmani Deepali, Gupta Charu, Bamdev Pakhi. Journal of information and optimization sciences. 2020, No. 7
4. Assistance For Visually Impaired Using Finger-Tip Text Reader Using Machine Learning [C]. S. Kowshik, V.R Gautam, K. Suganthi. International Conference on Advanced Computing. 2019
5. Assessment of the impact chemistry text and figures have on visually impaired students' learning [D]. Mayo, Provi M. 2004
6. Map Learning with a 3D Printed Interactive Small-Scale Model: Improvement of Space and Text Memorization in Visually Impaired Students [O]. Stéphanie Giraud, Anke M. Brock, Marc J.-M. Macé
7. An Intelligent System to Enhance Visually-Impaired Navigation and Disaster Assistance using Geo-Based Positioning and Machine Learning [O]. Wenhua Liang, Ishmael Rico, Yu Sun. 2021
