首页> 外文会议>International Conference on Advanced Computing >Assistance For Visually Impaired Using Finger-Tip Text Reader Using Machine Learning
【24h】

Assistance For Visually Impaired Using Finger-Tip Text Reader Using Machine Learning

机译:使用机器学习的指尖文本阅读器为视障人士提供帮助

获取原文

摘要

Visually impaired people report large number of difficulties in their day to day life. One of the main and most important difficulty is reading texts. With the help of the latest technologies, we tend to help such difficulties by providing them with a device which could assist them in their everyday activities and also help them in studies to improve reading and learning contents. This device is a vital part in visually impaired person's life as it assists them with almost everything they come across in their typical day. This device captures image when pointed by the user and locates the text present in the image. The text is then extracted from the image and is further converted into audio to give the user with a clarified outcome. This device can be used in any paper printed texts and also handwritten texts, thus providing users an effective and efficient in time output. This project helps us identify various difficulties in detecting and recognizing text in real time by an average visually impaired person and come up with solutions to help them. In our approach we have used Fully Convolutional neural network for text level predictions and the then we use NMS to obtain the boxed geometry output of all the texts in the images. Then for the purpose of recognition the text we pass it on to tesseract OCR to obtain the extracted text, and then we convert the text to speech for the final outcome. The main motivation behind our project is to help the visually impaired people to better recognize all the text in front of them and help them live their day to day life just like any other normal person.
机译:视障人士表示他们在日常生活中遇到许多困难。主要和最重要的困难之一是阅读文本。在最新技术的帮助下,我们倾向于通过为他们提供一种可以帮助他们进行日常活动并帮助他们进行学习以改善阅读和学习内容的设备来帮助他们解决此类困难。该设备是视障人士生活中不可或缺的一部分,因为它可以帮助他们完成平常生活中遇到的几乎所有事情。当用户指向时,此设备会捕获图像并查找图像中存在的文本。然后从图像中提取文本,然后将其进一步转换为音频,从而为用户提供明确的结果。该设备可用于任何纸质印刷文本以及手写文本,从而为用户提供了有效而高效的时间输出。该项目可帮助我们确定普通视障人士实时检测和识别文本时遇到的各种困难,并提出解决方案以帮助他们。在我们的方法中,我们使用Fully Convolutional神经网络进行文本级别的预测,然后使用NMS获得图像中所有文本的盒装几何输出。然后出于识别文本的目的,我们将其传递到tesseract OCR以获取提取的文本,然后将文本转换为语音以得到最终结果。我们项目的主要动机是帮助视障人士更好地识别他们面前的所有文字,并像其他普通人一样,帮助他们过上日常生活。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号