IEEE India Council International Conference

Visual World to an Audible Experience: Visual Assistance for the Blind And Visually Impaired



Abstract

This paper aims to assist visually impaired people through Deep Learning (DL) by providing a system that can both describe the user's surroundings and answer questions about them. The system consists mainly of two models: an Image Captioning (IC) model and a Visual Question Answering (VQA) model. The IC model is a Convolutional Neural Network and Recurrent Neural Network based architecture that incorporates a form of attention while captioning. For the VQA task, this paper proposes two models, one based on a Multi-Layer Perceptron and one based on Long Short-Term Memory (LSTM), that answer questions related to the input image. The IC model achieved an average BLEU-1 score of 0.46, and the LSTM-based VQA model achieved an overall accuracy of 47 percent. These two models are integrated with Speech-to-Text and Text-to-Speech components to form a single system that works in real time.
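The BLEU-1 metric used to evaluate the IC model is clipped unigram precision multiplied by a brevity penalty. The sketch below is a simplified single-reference illustration of that computation, not the paper's actual evaluation code (which likely averages over multiple references per image).

```python
from collections import Counter
import math

def bleu1(candidate, reference):
    """Simplified single-reference BLEU-1: clipped unigram precision
    times a brevity penalty for candidates shorter than the reference."""
    cand, ref = candidate.split(), reference.split()
    ref_counts = Counter(ref)
    # Clip each candidate word's count by its count in the reference.
    clipped = sum(min(n, ref_counts[w]) for w, n in Counter(cand).items())
    precision = clipped / len(cand)
    # Brevity penalty discourages overly short captions.
    bp = 1.0 if len(cand) > len(ref) else math.exp(1 - len(ref) / len(cand))
    return bp * precision

score = bleu1("a man riding a bike",
              "a man is riding a bicycle on the road")
```

Here "a", "man", and "riding" match the reference ("bike" does not), giving a precision of 4/5 that is then scaled down by the brevity penalty.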
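A common shape for the LSTM-based VQA model described above is to encode the question word-by-word with an LSTM, fuse the final hidden state with a CNN image feature, and classify over a fixed answer vocabulary. The NumPy sketch below illustrates one plausible forward pass under assumed dimensions and elementwise-product fusion; the paper does not specify these details, so all names and sizes here are hypothetical.

```python
import numpy as np

# Hypothetical dimensions -- not specified in the paper.
IMG_DIM, EMB_DIM, HID_DIM, N_ANSWERS = 4096, 300, 512, 1000

rng = np.random.default_rng(0)

def lstm_step(x, h, c, W, U, b):
    """One LSTM step; gate pre-activations are stacked as
    [input, forget, cell, output]."""
    z = W @ x + U @ h + b
    i, f, g, o = np.split(z, 4)
    i, f, o = 1 / (1 + np.exp(-i)), 1 / (1 + np.exp(-f)), 1 / (1 + np.exp(-o))
    c = f * c + i * np.tanh(g)
    h = o * np.tanh(c)
    return h, c

def vqa_forward(img_feat, question_embs, params):
    """Encode the question with an LSTM, fuse with the projected image
    feature by elementwise product, then softmax over a fixed answer set."""
    h, c = np.zeros(HID_DIM), np.zeros(HID_DIM)
    for x in question_embs:                    # one word embedding per step
        h, c = lstm_step(x, h, c, *params["lstm"])
    img = np.tanh(params["Wi"] @ img_feat)     # project image into HID_DIM
    fused = h * img                            # elementwise fusion
    logits = params["Wo"] @ fused
    e = np.exp(logits - logits.max())
    return e / e.sum()                         # answer probabilities

params = {
    "lstm": (rng.normal(0, 0.1, (4 * HID_DIM, EMB_DIM)),
             rng.normal(0, 0.1, (4 * HID_DIM, HID_DIM)),
             np.zeros(4 * HID_DIM)),
    "Wi": rng.normal(0, 0.1, (HID_DIM, IMG_DIM)),
    "Wo": rng.normal(0, 0.1, (N_ANSWERS, HID_DIM)),
}

img_feat = rng.normal(size=IMG_DIM)            # stand-in for a CNN feature
question = [rng.normal(size=EMB_DIM) for _ in range(5)]  # 5 "word" vectors
probs = vqa_forward(img_feat, question, params)
```

In a deployed pipeline like the one the paper describes, `img_feat` would come from the CNN, the question embeddings from the Speech-to-Text output, and `np.argmax(probs)` would index the answer spoken by the Text-to-Speech component.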


