Conference: International Conference on Mechatronics and Machine Vision in Practice

A Vision Aid for the Visually Impaired using Commodity Dual-Rear-Camera Smartphones


Abstract

Dual (or multiple) rear cameras on hand-held smartphones are believed to be the future of mobile photography. Recently, many such phones have been released, mainly with dual rear cameras: one wide-angle and one telephoto. Notable examples are the Apple iPhone 7 and 8 Plus, iPhone X, Samsung Galaxy S9, LG V30, and Huawei Mate 10. With built-in dual-camera systems, these devices can not only produce better-quality pictures but also acquire 3D stereo photos with depth information. They can therefore capture a moment with depth, much as our two eyes do. Thanks to this trend, these phones are becoming cheaper and more powerful. In this paper, we describe a system that uses commercial dual-rear-camera phones, such as the iPhone X, to aid people who are visually impaired. We propose placing the phone at the centre of the user's chest, with the user wearing one or two Bluetooth headphones to listen to the phone's audio output. Our system consists of three modules: (1) scene context recognition to audio, (2) 3D stereo reconstruction to audio, and (3) interactive audio/voice controls. In slightly more detail, the wide-angle camera captures live photos, which a GPS-guided deep learning process analyses to describe the scene in front of the user (module 1). The telephoto camera captures a narrower field of view, which is stereo-reconstructed with the aid of the wide-angle image to form a depth map (a dense, area-based distance map). The map is used to determine the distance to all visible objects and to notify the user of the critical ones (module 2). This module also makes the phone vibrate when an object is close to the user, e.g. within hand-reach distance. The user can also query the system by asking various questions and receive automatic voice answers (module 3). In addition, a manual rescue module (module 4) is included for cases where other things have gone wrong. An example of the vision-to-audio output could be "Overall, likely a corridor, one medium object is 0.5 m away - central left", or "Overall, city pathway, front cleared". An audio command input may be "read texts", upon which the phone detects and reads all text on the closest object. More details on the design and implementation are described in the paper.
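The depth-to-audio step of module 2 can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the representation of the depth map as a grid of distances in metres (with non-positive values meaning "no estimate"), the three-way left/central/right split, and the 0.7 m hand-reach threshold are all assumptions made for illustration.

```python
from typing import List, Optional, Tuple

# Assumed threshold; the abstract only says "within hand reach distance".
HAND_REACH_M = 0.7


def nearest_obstacle(depth_map: List[List[float]]) -> Optional[Tuple[float, str]]:
    """Return (distance in metres, horizontal position) of the closest valid pixel.

    Non-positive values are treated as "no depth estimate" and skipped.
    Returns None when the map contains no valid depth value.
    """
    best = None  # (distance, column index, row width)
    for row in depth_map:
        width = len(row)
        for x, distance in enumerate(row):
            if distance <= 0:
                continue
            if best is None or distance < best[0]:
                best = (distance, x, width)
    if best is None:
        return None
    distance, x, width = best
    # Split the image into three equal vertical bands.
    position = ("left", "central", "right")[x * 3 // width]
    return distance, position


def describe(depth_map: List[List[float]]) -> Tuple[str, bool]:
    """Build a spoken message and a vibrate flag from a depth map."""
    hit = nearest_obstacle(depth_map)
    if hit is None:
        return "front cleared", False
    distance, position = hit
    message = f"one object is {distance:.1f} m away - {position}"
    return message, distance <= HAND_REACH_M


if __name__ == "__main__":
    demo = [
        [3.0, 3.0, 3.0, 3.0, 3.0, 3.0],
        [0.5, 3.0, 3.0, 3.0, 3.0, 0.0],
    ]
    print(describe(demo))
```

A real system would segment objects rather than pick a single closest pixel, and would pass the message to a text-to-speech engine; the sketch only shows how a distance map could be reduced to a short spoken cue and a vibration trigger.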
