首页> 美国卫生研究院文献>IEEE Journal of Translational Engineering in Health and Medicine >A Mobile Application for Keyword Search in Real-World Scenes
【2h】

A Mobile Application for Keyword Search in Real-World Scenes

机译:现实场景中关键字搜索的移动应用程序

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Keyword search in a cluttered environment is difficult in general, and even more challenging for people with low vision. While magnification can help in reading for low vision people, it does not facilitate efficient visual search due to the constriction of the field of view. The motivating observation for this study is that, in a large number of visual search tasks, people know what are they looking for (i.e., they know the keywords), they just do not know where to find them in the scene. We have developed a mobile application that allows the users to input keywords (by voice or by typing), uses an optical character recognition (OCR) engine to search for the provided keyword in the scene captured by the smartphone camera, and zooms in on the instances of the keyword detected in the captured images, to facilitate efficient information acquisition. In this paper we describe the development and evaluation of various aspects of the application, including comparing the various mainstream OCR engines that power the app, and an evaluation study comparing the app to the conventional optical magnifier vision aid. Normally sighted adults, while wearing blur glasses to lower their visual acuity, performed keyword searches for a series of items ranging from easy to difficult with the app and with a handheld magnifier. While there was no difference in the search times between the two methods for the easier tasks, the app was significantly faster than the magnifier for the difficult tasks.
机译:通常,在混乱的环境中进行关键字搜索非常困难,对于视力低下的人来说甚至更具挑战性。放大可以帮助低视力人群阅读,但由于视野狭窄,不能有效地进行视觉搜索。这项研究的动机在于,在大量的视觉搜索任务中,人们知道他们在寻找什么(即他们知道关键字),只是他们不知道在场景中哪里找到它们。我们开发了一种移动应用程序,该应用程序允许用户输入关键字(通过语音或键入),使用光学字符识别(OCR)引擎在智能手机相机捕获的场景中搜索提供的关键字,并放大在捕获的图像中检测到关键字的实例,以促进有效的信息获取。在本文中,我们描述了应用程序各个方面的开发和评估,包括比较为该应用程序提供动力的各种主流OCR引擎,以及将应用程序与常规光学放大镜视觉辅助工具进行比较的评估研究。正常视力的成年人戴着模糊眼镜以降低视力,却使用该应用程序和手持式放大镜对一系列项目进行了关键字搜索,从容易到困难。虽然对于较简单的任务,这两种方法之间的搜索时间没有差异,但对于较困难的任务,该应用程序的速度明显快于放大镜。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号