首页> 外文会议>2011 IEEE Workshop on Applications of Computer Vision >TranslatAR: A mobile augmented reality translator
【24h】

TranslatAR: A mobile augmented reality translator

机译:TranslatAR:移动增强现实翻译器

获取原文

摘要

We present a mobile augmented reality (AR) translation system, using a smartphone's camera and touchscreen, that requires the user to simply tap on the word of interest once in order to produce a translation, presented as an AR overlay. The translation seamlessly replaces the original text in the live camera stream, matching background and foreground colors estimated from the source images. For this purpose, we developed an efficient algorithm for accurately detecting the location and orientation of the text in a live camera stream that is robust to perspective distortion, and we combine it with OCR and a text-to-text translation engine. Our experimental results, using the ICDAR 2003 dataset and our own set of video sequences, quantify the accuracy of our detection and analyze the sources of failure among the system's components. With the OCR and translation running in a background thread, the system runs at 26 fps on a current generation smartphone (Nokia N900) and offers a particularly easy-to-use and simple method for translation, especially in situations in which typing or correct pronunciation (for systems with speech input) is cumbersome or impossible.
机译:我们提出了一种使用智能手机的摄像头和触摸屏的移动增强现实(AR)翻译系统,该系统要求用户只需轻按一次感兴趣的单词即可生成翻译,以AR叠加层的形式呈现。翻译可以无缝替换实时摄像机流中的原始文本,匹配从源图像估计的背景和前景色。为此,我们开发了一种有效的算法来准确检测实时摄像机流中文本的位置和方向,该算法对透视失真具有鲁棒性,并将其与OCR和文本到文本翻译引擎结合使用。我们的实验结果使用ICDAR 2003数据集和我们自己的视频序列集,对检测的准确性进行了量化,并分析了系统组件之间的故障源。由于OCR和翻译在后台线程中运行,因此该系统在当前的智能手机(Nokia N900)上以26 fps的速度运行,并提供了一种特别易于使用和简单的翻译方法,尤其是在键入或发音正确的情况下(对于具有语音输入的系统)比较麻烦或不可能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号