首页> 外文会议>12th international ACM SIGACCESS conference on computers and accessibility 2010 >Text Locating in Scene Images for Reading and Navigation Aids for Visually Impaired Persons
【24h】

Text Locating in Scene Images for Reading and Navigation Aids for Visually Impaired Persons

机译:场景图像中的文本定位,以供视觉障碍者阅读和导航

获取原文
获取原文并翻译 | 示例

摘要

Many reading assistants and navigation systems have been designed specifically for people who are blind or visually impaired, but text locating in scene image with complex background has not yet been successfully addressed. In this paper, we propose a novel method to locate scene text by combining color uniformity and high edge density together. We perform structural analysis of text strings which contain several characters in alignment. First, we calculate the edge image and then repaint the corresponding edge pixels in the original image by using a non-dominant color. Second, color reduction is performed by color histogram and K-means algorithms to segment the repainted image into color layers. Third, we perform edge detection and label the boundaries of both text characters and unexpected noises in each color layer. Each centroid is assigned a degree which is the number of overlap in the same position among color layers. Fourth, text line fitting among centroids with high degree is performed to cascade the character boundaries which belong to the same text string. The detected text string is presented by a rectangle region covering all character boundaries in its text line. Experimental results demonstrate that our algorithm is able to locate text strings with arbitrary orientations. The performance of our algorithm is comparable with the state-of-art algorithms.
机译:已经为盲人或视障人士专门设计了许多阅读助手和导航系统,但尚未成功解决文本在场景图像中具有复杂背景的问题。在本文中,我们提出了一种通过将颜色均匀性和高边缘密度结合在一起来定位场景文本的新方法。我们对包含几个对齐字符的文本字符串进行结构分析。首先,我们计算边缘图像,然后使用非主要颜色重新绘制原始图像中的相应边缘像素。其次,通过颜色直方图和K-means算法执行色彩还原,以将重新绘制的图像分割为色彩层。第三,我们执行边缘检测并标记文本字符的边界以及每个颜色层中的意外噪声。为每个质心分配一个度数,该度数是颜色层之间相同位置的重叠数。第四,执行高度重心之间的文本行拟合,以级联属于同一文本字符串的字符边界。检测到的文本字符串由覆盖其文本行中所有字符边界的矩形区域表示。实验结果表明,我们的算法能够定位任意方向的文本字符串。我们算法的性能可与最新算法相媲美。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号