首页> 外文会议>Ubiquitous information technologies and applications >Recognizing Text in Low Resolution Born-Digital Images
【24h】

Recognizing Text in Low Resolution Born-Digital Images

机译:识别低分辨率Born数字图像中的文本

获取原文
获取原文并翻译 | 示例

摘要

Since born-digital images usually have low resolution, they are distinctly different from natural scene images. Extracting text information from born-digital images has been an increasing interest in document analysis and recognition field. We propose an automatic method to recognize word from low-resolution color image. First, the image is smoothed by using the bilateral filter, which preserves edge information. Then, it is binarized using global thresholding method and cleaned from noise. Finally, the open source Optical Character Recognition engine, with the incorporation of a post-processor trained on knowledge of English language, is applied to obtain meaningful words from the binary image. We experiment the proposed system on ICDAR 2011 and music sheet dataset, and the result shows better performance than several previous works.
机译:由于出生的数字图像通常具有较低的分辨率,因此它们与自然场景图像明显不同。从出生的数字图像中提取文本信息已成为文档分析和识别领域中越来越多的兴趣。我们提出了一种从低分辨率彩色图像中识别单词的自动方法。首先,通过使用双边滤波器对图像进行平滑处理,该双边滤波器保留了边缘信息。然后,使用全局阈值方法对其进行二值化并清除噪声。最终,开源光学字符识别引擎结合了对英语知识进行培训的后处理器,可以从二进制图像中获取有意义的单词。我们在ICDAR 2011和乐谱数据集上对提出的系统进行了实验,结果显示出比以前的工作更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号