首页> 外文会议>The 16th North-East Asia Symposium on Nano, Information Technology and Reliability >Intelligent text detection and extraction from natural scene images
【24h】

Intelligent text detection and extraction from natural scene images

机译:智能文本检测和自然场景图像提取

获取原文

摘要

There exist many texts and symbols in a natural scene, such as billboards and traffic signs, serving the purpose of relaying information or offering guidance. With rapid advances in information technology, detection and extraction of texts in images and related research into this area have become increasingly important. Here, we present an intelligent connected-component based text detection and extraction method involving three steps. First, candidate regions are searched via imaging processing and Canny edge detection. Second, a fast connected component (CC) algorithm enables noise filtering to obtain the candidate texts and their features. Lastly, AdaBoost classifier training is in place to categorize texts or non-text characters for the construction of strong classifiers. This three-step process can effectively filter out non-text CCs for the efficient extraction of text components. The present research integrates CC and AdaBoost algorithms in attaining a 94.65% precision rate for text extraction, which can help facilitate the application and development of text recognition techniques.
机译:在自然场景中存在许多文本和符号,例如广告牌和交通标志,用于传递信息或提供指导。随着信息技术的飞速发展,图像中文本的检测和提取以及对该领域的相关研究变得越来越重要。在这里,我们提出了一种基于智能连接组件的文本检测和提取方法,涉及三个步骤。首先,通过成像处理和Canny边缘检测来搜索候选区域。其次,快速连接组件(CC)算法可以进行噪声过滤以获得候选文本及其特征。最后,AdaBoost分类器培训到位以对文本或非文本字符进行分类,以构建强大的分类器。此三步过程可以有效过滤掉非文本CC,以有效提取文本成分。本研究将CC和AdaBoost算法集成在一起,以达到94.65%的文本提取精度,这有助于促进文本识别技术的应用和发展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号