首页> 外文会议>2017 IEEE 4th International Conference on Soft Computing amp; Machine Intelligence >A simple text detection in document images using classification-based techniques
【24h】

A simple text detection in document images using classification-based techniques

机译:使用基于分类的技术对文档图像进行简单的文本检测

获取原文
获取原文并翻译 | 示例

摘要

Text regions can be useful to computer vision applications. It can be used to label and train automatic layout learning systems or to detect and locate the title, keywords, subheadings, paragraphs and image regions in images. This work proposes a technique to separate text regions from image documents. Images are divided into small non-overlapping windows. Textural features are extracted from these image windows before a classification is performed. Two refinement processes are carried out to reject misclassified windows, i.e window merging and Markov Random Files (MRFs). Window merging determine the similarity of a window and its neighbouring windows (based-on a distance-based technique). MRF examines the relationships between each window and it's neighbouring one using an energy minimization technique. The experimental results demonstrate that the refinement method is superior to the original classification without a refinement.
机译:文本区域对于计算机视觉应用程序很有用。它可以用于标记和训练自动布局学习系统,或者用于检测和定位图像中的标题,关键字,副标题,段落和图像区域。这项工作提出了一种从图像文档中分离文本区域的技术。图像分为小的不重叠窗口。在执行分类之前,从这些图像窗口中提取纹理特征。进行两个改进过程以拒绝分类错误的窗口,即窗口合并和马尔可夫随机文件(MRF)。窗口合并确定一个窗口及其相邻窗口的相似性(基于基于距离的技术)。 MRF使用能量最小化技术检查每个窗口与其相邻窗口之间的关系。实验结果表明,改进后的方法优于未经改进的原始分类方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号