首页> 外文会议>International Conference on Fuzzy Systems and Knowledge Discovery >Word Image Decomposition from Mixed Text/Graphics Images Using Statistical Methods
【24h】

Word Image Decomposition from Mixed Text/Graphics Images Using Statistical Methods

机译:使用统计方法从混合文本/图形图像中的单词图像分解

获取原文

摘要

This paper describes the development and implementation of a algorithm to extract words from image regions mixed text/graphics in document images using statistical analyses, which is a component of DIPS (Document Images Processing System) using statistical methods. To extract word images from image regions, the character components need to be separated from graphic components. For this process, we propose a method to separate them with an analysis of box-plot using a statistics of structural components. An accuracy of this method is not sensitive to the changes of images because the criterion of separation is defined by the statistics of components. And then the character regions are determined by analyzing a local crowdedness of the separated character elements. Finally, we divide the character regions into text lines and word images using projection profile analysis and gap clustering, etc. The proposed system could reduce the influence resulted from the changes of images because it uses the criterion based on the statistics of image regions.
机译:本文介绍了一种算法的开发和实现,用于使用统计分析从文档图像中从图像区域混合文本/图形中提取单词,这是使用统计方法的DIPS(文档图像处理系统)的组件。为了从图像区域中提取字图像,需要将字符组件与图形组件分开。对于此过程,我们建议使用结构部件的统计分析箱图的分析方法。这种方法的准确性对图像的变化不敏感,因为分离的标准由组件的统计定义。然后通过分析分离的字符元素的本地拥挤度来确定字符区域。最后,我们将字符区域划分为使用投影简档分析和间隙聚类的文本线条和单词图像等。所提出的系统可以减少图像的变化导致的影响,因为它使用基于图像区域的统计数据的标准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号