首页> 外国专利> How to remove underlines and table lines in a document image while protecting character strokes

How to remove underlines and table lines in a document image while protecting character strokes

机译:如何在保护字符笔划的同时删除文档图像中的下划线和表格行

摘要

A method for removing horizontal and vertical lines in a document image while preserving integrity of the character strokes that intersect the lines. For each detected horizontal line, a vertical run length profile is calculated. Areas of the run length profile having two adjacent peaks with a valley in between are detected, which correspond to intersections of the horizontal line with non-vertical lines. A first derivative curve may be used to detect such peaks and valleys. Areas of the run length profile with large run length value for consecutive pixel locations are also detected, which corresponds to intersections of the horizontal line with near vertical lines. The horizontal line is removed in areas outside of the intersection areas, while preserving pixels within the intersection areas. Vertical line removal may be done similarly. This template-free method can remove lines in tables, forms, and underline and extract handwriting or printed characters.
机译:一种在保留与线条相交的字符笔划的完整性的同时删除文档图像中水平和垂直线条的方法。对于每个检测到的水平线,都会计算垂直游程长度分布。游程轮廓的区域具有两个相邻的峰,中间有一个谷,该区域对应于水平线与非垂直线的交点。一阶导数曲线可以用于检测这样的峰和谷。还检测到连续像素位置具有较大游程长度值的游程轮廓的区域,该区域对应于水平线与接近垂直线的交点。在相交区域之外的区域中删除水平线,同时保留相交区域内的像素。垂直线去除可以类似地完成。这种无需模板的方法可以删除表格,表格中的线条和下划线,并提取手写或印刷字符。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号