首页> 外文会议>IEEE Symposium Series on Computational Intelligence >Histogram-Based Fast Text Paragraph Image Detection
【24h】

Histogram-Based Fast Text Paragraph Image Detection

机译:基于直方图的快文段图像检测

获取原文

摘要

Rumormongers always use long paragraphs to spread slanderous stories so that they can convince readers. Those illegal or sensitive rumors uploaded into the internet can be written on images to by-pass text filters. These images can be detected by existing filters such as OCR, but the detection is very time consuming. To prohibit the dissemination of those commentaries, detecting whether an image contains a sufficient amount of words provides convenience to the government or internet service providers. Because of this, we focus on developing a fast pre-processor algorithm for detecting images embedded with sufficient text, such that the text filters (e.g. OCR) only need to focus on those suspected images. In this paper, we propose a histogram-based fast detection method to determine whether an image contains paragraphs of text or not. Binary histograms are extracted from the converted binary images. Then, due to the periodic pattern of the histograms, a step curve is designed to apply on the autocorrelation of those histograms. The area under the curve is further utilized to differentiate images with paragraphs and those without. To imitate the scenario, we construct a new dataset covering more than 2000 images of with and without paragraphs. The results show the effectiveness of the proposed detection system, which achieves 99.5% in accuracy and 15 millisecond per image in speed implemented in C++.
机译:RumOmrongers总是使用长段来传播诽谤的故事,以便他们可以说服读者。上传到Internet中的非法或敏感的谣言可以写在图像上以旁路文本过滤器。这些图像可以通过诸如OCR的现有滤波器来检测,但检测非常耗时。禁止传播这些评论,检测图像是否包含足够数量的单词,为政府或互联网服务提供商提供便利。因此,我们专注于开发一种快速预处理器算法,用于检测嵌入足够文本的图像,使得文本过滤器(例如,OCR)仅需要专注于那些可疑图像。在本文中,我们提出了一种基于直方图的快速检测方法,以确定图像是否包含文本段落。从转换后的二进制图像中提取二进制直方图。然后,由于直方图的周期性模式,旨在施加对这些直方图的自相关的步骤曲线。曲线下的区域还用于将图像与段落和那些区分开来。要模仿方案,我们构建了一个新的数据集,涵盖了2000多个与段落的2000张图像。结果表明了所提出的检测系统的有效性,其在C ++中实现了99.5%的精度和15毫秒的速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号