Histogram-Based Fast Text Paragraph Image Detection

机译：基于直方图的快文段图像检测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Rumormongers always use long paragraphs to spread slanderous stories so that they can convince readers. Those illegal or sensitive rumors uploaded into the internet can be written on images to by-pass text filters. These images can be detected by existing filters such as OCR, but the detection is very time consuming. To prohibit the dissemination of those commentaries, detecting whether an image contains a sufficient amount of words provides convenience to the government or internet service providers. Because of this, we focus on developing a fast pre-processor algorithm for detecting images embedded with sufficient text, such that the text filters (e.g. OCR) only need to focus on those suspected images. In this paper, we propose a histogram-based fast detection method to determine whether an image contains paragraphs of text or not. Binary histograms are extracted from the converted binary images. Then, due to the periodic pattern of the histograms, a step curve is designed to apply on the autocorrelation of those histograms. The area under the curve is further utilized to differentiate images with paragraphs and those without. To imitate the scenario, we construct a new dataset covering more than 2000 images of with and without paragraphs. The results show the effectiveness of the proposed detection system, which achieves 99.5% in accuracy and 15 millisecond per image in speed implemented in C++.

机译：RumOmrongers总是使用长段来传播诽谤的故事，以便他们可以说服读者。上传到Internet中的非法或敏感的谣言可以写在图像上以旁路文本过滤器。这些图像可以通过诸如OCR的现有滤波器来检测，但检测非常耗时。禁止传播这些评论，检测图像是否包含足够数量的单词，为政府或互联网服务提供商提供便利。因此，我们专注于开发一种快速预处理器算法，用于检测嵌入足够文本的图像，使得文本过滤器（例如，OCR）仅需要专注于那些可疑图像。在本文中，我们提出了一种基于直方图的快速检测方法，以确定图像是否包含文本段落。从转换后的二进制图像中提取二进制直方图。然后，由于直方图的周期性模式，旨在施加对这些直方图的自相关的步骤曲线。曲线下的区域还用于将图像与段落和那些区分开来。要模仿方案，我们构建了一个新的数据集，涵盖了2000多个与段落的2000张图像。结果表明了所提出的检测系统的有效性，其在C ++中实现了99.5％的精度和15毫秒的速度。

著录项

来源
《IEEE Symposium Series on Computational Intelligence》|2015年||共8页
会议地点
作者
Devadeep Shyam; Yan Wang; Alex C. Kot;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Improved localization accuracy by LocNet for Faster R-CNN based text detection in natural scene images [J] . Zhong Zhuoyao, Sun Lei, Huo Qiang Pattern Recognition: The Journal of the Pattern Recognition Society . 2019,第期

机译：通过LOCNET提高本地化精度，以便在自然场景图像中更快的基于R-CNN的文本检测
2. Fast and robust text detection in images and video frames [J] . Qixiang Ye, Qingming Huang, Wen Gao, Image and Vision Computing . 2005,第6期

机译：在图像和视频帧中进行快速可靠的文本检测
3. Fast two-step histogram-based image segmentation [J] . Krstinic D.Skelin A.K.Slapnicar I. Image Processing, IET . 2011,第1期

机译：基于两步直方图的快速图像分割
4. Histogram-Based Fast Text Paragraph Image Detection [C] . Devadeep Shyam, Yan Wang, Alex C. Kot IEEE Symposium Series on Computational Intelligence . 2015

机译：基于直方图的快速文本段落图像检测
5. Detection of text strings from mixed text/graphics images. [D] . Tsai, Chien-Hua. 2000

机译：从混合的文本/图形图像中检测文本字符串。
6. Artificial Intelligence-Based Mitosis Detection in Breast Cancer Histopathology Images Using Faster R-CNN and Deep CNNs [O] . Tahir Mahmood, Muhammad Arsalan, Muhammad Owais, 2020

机译：使用更快的R-CNN和深CNN在乳腺癌组织病理学图像中基于人工智能的有丝分裂检测
7. Object detection in images using extended set of Haar-like features and histogram-based method [O] . Králík Martin 2012

机译：使用扩展的类似Haar的特征集和基于直方图的方法对图像进行目标检测
8. Automated System for Text Detection Individual Video Images [R] . Du, Y. , Chang, C. , Thouin, P. D. 2003

机译：用于文本检测的自动化系统单个视频图像

Histogram-Based Fast Text Paragraph Image Detection

摘要

著录项

相似文献

相关主题

期刊订阅