A simple text detection in document images using classification-based techniques

机译：使用基于分类的技术对文档图像进行简单的文本检测

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Text regions can be useful to computer vision applications. It can be used to label and train automatic layout learning systems or to detect and locate the title, keywords, subheadings, paragraphs and image regions in images. This work proposes a technique to separate text regions from image documents. Images are divided into small non-overlapping windows. Textural features are extracted from these image windows before a classification is performed. Two refinement processes are carried out to reject misclassified windows, i.e window merging and Markov Random Files (MRFs). Window merging determine the similarity of a window and its neighbouring windows (based-on a distance-based technique). MRF examines the relationships between each window and it's neighbouring one using an energy minimization technique. The experimental results demonstrate that the refinement method is superior to the original classification without a refinement.

机译：文本区域对于计算机视觉应用程序很有用。它可以用于标记和训练自动布局学习系统，或者用于检测和定位图像中的标题，关键字，副标题，段落和图像区域。这项工作提出了一种从图像文档中分离文本区域的技术。图像分为小的不重叠窗口。在执行分类之前，从这些图像窗口中提取纹理特征。进行两个改进过程以拒绝分类错误的窗口，即窗口合并和马尔可夫随机文件（MRF）。窗口合并确定一个窗口及其相邻窗口的相似性（基于基于距离的技术）。 MRF使用能量最小化技术检查每个窗口与其相邻窗口之间的关系。实验结果表明，改进后的方法优于未经改进的原始分类方法。

著录项

来源
《2017 IEEE 4th International Conference on Soft Computing amp; Machine Intelligence》|2017年|119-122|共4页
会议地点 Port Louis(MU)
作者
Khanabhorn Kawattikul; Phatthanaphong Chomphuwiset;
展开▼
作者单位

Department of Information System, Faculty of Social Technology, Rajamangala University of Technology Tawan-ok, (Chanthaburi Campus), Thailand;

Faculty of Informatics, Mahasarakham University, Kantarawichai District, Maha Sarakham, Thailand 44150;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Microsoft Windows; Feature extraction; Image edge detection; Merging; Decision trees; Image segmentation; Layout;

机译：Microsoft Windows；特征提取；图像边缘检测；合并；决策树；图像分割；布局；;

相似文献

外文文献
中文文献
专利

1. Text/Image Region Separation for Document Layout Detection of Old Document Images Using Non-linear Diffusion and Level Set [J] . S. Sachin Kumar, Parvathy Rajendran, P. Prabaharan, Procedia Computer Science . 2016,第1期

机译：文本/图像区域分离，用于使用非线性扩散和水平集的旧文档图像的文档布局检测
2. Skew detection for complex document images using robust borderlines in both text and non-text regions [J] . Hong Liu, Qi Wu, Hongbin Zha, Pattern recognition letters . 2008,第13期

机译：使用文本和非文本区域中的可靠边界线对复杂文档图像进行歪斜检测
3. Text/Background separation in the degraded document images by combining several thresholding techniques [J] . ABDERRAHMANE KEFALI, TOUFIK SARI, HALIMA BAHI WSEAS Transactions on Signal Processing . 2014,第Pta1期

机译：通过结合多种阈值化技术，在降级的文档图像中实现文本/背景分离
4. A simple text detection in document images using classification-based techniques [C] . Khanabhorn Kawattikul, Phatthanaphong Chomphuwiset International Conference on Soft Computing and Machine Intelligence . 2017

机译：使用基于分类的技术的文档图像中的简单文本检测
5. Document image analysis techniques for handwritten text segmentation, document image rectification and digital collation. [D] . Salvi, Dhaval. 2014

机译：用于手写文本分割，文档图像校正和数字整理的文档图像分析技术。
6. Developing a Research Instrument to Document Awareness Knowledge and Attitudes Regarding Breast Cancer and Early Detection Techniques for Pakistani Women: The Breast Cancer Inventory (BCI) [O] . Atta Abbas Naqvi, Fatima Zehra, Rizwan Ahmad, 2016

机译：开发一种研究工具来记录有关巴基斯坦妇女的乳腺癌和早期检测技术的意识知识和态度：乳腺癌清单（BCI）
7. 34th Bethesda Conference:“can atherosclerosis imaging techniques improve the detection of patients at risk for ischemic heart disease?”**The recommendations set forth in this report are those of the Conference participants and do not necessarily reflect the official position of the American College of Cardiology. When citing this document the American College of Cardiology would appreciate the following citation format: Can Atherosclerosis Imaging Techniques Improve the Detection of Patients at Risk for Ischemic Heart Disease. Presented at the 34thBethesda Conference, Bethesda, Maryland, October 7, 2002. J Am Coll Cardiol 2003;41:1855–917. This document is available on the American College of Cardiology Web site at htttp://www.acc.org. Single copies of this document are available for $5.00 each by calling 800-253-3636 (U.S. only) or by writing the Resource Center, American College of Cardiology, 9111 Old Georgetown Road, Bethesda, Maryland 20815. [O] . Taylor Allen J, Merz C.Noel Bairey, Udelson James E 2003

机译：第34届Bethesda会议：“动脉粥样硬化成像技术能否改善对有缺血性心脏病风险的患者的检测？” **本报告中提出的建议是会议参与者的建议，不一定反映美国心脏病学会的正式立场。心脏病学。当引用该文件时，美国心脏病学会将赞赏以下引用格式：动脉粥样硬化成像技术能否改善对有缺血性心脏病风险的患者的检测。于2002年10月7日在马里兰州贝塞斯达举行的第34届贝塞斯达会议上发表。J Am Coll Cardiol 2003; 41：1855-917。该文档可在美国心脏病学会网站htttp：//www.acc.org上找到。致电800-253-3636（仅限美国）或写信给美国心脏病学会资源中心，地址为9111 Old Georgetown Road，Bethesda，Maryland 20815，可单独获得本文档的一份副本，每本售价5.00美元。

A simple text detection in document images using classification-based techniques

摘要

著录项

相似文献

相关主题

期刊订阅