首页> 外文期刊>Advances in decision sciences >A Framework for the Selection of Binarization Techniques on Palm Leaf Manuscripts Using Support Vector Machine
【24h】

A Framework for the Selection of Binarization Techniques on Palm Leaf Manuscripts Using Support Vector Machine

机译:支持向量机选择棕榈叶手稿二值化技术的框架

获取原文
       

摘要

Challenges for text processing in ancient document images are mainly due to the high degree of variations in foreground and background. Image binarization is an image segmentation technique used to separate the image into text and background components. Although several techniques for binarizing text documents have been proposed, the performance of these techniques varies and depends on the image characteristics. Therefore, selecting binarization techniques can be a key idea to achieve improved results. This paper proposes a framework for selecting binarizing techniques of palm leaf manuscripts using Support Vector Machines (SVMs). The overall process is divided into three steps: (i) feature extraction: feature patterns are extracted from grayscale images based on global intensity, local contrast, and intensity; (ii) treatment of imbalanced data: imbalanced dataset is balanced by using Synthetic Minority Oversampling Technique as to improve the performance of prediction; and (iii) selection: SVM is applied in order to select the appropriate binarization techniques. The proposed framework has been evaluated with palm leaf manuscript images and benchmarking dataset from DIBCO series and compared the performance of prediction between imbalanced and balanced datasets. Experimental results showed that the proposed framework can be used as an integral part of an automatic selection process.
机译:古代文档图像中文本处理的挑战主要归因于前景和背景的高度变化。图像二值化是一种图像分割技术,用于将图像分为文本和背景成分。尽管已经提出了几种用于对文本文档进行二值化的技术,但是这些技术的性能有所不同,并且取决于图像特性。因此,选择二值化技术可能是获得改进结果的关键思想。本文提出了一种使用支持​​向量机(SVM)选择棕榈叶手稿二值化技术的框架。整个过程分为三个步骤:(i)特征提取:基于全局强度,局部对比度和强度从灰度图像中提取特征图案; (ii)不平衡数据的处理:不平衡数据集通过使用综合少数族群过采样技术进行平衡,以提高预测性能; (iii)选择:应用SVM以选择适当的二值化技术。该框架已通过DIBCO系列的棕榈叶手稿图像和基准数据集进行了评估,并比较了不平衡和平衡数据集的预测性能。实验结果表明,提出的框架可以作为自动选择过程的组成部分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号