【24h】

Word Level Script Identification for Scanned Document Images

机译:扫描文档图像的字级脚本识别

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we compare the performance of three classifiers used to identify the script of words in scanned document images. In both training and testing, a Gabor filter is applied and 16 channels of features are extracted. Three classifiers (Support Vector Machines (SVM), Gaussian Mixture Model (GMM) and k-Nearest-Neighbor (k-NN)) are used to identify different scripts at the word level (glyphs separated by white space). These three classifiers are applied to a variety of bilingual dictionaries and their performance is compared. Experimental results show the capability of Gabor filter to capture script features and the effectiveness of these three classifiers for script identification at the word level.
机译:在本文中,我们比较了用于识别扫描文档图像中单词脚本的三个分类器的性能。在训练和测试中,均应用Gabor滤波器并提取16个通道的特征。三种分类器(支持向量机(SVM),高斯混合模型(GMM)和k最近邻(k-NN))用于在单词级别(由空白分隔的字形)上识别不同的脚本。这三个分类器适用于各种双语词典,并对其性能进行了比较。实验结果表明,Gabor滤波器能够捕获脚本特征,并且这三个分类器在单词级别识别脚本的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号