...
首页> 外文期刊>Neurocomputing >Sparse Discriminative Information Preservation for Chinese character font categorization
【24h】

Sparse Discriminative Information Preservation for Chinese character font categorization

机译:汉字字体分类的稀疏判别信息保存

获取原文
获取原文并翻译 | 示例

摘要

With the rapid development of optical character recognition (OCR), font categorization becomes more and more important. This is because font information has very wide usage and researchers came to know this point recently. In this paper, we propose a new scheme for Chinese character font categorization (CCFC), which applies LBP descriptor based Chinese character interesting points for representing font information. Specifically, it classifies Chinese character font through the cooperation between a new Sparse Discriminative Information Preservation (SDIP) for feature selection and NN classifier. SDIP focus three aspects as follows: (1) it preserves the local geometric structure of the intra-class samples and maximizes the margin between the inter-class samples on the local patch simultaneously; (2) it models the reconstruction error to preserve the prior information of the data distribution; and (3) it introduces the LI-norm penalty to achieve the sparsity of the projection matrix. We conduct experiments on our new collect text block images which include 25 popular Chinese fonts. The average recognition demonstrates the robustness and effectiveness of SDIP for CCFC.
机译:随着光学字符识别(OCR)的飞速发展,字体分类变得越来越重要。这是因为字体信息的用途非常广泛,并且研究人员最近才知道这一点。在本文中,我们提出了一种新的汉字字体分类方案(CCFC),该方案采用基于LBP描述符的汉字兴趣点表示字体信息。具体来说,它通过用于特征选择的新稀疏鉴别信息保存(SDIP)和NN分类器之间的协作对汉字字体进行分类。 SDIP着眼于以下三个方面:(1)保留类别内样本的局部几何结构,同时最大化局部补丁上类别间样本之间的余量; (2)对重构误差进行建模,以保留数据分布的先验信息; (3)引入LI范数惩罚以实现投影矩阵的稀疏性。我们对新收集的文本块图像进行了实验,其中包括25种流行的中文字体。平均认可证明了SDIP对于CCFC的鲁棒性和有效性。

著录项

  • 来源
    《Neurocomputing》 |2014年第10期|159-167|共9页
  • 作者单位

    School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510640, China;

    School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510640, China;

    School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510640, China;

    School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510640, China;

    School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510640, China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Chinese character font categorization; LBP; Harris corner; Dimension reduction; Sparse learning;

    机译:汉字字体分类;LBP;哈里斯角;尺寸缩小;稀疏学习;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号