首页> 外文期刊>Pattern recognition letters >An EMD-based recognition method for Chinese fonts and styles
【24h】

An EMD-based recognition method for Chinese fonts and styles

机译:基于EMD的中文字体识别方法

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents a novel method to recognize Chinese fonts based on empirical mode decomposition (EMD). By analyzing and comparing a great number of Chinese characters, five basic strokes have been selected to characterize the stroke features of Chinese fonts. Based on them, stroke feature sequences of a given text block are calculated. By decomposing them with EMD, some intrinsic mode functions are produced and then the first two, which are of the highest frequencies, are used to produce the so-called stroke high frequency energies, which is the average energy of the two intrinsic mode functions over the length of the sequence. By calculating the stroke high frequency energies for all the five basic strokes and combining them with the averages of the five residues, which are called stroke low frequency energies, a 10-dimensional feature vector is formed. Finally, the minimum distance classifier is used to recognize the fonts and encouraging experimental results have been obtained. The main advantages of our algorithm are that (1) the feature dimension is very low; (2) less samples are needed to train the classifier; (3) finally and most importantly, it is the first attempt to apply the new theory of Hilbert-Huang transform to document analysis and recognition.
机译:本文提出了一种基于经验模态分解(EMD)的中文字体识别新方法。通过分析和比较大量的汉字,已选择了五个基本笔画来表征汉字的笔画特征。基于它们,计算给定文本块的笔划特征序列。通过用EMD分解它们,会生成一些本征模式函数,然后使用前两个频率最高的函数来产生所谓的笔划高频能量,这是两个本征模式函数在整个过程中的平均能量。序列的长度。通过计算所有五个基本笔划的笔划高频能量,并将它们与五个残差的平均值(称为笔划低频能量)组合,可以形成10维特征向量。最后,最小距离分类器用于识别字体并获得了令人鼓舞的实验结果。该算法的主要优点是:(1)特征维数很低; (2)训练分类器所需的样本更少; (3)最后也是最重要的是,这是将希尔伯特-黄变换的新理论应用于文档分析和识别的首次尝试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号