首页> 外文期刊>IEICE Transactions on Information and Systems >Dimensionality Reduction by Locally Linear Discriminant Analysis for Handwritten Chinese Character Recognition
【24h】

Dimensionality Reduction by Locally Linear Discriminant Analysis for Handwritten Chinese Character Recognition

机译:基于局部线性判别分析的手写汉字识别降维

获取原文
获取原文并翻译 | 示例
       

摘要

Linear Discriminant Analysis (LDA) is one of the most popular dimensionality reduction techniques in existing handwritten Chinese character (HCC) recognition systems. However, when used for unconstrained handwritten Chinese character recognition, the traditional LDA algorithm is prone to two problems, namely, the class separation problem and multimodal sample distributions. To deal with these problems,we propose a new locally linear discriminant analysis (LLDA) method for handwritten Chinese character recognition.Our algorithm operates as follows. (1) Using the clustering algorithm, find clusters for the samples of each class. (2) Find the nearest neighboring clusters from the remaining classes for each cluster of one class. Then, use the corresponding cluster means to compute the between-class scatter matrix in LDA while keeping the within-class scatter matrix unchanged. (3) Finally, apply feature vector normalization to further improve the class separation problem. A series of experiments on both the HCL2000 and CASIA Chinese character handwriting databases show that our method can effectively improve recognition performance, with a reduction in error rate of 28.7% (HCL2000) and 16.7% (CASIA) compared with the traditional LDA method.Our algorithm also outperforms DLA (Discriminative Locality Alignment,one of the representative manifold learning-based dimensionality reduction algorithms proposed recently). Large-set handwritten Chinese character recognition experiments also verified the effectiveness of our proposed approach.
机译:线性判别分析(LDA)是现有手写汉字(HCC)识别系统中最流行的降维技术之一。然而,传统的LDA算法在用于无限制手写汉字识别时,容易出现两个问题,即类分离问题和多峰样本分布问题。为了解决这些问题,我们提出了一种新的用于手写汉字识别的局部线性判别分析方法。 (1)使用聚类算法,为每个类别的样本找到聚类。 (2)从一个类的每个类的其余类中找到最近的相邻类。然后,使用相应的聚类方法来计算LDA中的类间散布矩阵,同时保持类内散布矩阵不变。 (3)最后,应用特征向量归一化进一步改善类分离问题。在HCL2000和CASIA汉字手写数据库上进行的一系列实验表明,与传统的LDA方法相比,我们的方法可以有效地提高识别性能,错误率降低28.7%(HCL2000)和16.7%(CASIA)。该算法的性能也优于DLA(区分性局部对齐),这是最近提出的基于流形学习的代表性降维算法之一。大量的手写汉字识别实验也验证了我们提出的方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号