Arabic Character Recognition Using Structural Shape Decomposition

机译：使用结构形状分解的阿拉伯字符识别

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper presents a statistical framework for recognising 2D shapes which are represented as an arrangement of curves or strokes. The approach is a hierarchical one which mixes geometric and symbolic information in a three-layer architecture. Each curve primitive is represented using a point-distribution model which describes how its shape varies over a set of training data. We assign stroke labels to the primitives and these indicate to which class they belong. Shapes are decomposed into an arrangement of primitives and the global shape representation has two components. The first of these is a second point distribution model that is used to represent the geometric arrangement of the curve centre-points. The second component is a string of stroke labels that represents the symbolic arrangement of strokes. Hence each shape can be represented by a set of centre-point deformation parameters and a dictionary of permissible stroke label configurations. The hierarchy is a two-level architecture in which the curve models reside at the nonterminal lower level of the tree. The top level represents the curve arrangements allowed by the dictionary of permissible stroke combinations. The aim in recognition is to minimise the cross entropy between the probability distributions for geometric alignment errors and curve label errors. We show how the stroke parameters, shape-alignment parameters and stroke labels may be recovered by applying the expectation maximization EM algorithm to the utility measure. We apply the resulting shape-recognition method to Arabic character recognition.

机译：本文提出了一个统计框架，用于识别二维形状，这些二维形状表示为曲线或笔触的排列。该方法是一种分层的方法，在三层体系结构中混合了几何和符号信息。每个曲线图元都使用点分布模型表示，该模型描述了形状在一组训练数据上的变化方式。我们将笔划标签分配给图元，这些标签指示它们属于哪个类。形状被分解为图元的排列，并且全局形状表示具有两个组成部分。第一个是第二个点分布模型，用于表示曲线中心点的几何排列。第二部分是一串笔画标签，代表笔画的符号排列。因此，每种形状都可以由一组中心点变形参数和允许的行程标签配置字典来表示。层次结构是两级体系结构，其中曲线模型位于树的非终端较低层。顶层表示允许的笔划组合字典所允许的曲线排列。识别的目的是最小化几何对齐误差和曲线标签误差的概率分布之间的交叉熵。我们展示了如何通过将期望最大化EM算法应用于效用度量来恢复笔画参数，形状对齐参数和笔画标签。我们将所得的形状识别方法应用于阿拉伯字符识别。

著录项

来源
《Computer Analysis of Images and Patterns》|2003年|p.478-486|共9页
会议地点
作者
Abdullah Al Shaher; Edwin R. Hancock;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词
point distribution models; expectation maximization algorithm; discrete relaxation; hierarchical mixture of experts; arabic scripts;

机译：点分布模型;期望最大化算法;离散松弛;专家的分层混合;阿拉伯文字;

相似文献

外文文献
中文文献
专利

1. Arabic Handwritten Character Recognition Using Structural Shape Decomposition [J] . Abdullah A. Al-Shaher, Edwin R. Hancock Signal & Image Processing : An International Journal (SIPIJ) . 2017,第2期

机译：利用结构形状分解的阿拉伯手写字符识别
2. Handwritten Character Recognition Using Structural Shape Decomposition [J] . Abdullah A. Al-Shaher, Edwin R. Hancock Computer Science & Information Technology . 2017,第4期

机译：利用结构形状分解的手写字符识别
3. Character context: a shape descriptor for Arabic handwriting recognition [J] . Mudhsh Mohammed, Almodfer Rolla, Duan Pengfei, Journal of electronic imaging . 2017,第6期

机译：字符上下文：用于阿拉伯手写识别的形状描述符
4. Arabic Character Recognition Using Structural Shape Decomposition [C] . Abdullah Al Shaher, Edwin R. Hancock, Lecture Notes in Computer Science 2756 International Conference on Computer Analysis of Images and Patterns . 2003

机译：使用结构形状分解的阿拉伯语字符识别
5. Optical Character Recognition of Printed Persian/Arabic Documents. [D] . Shafii, Mahnaz. 2014

机译：印刷的波斯/阿拉伯文档的光学字符识别。
6. Synthesis of Common Arabic Handwritings to Aid Optical Character Recognition Research [O] . Laslo Dinges, Ayoub Al-Hamadi, Moftah Elzobi, 2016

机译：常用阿拉伯文字的合成对光学字符识别的研究
7. Arabic Handwritten Character Recognition Using Structural Shape Decomposition [O] . Abdullah A. Al-Shaher, Edwin R. Hancock 2017

机译：使用结构形状分解的阿拉伯手写字符识别

Arabic Character Recognition Using Structural Shape Decomposition

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅