...
首页> 外文期刊>International Journal of Pattern Recognition and Artificial Intelligence >COLLABORATIVE COMBINATION OF NEURON-LINGUISTIC CLASSIFIERS FOR LARGE ARABIC WORD VOCABULARY RECOGNITION
【24h】

COLLABORATIVE COMBINATION OF NEURON-LINGUISTIC CLASSIFIERS FOR LARGE ARABIC WORD VOCABULARY RECOGNITION

机译:神经语言分类器在大型阿拉伯语词汇语音识别中的协作组合

获取原文
获取原文并翻译 | 示例
           

摘要

Most of the actual research in writing recognition focuses on specific applications where the vocabulary is relatively small. Many applications can be opened up when handling with large vocabulary. In this paper, we studied the classifier collaboration interest for the recognition of a large vocabulary of arabic words. The proposed approach is based on three classifiers, named Transparent Neuronal Networks (TNN), which exploit the morphological aspect of the Arabic word and collaborate for a better word recognition. We focused on decomposable words which are derived from healthy tri-consonantal roots and easy to proof the decomposition. To perform word recognition, the system extracts a set of global structural features. Then it learns and recognizes roots, schemes and conjugation elements that compose the word. To help the recognition, some local perceptual information is used in case of ambiguities. This interaction between global recognition and local checking makes easier the recognition of complex scripts as Arabic. Several experiments have been performed using a vocabulary of 5757 words, organized in a corpus of more than 17 200 samples. In order to validate our approach and to compare the proposed system with systems reported in ICDAR 2011 competition, extensive experiments were conducted using the Arabic Printed Text Image (APTI) database. The best recognition performances achieved by our system have shown very promising results.
机译:写作识别的大部分实际研究都集中在词汇量相对较小的特定应用程序上。处理大量词汇时,可以打开许多应用程序。在本文中,我们研究了分类器协作兴趣,以识别大量的阿拉伯语单词。所提出的方法基于名为透明神经元网络(TNN)的三个分类器,这些分类器利用了阿拉伯语单词的词法方面,并协作以实现更好的单词识别。我们专注于可分解词,这些词源自健康的三辅音词根,易于证明其分解。为了执行单词识别,系统提取了一组全局结构特征。然后,它学习并识别出构成单词的词根,方案和共轭元素。为了帮助识别,在出现歧义的情况下会使用一些本地感知信息。全局识别和本地检查之间的这种交互使将复杂脚本识别为阿拉伯语变得更加容易。使用5757个单词的词汇进行了几次实验,这些词汇组织了17200个样本以上的语料库。为了验证我们的方法并将拟议的系统与ICDAR 2011竞赛中报告的系统进行比较,使用阿拉伯语印刷文本图像(APTI)数据库进行了广泛的实验。我们的系统获得的最佳识别性能已显示出令人鼓舞的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号