首页> 外文会议>International Conference on Asia-Pacific Digital Libraries >Using Deep Learning to Recognize Handwritten Thai Noi Characters in Ancient Palm Leaf Manuscripts
【24h】

Using Deep Learning to Recognize Handwritten Thai Noi Characters in Ancient Palm Leaf Manuscripts

机译:使用深度学习识别古代棕榈叶手稿中的手写泰国NOI字符

获取原文

摘要

Extracting knowledge from ancient palm leaf manuscripts is essential for historians and other scholars who would like to access accumulated knowledge in the Thai Noi language manuscripts. In the absence of Thai Noi language readers, computer technologies play an important role in fulfilling this need. This research aims to apply deep learning approaches to recognize Thai Noi characters written in palm leaf manuscripts. The experiments were carried out by firstly collecting the page images of the manuscripts archived in the Museum of Art and Culture of Loei. Then the page images were preprocessed by converting to grayscale. To recognize Thai Noi characters, four convolutional neural network models based on inception and mobilenet networks namely Inception-v3, Inception-v4, MobileNetV1, and MobileNetV2 were evaluated. Handwritten Thai Noi characters were segmented from the grayscale images based on 26 Thai Noi characters. In this process, 100 images of each character were segmented and the whole dataset contained 2,600 images. Two image augmentation methods were applied to increase the amount of training data. Three experiments were carried out with three different datasets based on a 10-fold cross-validation design. The results indicate that MobileNetV1 outperformed other models in all experiments with an accuracy rate higher than 90%, while MobileNetV2 showed an interesting performance, which was almost equivalent to MobileNetV1 in the last experiment.
机译:从古代棕榈叶手稿中提取知识对于历史学家和其他学者希望在泰国NOI语言手稿中获得累积知识的其他学者至关重要。在没有泰国Noi语言读者的情况下,计算机技术在满足这种需求方面发挥着重要作用。本研究旨在应用深入学习方法,以识别用棕榈叶稿件写的泰国NOI字符。通过首先收集载入艺术博物馆和Loei文化博物馆的手稿的页面图像来进行实验。然后通过转换为灰度来预处理页面图像。为了识别泰国NOI字符,评估基于Inception和MobileNet网络的四种卷积神经网络模型即Inception-V3,Inception-V4,MobileNetv1和MobileNetv2。根据26泰国NOI字符从灰度图像分段手写泰铢字符。在此过程中,分段为每个字符的100个图像,整个数据集包含2,600个图像。应用了两个图像增强方法以增加培训数据量。基于10倍交叉验证设计,使用三种不同的数据集进行三个实验。结果表明,MobileNetv1在所有实验中表明了其他模型,精度高于90%,而MobileNetv2表现出有趣的性能,这几乎相当于上次实验中的MobileNetv1。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号