首页> 外文会议>International Conference on Multimedia Analysis and Pattern Recognition >Preliminary Results on Ancient Cham Glyph Recognition from Cham Inscription images
【24h】

Preliminary Results on Ancient Cham Glyph Recognition from Cham Inscription images

机译:从可汗题字图像识别远古可汗字形的初步结果

获取原文

摘要

This paper presents an original work on ancient Cham glyph which is a language of Cham people in the Southeast Asia from the 6th to 15th century. Unfortunately it is in danger of being destroyed by times as well as of being ignored when the specialists of ancient Cham disappear. Motivated by this fact, we contribute to build a corpus of ancient Cham glyph that contains of 1607 images of Cham inscriptions. These images have been carefully pre-processed then annotated into 37 classes by a specialist/historian in ancient Cham. This digitized version of Cham glyph could be stored anywhere as long as wanted and available to wide audience for studying and exploration. As the corpus is the first introduced world wide, no work on automatic Cham recognition has been considered. We then investigate computer vision and machine learning techniques to show how effective a machine learning technique could be for this case study on a still limited amount of data. The best recognition F1-score is 86.3% with features extracted from GoogleNet and K-NN classifier, showing very promising performance.
机译:本文介绍了古代可汗字形的原始作品,该字形是6世纪至15世纪东南亚的可汗人的语言。不幸的是,它有被时代摧毁的危险,而当古老的湛专家消失时,它有可能被忽视。受这一事实的激励,我们致力于建立一个古代可汗字形的语料库,其中包含1607个可汗铭文的图像。这些图像已经过仔细的预处理,然后由古代Cham的专家/历史学家注释为37类。 Cham字形的此数字化版本可以存储在任何需要的地方,并可供广大读者进行研究和探索。由于语料库是世界上第一个引入的语料库,因此尚未考虑有关自动Cham识别的工作。然后,我们研究计算机视觉和机器学习技术,以显示在有限的数据量下,这种情况下机器学习技术的有效性。从GoogleNet和K-NN分类器提取的功能中,最佳F1得分为86.3%,显示出非常有前途的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号