首页> 外文会议>International conference on microelectronics, electromagnetics and telecommunications >Soft Clustering for Segmenting Touching Characters in Printed Scripts
【24h】

Soft Clustering for Segmenting Touching Characters in Printed Scripts

机译:用于在打印脚本中分割触摸字符的软聚类

获取原文

摘要

Segmentation of characters from the printed script is an important preprocessing step in automatic Optical Character Recognition (OCR). The performances of the various machine learning algorithms depend on the results of segmentation of the characters. The situation is more challenging when the scripts contain touching characters. Touching characters are predominant in different Indian scripts like Assamese, Bangla, Devanagari, Oriya, Gurmukhi, and many others. In such cases, the accuracy of an OCR system depends on the quality of segmentation of touching characters. In this paper, we explore the effectiveness of fuzzy, rough, and rough fuzzy k-means clustering to segment touching characters. We use different compound characters dataset from Devanagari, Assamese, and Bangla printed scripts for experimentation. Our results reveal that soft k-means are an effective alternative method for segmenting touching characters.
机译:来自打印脚本的字符的分割是自动光学字符识别(OCR)中的一个重要预处理步骤。各种机器学习算法的性能取决于字符分割结果。当脚本包含触摸字符时,情况更具挑战性。触摸角色在不同的印度脚本中是敏捷的,如assamese,bangla,devanagari,oriya,gurmukhi等许多人。在这种情况下,OCR系统的准确性取决于触摸字符的分割质量。在本文中,我们探讨了模糊,粗糙和粗糙和粗略模糊K-MERIAL聚类到段触摸字符的有效性。我们使用来自Devanagari,Assamese和Bangla印刷脚本的不同复合字符数据集进行实验。我们的结果表明,软k-means是用于分割触摸字符的有效替代方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号