Online Handwritten Gurmukhi Strokes Dataset Based on Minimal Set of Words

SUKHDEEP SINGH; ANUJ SHARMA; INDU CHHABRA

Abstract

Online handwriting data are an integral part of data analysis and classification research, because collected handwritten data pose many challenges when grouping handwritten strokes into classes. The present work groups handwritten strokes from the Indic script Gurmukhi, the script of the widely spoken Punjabi language. It develops a dataset of Gurmukhi words for online handwriting recognition in real-life applications such as map navigation. We collected data from 100 writers from the largest cities in the Punjab region. Dataset development took into account writer variations such as writing skill level (beginner, moderate, and expert), gender, right- or left-handedness, and adaptability to digital handwriting. We introduce a novel technique to form handwritten stroke classes based on a limited set of words. The presence of all letters of the Gurmukhi script, including vowels, was considered before a word was selected. The developed dataset includes 39,411 strokes from handwritten words, which form 72 stroke classes after k-means clustering and manual verification by expert and moderate writers. Using a Hidden Markov Model, we achieved recognition rates of 87.10%, 85.43%, and 84.33% for middle-zone strokes when using 66%, 50%, and 80% of the developed dataset, respectively, as training data. The present work is a step toward finding groups for unknown handwriting strokes with reasonably high accuracy.
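The clustering step mentioned in the abstract (grouping strokes into classes with k-means) can be illustrated with a small sketch. This page does not describe the authors' features, initialization, or parameters, so everything below is an assumption made for illustration: strokes are taken to be lists of (x, y) pen points, each stroke is resampled to a fixed number of points by arc length and normalized for position and size, and the helper names `resample`, `features`, `sqdist`, and `kmeans` are hypothetical. Initial centers are chosen by a simple farthest-point rule to keep the toy run deterministic. It is not the paper's pipeline.

```python
import math

def resample(stroke, n=8):
    """Resample a stroke (list of (x, y) points) to n points spaced evenly by arc length."""
    dists = [0.0]  # cumulative arc length at each original point
    for (x0, y0), (x1, y1) in zip(stroke, stroke[1:]):
        dists.append(dists[-1] + math.hypot(x1 - x0, y1 - y0))
    total = dists[-1]
    if total == 0.0:  # single point or all points identical
        return [stroke[0]] * n
    out, j = [], 0
    for i in range(n):
        target = total * i / (n - 1)
        while j < len(dists) - 2 and dists[j + 1] < target:
            j += 1
        seg = dists[j + 1] - dists[j]
        t = 0.0 if seg == 0.0 else (target - dists[j]) / seg
        (x0, y0), (x1, y1) = stroke[j], stroke[j + 1]
        out.append((x0 + t * (x1 - x0), y0 + t * (y1 - y0)))
    return out

def features(stroke, n=8):
    """Flatten a resampled stroke into a 2n-vector, centered and scaled to unit extent."""
    pts = resample(stroke, n)
    cx = sum(x for x, _ in pts) / n
    cy = sum(y for _, y in pts) / n
    scale = max(max(abs(x - cx), abs(y - cy)) for x, y in pts) or 1.0
    return [v for x, y in pts for v in ((x - cx) / scale, (y - cy) / scale)]

def sqdist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b))

def kmeans(vectors, k, iters=50):
    """Plain k-means (Lloyd's algorithm) with farthest-point initialization (an assumption)."""
    centers = [list(vectors[0])]
    while len(centers) < k:
        far = max(vectors, key=lambda v: min(sqdist(v, c) for c in centers))
        centers.append(list(far))
    labels = None
    for _ in range(iters):
        new_labels = [min(range(k), key=lambda c: sqdist(v, centers[c])) for v in vectors]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:  # keep the old center if a cluster is empty
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centers

# Toy data (made up): two roughly horizontal and two roughly vertical strokes.
strokes = [
    [(0, 0), (5, 0.2), (10, 0)],
    [(2, 3), (12, 3.1), (22, 3)],
    [(0, 0), (0.1, 5), (0, 10)],
    [(4, 1), (4.2, 11), (4, 21)],
]
vectors = [features(s) for s in strokes]
labels, centers = kmeans(vectors, k=2)
print(labels)
```

In this toy run the two horizontal strokes share one label and the two vertical strokes share the other. The abstract reports 72 classes obtained from 39,411 strokes, with manual verification by writers after clustering; the sketch covers only the automatic grouping idea.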

Bibliographic details

Source
ACM Transactions on Asian Language Information Processing | 2017, Issue 1 | pages 1.1-1.20 | 20 pages in total
Authors
SUKHDEEP SINGH; ANUJ SHARMA; INDU CHHABRA
Author affiliations
Department of Computer Science and Applications, Panjab University Chandigarh (listed for each of the three authors)
Original format
PDF
Language of text
English
Keywords
Online handwriting recognition; data collection; digital handwriting; clustering; classification; k-means; HMM

Similar literature

1. Recognition of online handwritten Gurmukhi characters based on zone and stroke identification [J]. KARUN VERMA, R K SHARMA. Sadhana. 2017, No. 5
2. Online Handwritten Gurmukhi Words Recognition: An Inclusive Study [J]. Singh Sukhdeep, Sharma Anuj. ACM transactions on Asian language information processing. 2019, No. 3
3. ONLINE PREPROCESSING OF HANDWRITTEN GURMUKHI STROKES [J]. Anuj Sharma, R. K. Sharma, Rajesh Kumar. Machine Graphics & Vision. 2009, No. 1
4. Rearrangement of Recognized Strokes in Online Handwritten Gurmukhi Words Recognition [C]. Anuj Sharma, Rajesh Kumar, R. K. Sharma. International Conference on Document Analysis and Recognition. 2009
5. Physics-based methodologies for recognizing handwritten signatures, words, and line drawings [D]. Pavlidis, Ioannis. 1996
6. Hough Transform-Based Angular Features for Learning-Free Handwritten Keyword Spotting [O]. Subhranil Kundu, Samir Malakar, Zong Woo Geem. 2021
7. Online Handwritten Gurmukhi Character Recognition using Hybrid Feature Set [O]. Mandeep Singh, Karun Verma, Bob Gill. 2018
8. Handwritten Word Recognition Based on Fourier Coefficients [R]. Shartle, G. 1993
