Source: Data in Brief

BanglaWriting: A multi-purpose offline Bangla handwriting dataset

Abstract

This article presents BanglaWriting, a Bangla handwriting dataset that contains single-page handwriting samples from 260 individuals of different personalities and ages. Each page is annotated with bounding boxes that enclose each word, along with the Unicode representation of the writing. The dataset contains 21,234 words and 32,787 characters in total, including 5,470 unique words of Bangla vocabulary. Apart from the usual words, the dataset comprises 261 comprehensible overwritings and 450 handwritten strike-throughs and mistakes. All of the bounding boxes and word labels are manually generated. The dataset can be used for complex optical character/word recognition, writer identification, handwritten word segmentation, and word generation. Furthermore, it is suitable for extracting age-based and gender-based variation in handwriting.
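
As a rough illustration of the annotation described above (a bounding box per word plus its Unicode label), the following Python sketch shows one way such records might be represented and how total and unique word counts could be derived from them. The `WordBox` class, its field names, the `summarize` helper, and the two sample pages are all hypothetical; the abstract does not specify the dataset's actual file format.

```python
from dataclasses import dataclass
import unicodedata


@dataclass(frozen=True)
class WordBox:
    """Hypothetical record: one word's bounding box (pixels) and Unicode label."""
    x_min: int
    y_min: int
    x_max: int
    y_max: int
    label: str


def summarize(pages):
    """Count total and unique word labels across a list of annotated pages."""
    # Normalize to NFC so visually identical Bangla strings compare equal.
    labels = [
        unicodedata.normalize("NFC", box.label)
        for page in pages
        for box in page
    ]
    return {"words": len(labels), "unique_words": len(set(labels))}


# Two tiny made-up pages (not taken from the dataset).
pages = [
    [WordBox(10, 12, 90, 40, "আমি"), WordBox(100, 12, 180, 40, "বাংলা")],
    [WordBox(8, 15, 85, 44, "বাংলা")],
]

stats = summarize(pages)
print(stats)  # {'words': 3, 'unique_words': 2}
```

Normalizing labels before counting is a design choice of this sketch, made because the same Bangla word can be encoded with different code point sequences; whether the dataset's own counts were computed this way is not stated.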