Segmentation-Free Speech Text Recognition for Comic Books

机译：对漫画书的分割语音文本识别

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Speech text in comic books is written in a particular manner by the scriptwriter which raises unusual challenges for text recognition. We first detail these challenges and present different approaches to solve them. We compare the performances of pre-trained OCR and segmentation-free approach for speech text of comic books written in Latin script. We demonstrate that few good quality pre-trained OCR output samples, associated with other unlabeled data with the same writing style, can feed a segmentation-free OCR and improve text recognition. Thanks to the help of the lexicality measure that automatically accept or reject the pretrained OCR output as pseudo ground truth for a subsequent segmentation-free OCR training and recognition.

机译：漫画书中的语音文本由Scriptwriter以特定方式编写，这对文本识别提出了不寻常的挑战。我们首先详细调查这些挑战并呈现不同的方法来解决它们。我们比较了在拉丁文脚本中编写的漫画书籍语言文本的预训练OCR和分割方法的表演。我们展示了与具有相同写入风格的其他未标记的数据相关的少数好的质量训练有素的OCR输出样本，可以提供免费的OCR并提高文本识别。由于对词汇量措施的帮助，自动接受或拒绝预磨削的OCR输出作为伪基地的真实性，以获得后续分割的OCR培训和识别。

著录项

来源
《IAPR International Conference on Document Analysis and Recognition》|2017年|84p|共6页
会议地点
作者
Christophe Rigaud; Jean-Christophe Burie; Jean-Marc Ogier;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41-53;
关键词
Optical character recognition software; Text recognition; Speech; Speech recognition; Training; Image segmentation; Writing;

机译：光学字符识别软件;文本识别;语音;语音识别;训练;图像分割;写作;

相似文献

外文文献
中文文献
专利

1. CNN-based segmentation of speech balloons and narrative text boxes from comic book page images [J] . Dutta Arpita, Biswas Samit, Das Amit Kumar International Journal on Document Analysis and Recognition . 2021,第1a2期

机译：基于CNN的语音气球和叙事文本框的分割来自漫画书页面图像
2. A segmentation-free approach to text recognition with application to Arabic text [J] . Badr Al-Badr, Robert M. Haralick International Journal on Document Analysis and Recognition . 1998,第3期

机译：一种无分段的文本识别方法，适用于阿拉伯文本
3. From comics, graphic novels and picturebooks to fusion texts: a new kid on the block! [J] . Janet Evans Education 3-13: International Journal of Primary, Elementary and Early Years Education . 2012,第2期

机译：从漫画，图画小说和图画书到融合文本：一个新孩子正在崛起！
4. Segmentation-Free Speech Text Recognition for Comic Books [C] . Christophe Rigaud, Jean-Christophe Burie, Jean-Marc Ogier IAPR International Conference on Document Analysis and Recognition . 2017

机译：漫画的无段语音文本识别
5. A segmentation-free approach to text recognition with application to Arabic text. [D] . Al-Badr, Badr H. 1995

机译：一种无分段的文本识别方法，适用于阿拉伯文本。
6. Differences in sentence complexity in the text of children’s picture books and child-directed speech [O] . Jessica L. Montag -1

机译：儿童图画书和儿童指导语音中句子复杂度的差异
7. Toward speech text recognition for comic books [O] . Christophe Rigaud, Srikanta Pal, Jean-Christophe Burie, 2016

机译：对漫画书的言语文本认可

Segmentation-Free Speech Text Recognition for Comic Books

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅