REPRESENTING WORD IMAGE USING VISUAL WORD EMBEDDINGS AND RNN FOR KEYWORD SPOTTING ON HISTORICAL DOCUMENT IMAGES

Abstract

In the Bag-of-Visual-Words (BoVW) framework, visual words are treated as independent of one another, which discards the spatial order between visual words and leaves the representation without semantic information. Inspired by word embeddings, this study applies a similar embedding procedure to a large number of visual words to obtain an embedding vector for each visual word. The embedding vector of a word image is then taken as the average of the embedding vectors of all visual words within that image. In addition, a Recurrent Neural Network (RNN) is used to encode each word image into an embedding, in the manner of an auto-encoder. The RNN embeddings and the visual word embeddings are complementary, so every word image is represented by combining the two. Experimental results show that the proposed representation outperforms traditional BoVW, spatial pyramid matching, and latent Dirichlet allocation.
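The averaging and combination steps described in the abstract can be illustrated with a minimal NumPy sketch. Everything here is an assumption made for illustration, not the authors' implementation: the function names (`average_embedding`, `combine`, `rank_by_cosine`) are hypothetical, the RNN embedding is taken as a precomputed vector, the two embeddings are combined by concatenating their L2-normalized forms (the abstract does not say how they are combined), and cosine similarity is used as a stand-in for the retrieval measure.

```python
import numpy as np


def average_embedding(visual_word_ids, embedding_table):
    """Mean of the embedding vectors of all visual words found in one word image.

    visual_word_ids: indices into embedding_table, one per detected visual word.
    embedding_table: array of shape (vocabulary_size, dim), assumed pre-trained.
    """
    ids = np.asarray(visual_word_ids, dtype=int)
    if ids.size == 0:
        # No visual words detected: fall back to a zero vector.
        return np.zeros(embedding_table.shape[1])
    return embedding_table[ids].mean(axis=0)


def l2_normalize(vec, eps=1e-12):
    """Scale a vector to unit length (eps avoids division by zero)."""
    return vec / (np.linalg.norm(vec) + eps)


def combine(visual_vec, rnn_vec):
    """Assumed combination rule: concatenate the two L2-normalized embeddings."""
    return np.concatenate([l2_normalize(visual_vec), l2_normalize(rnn_vec)])


def rank_by_cosine(query_vec, candidate_vecs):
    """Return candidate indices ordered from most to least similar to the query."""
    query = l2_normalize(query_vec)
    norms = np.linalg.norm(candidate_vecs, axis=1, keepdims=True) + 1e-12
    similarities = (candidate_vecs / norms) @ query
    return np.argsort(-similarities)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    table = rng.normal(size=(100, 8))           # toy visual word embeddings
    word_images = [[3, 17, 42], [3, 17, 40], [90, 91]]
    rnn_vecs = rng.normal(size=(3, 16))         # placeholder RNN embeddings
    reps = np.stack([
        combine(average_embedding(ids, table), rnn)
        for ids, rnn in zip(word_images, rnn_vecs)
    ])
    print(rank_by_cosine(reps[0], reps))
```

Normalizing each part before concatenation keeps either embedding from dominating the similarity score purely because of its scale; a weighted combination would be an equally plausible choice.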

Bibliographic record

Source
IEEE International Conference on Multimedia and Expo | 2017 | pp. 781-1559 | 6 pages in total
Authors
Hongxi Wei; Hui Zhang; Guanglai Gao
Original format
PDF
Chinese Library Classification
TP37-53
Keywords
Visual Word Embeddings; Recurrent Neural Network; Keyword Spotting; Historical Document Images; Bag-of-Visual-Words

Similar literature

1. Images and Words: Aristotle's Mimesis Revisited in the Unique Visual Work of Antonis Panagopoulos [J]. Katerina Mandroni. 哲学研究：英文版. 2017, No. 004
2. Bag-of-visual-words model for artificial pornographic images recognition [J]. 李芳芳, 罗四伟, 刘熙尧. 中南大学学报（英文版）. 2016, No. 006
3. Visual processing of word and image in the fusiform gyrus [J]. Guihua Jiang, Junzhang Tian, Yingwei Qiu. 中国神经再生研究（英文版）. 2010, No. 005
4. Visual processing of word and image in the fusiform gyrus [J]. Guihua Jiang, Junzhang Tian, Yingwei Qiu. 中国神经再生研究：英文版. 2010, No. 005
5. HMM word graph based keyword spotting in handwritten document images [J]. Toselli Alejandro Hector, Vidal Enrique, Romero Veronica. Information Sciences: An International Journal. 2016
6. A keyword retrieval system for historical Mongolian document images [J]. Hongxi Wei, Guanglai Gao. International Journal on Document Analysis and Recognition. 2014, No. 1
7. A survey of keyword spotting techniques for printed document images [J]. Abirami Murugappan, Baskaran Ramachandran, P. Dhavachelvan. Artificial Intelligence Review: An International Science and Engineering Journal. 2011, No. 2
8. Representing word image using visual word embeddings and RNN for keyword spotting on historical document images [C]. Hongxi Wei, Hui Zhang, Guanglai Gao. IEEE International Conference on Multimedia and Expo. 2017
9. Keywords in the mist: Automated keyword extraction for very large documents and back of the book indexing [D]. Csomai, Andras. 2008
10. Click-words: learning to predict document keywords from a user perspective [O]. Rezarta Islamaj Doğan, Zhiyong Lu
11. Keyword Spotting in Document Images through Word Shape Coding [O]. Shuyong Bai, Linlin Li, Chew Lim Tan. 2010
