Evaluating Word String Embeddings and Loss Functions for CNN-Based Word Spotting

机译：评估基于CNN的单词点的单词字符串嵌入和损失函数

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The recent past has seen CNNs take over the field of word spotting. The dominance of these neural networks is fueled by learning to predict a word string embedding for a given input image. While the PHOC (Pyramidal Histogram of Characters) is most prominently used, other embeddings such as the Discrete Cosine Transform of Words have been used as well. In this work, we investigate the use of different word string embeddings for word spotting. For this, we make use of the recently proposed PHOCNet and modify it to be able to not only learn binary representations. Our extensive evaluation shows that a large number of combinations of word string embeddings and loss functions achieve roughly the same results on different word spotting benchmarks. This leads us to the conclusion that no word string embedding is really superior to another and new embeddings should focus on incorporating more information than only character counts and positions.

机译：最近，CNN接管了单词发现领域。通过学习预测给定输入图像的词串嵌入，可以增强这些神经网络的优势。虽然最主要使用PHOC（字符金字塔形直方图），但也使用了其他嵌入方式，例如单词的离散余弦变换。在这项工作中，我们研究了使用不同的词串嵌入进行词发现。为此，我们利用了最近提出的PHOCNet并对其进行了修改，使其不仅能够学习二进制表示形式。我们的广泛评估表明，单词串嵌入和损失函数的大量组合在不同的单词发现基准上获得了大致相同的结果。这导致我们得出的结论是，没有任何一个字串嵌入确实比另一个字串嵌入更优越，而新的嵌入应该着重于整合更多的信息，而不仅仅是字符数和位置。

著录项

来源
《IAPR International Conference on Document Analysis and Recognition》|2017年|493-498|共6页
会议地点
作者
Sebastian Sudholt; Gernot A. Fink;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Histograms; Training; Image segmentation; Discrete cosine transforms; Logistics; Feature extraction; Toy manufacturing industry;

机译：直方图;训练;图像分割;离散余弦变换;物流;特征提取;玩具制造业;

相似文献

外文文献
中文文献
专利

1. Are complex function words processed as semantically empty strings? A reading time and ERP study of collocational complex prepositions [J] . Molinaro N., Canal P., Vespignani F., Language and cognitive processes . 2013,第6期

机译：复杂功能词是否作为语义上空的字符串处理？搭配复杂介词的阅读时间与ERP研究
2. Use of words for evaluation of hearing loss in signal-to-babble ratio: A clinic protocol [J] . Richard H. Wilson PhD, Christopher A. Burks MS Journal of Rehabilitation Research and Development . 2005,第6期

机译：评估言语和听力障碍的听力损失：临床方案
3. Numerical evaluation of special power series including the numbers of Lyndon words: an approach to interpolation functions for Apostol-type numbers and polynomials [J] . Irem Kucukoglu, Yilmaz Simsek Electronic Transactions on Numerical Analysis . 2018,第1期

机译：特殊功率级数的数值评估，包括Lyndon单词的数量：Apostol型数字和多项式的插值函数的方法
4. Evaluating Word String Embeddings and Loss Functions for CNN-Based Word Spotting [C] . Sebastian Sudholt, Gernot A. Fink IAPR International Conference on Document Analysis and Recognition . 2017

机译：评估基于CNN的Word Spotting的单词字符串嵌入和丢失函数
5. Things and Strings and More: Improving Place Name Disambiguation from Short Texts by Combining Entity Co-Occurrence, Topic Modeling, and Word Embedding [D] . Ju, Yiting. 2017

机译：事物和字符串和更多：通过组合实体共同发生，主题建模和单词嵌入来改善从短文本的歧义
6. Large-scale functional networks connect differently for processing words and symbol strings [O] . Mia Liljeström, Johanna Vartiainen, Jan Kujala, 2012

机译：大型功能网络连接方式不同，用于处理单词和符号字符串
7. SERGIOJIMENEZ at SemEval-2016 Task 1: Effectively Combining Paraphrase Database, String Matching, WordNet, and Word Embedding for Semantic Textual Similarity [O] . Sergio Jimenez 2016

机译：Semeval-2016的Sergiojimenez任务1：有效地组合了释义数据库，字符串匹配，Wordnet和Word嵌入用于语义文本相似性

Evaluating Word String Embeddings and Loss Functions for CNN-Based Word Spotting

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅