首页> 外文会议>International Conference on Frontiers in Handwriting Recognition >Word Beam Search: A Connectionist Temporal Classification Decoding Algorithm
【24h】

Word Beam Search: A Connectionist Temporal Classification Decoding Algorithm

机译:单词光束搜索:连接员时间分类解码算法

获取原文

摘要

Recurrent Neural Networks (RNNs) are used for sequence recognition tasks such as Handwritten Text Recognition (HTR) or speech recognition. If trained with the Connectionist Temporal Classification (CTC) loss function, the output of such a RNN is a matrix containing character probabilities for each time-step. A CTC decoding algorithm maps these character probabilities to the final text. Token passing is such an algorithm and is able to constrain the recognized text to a sequence of dictionary words. However, the running time of token passing depends quadratically on the dictionary size and it is not able to decode arbitrary character strings like numbers. This paper proposes word beam search decoding, which is able to tackle these problems. It constrains words to those contained in a dictionary, allows arbitrary non-word character strings between words, optionally integrates a word-level language model and has a better running time than token passing. The proposed algorithm outperforms best path decoding, vanilla beam search decoding and token passing on the IAM and Bentham HTR datasets. An open-source implementation is provided.
机译:经常性神经网络(RNN)用于序列识别任务,例如手写文本识别(HTR)或语音识别。如果以连接员时间分类(CTC)丢失函数训练,则这种RNN的输出是每个时间步骤的包含字符概率的矩阵。 CTC解码算法将这些字符概率映射到最终文本。令牌传递是这样的算法,并且能够将识别的文本限制为一系列字典单词。但是,令牌传递的运行时间在二次上依赖于字典大小,并且无法解码类似数字的任意字符串。本文提出了单词梁搜索解码,能够解决这些问题。它将单词限制为包含在字典中的单词,允许单词之间的任意非字字符串,可选地集成单词级语言模型,并且具有比令牌传递更好的运行时间。所提出的算法优于最佳路径解码,vanilla梁搜索解码和在IAM和Bentham HTR数据集上传递的令牌。提供了一个开源实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号