首页> 外文会议>IEEE International Conference on High Performance Computing and Communications >Word Image Representation Based on Sequence to Sequence Model with Attention Mechanism for Out-of-Vocabulary Keyword Spotting
【24h】

Word Image Representation Based on Sequence to Sequence Model with Attention Mechanism for Out-of-Vocabulary Keyword Spotting

机译:基于序列序列模型的文字图像表示与词汇外关键字拍摄的关注机制

获取原文

摘要

To realize keyword spotting by means of query-by-example, learning efficient representation for word images is an essential issue. However, the amount of vocabulary at the training stage is often far less than the complete vocabulary of a certain language in various learning based representation approaches. Thus, unseen vocabularies might be taken as query keywords which may not exist in training set. Therefore, out-of-vocabulary (OOV) is frequently occurred in keyword spotting. In this paper, a sequence to sequence model with attention mechanism has been proposed to generate representation vectors of word images for solving the problem of OOV. After that, similarities can be calculated between each word image and a given query keyword image on their representation vectors. And then, a ranking list can be formed in descending order of the similarities for a collection of word images. Experimental results demonstrate that the proposed representation approach can be competent for the task of OOV keyword spotting and outperforms various baseline and state-of-the-art methods.
机译:要通过逐个查询实现关键字发现,Word Images的学习高效表示是一个重要问题。然而,培训阶段的词汇量往往远远低于各种基于学习的代表方法的某种语言的完整词汇量。因此,看不见的词汇表可以作为训练集可能不存在的查询关键字。因此,在关键字点化中经常发生词汇(OOV)。在本文中,已经提出了一种序列模型的序列模型,以生成词图像的表示向量,以解决OOV的问题。之后,可以在每个字图像和给定查询关键字图像之间计算其表示向量的相似性。然后,排名列表可以以用于单词图像集合的相似度的降序形成。实验结果表明,拟议的代表方法可以称赞OOV关键词点的任务,优于各种基线和最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号