首页> 外文会议>IEEE International Conference on High Performance Computing and Communications >Word Image Representation Based on Sequence to Sequence Model with Attention Mechanism for Out-of-Vocabulary Keyword Spotting

【24h】

Word Image Representation Based on Sequence to Sequence Model with Attention Mechanism for Out-of-Vocabulary Keyword Spotting

机译：基于序列序列模型的文字图像表示与词汇外关键字拍摄的关注机制

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

To realize keyword spotting by means of query-by-example, learning efficient representation for word images is an essential issue. However, the amount of vocabulary at the training stage is often far less than the complete vocabulary of a certain language in various learning based representation approaches. Thus, unseen vocabularies might be taken as query keywords which may not exist in training set. Therefore, out-of-vocabulary (OOV) is frequently occurred in keyword spotting. In this paper, a sequence to sequence model with attention mechanism has been proposed to generate representation vectors of word images for solving the problem of OOV. After that, similarities can be calculated between each word image and a given query keyword image on their representation vectors. And then, a ranking list can be formed in descending order of the similarities for a collection of word images. Experimental results demonstrate that the proposed representation approach can be competent for the task of OOV keyword spotting and outperforms various baseline and state-of-the-art methods.

机译：要通过逐个查询实现关键字发现，Word Images的学习高效表示是一个重要问题。然而，培训阶段的词汇量往往远远低于各种基于学习的代表方法的某种语言的完整词汇量。因此，看不见的词汇表可以作为训练集可能不存在的查询关键字。因此，在关键字点化中经常发生词汇（OOV）。在本文中，已经提出了一种序列模型的序列模型，以生成词图像的表示向量，以解决OOV的问题。之后，可以在每个字图像和给定查询关键字图像之间计算其表示向量的相似性。然后，排名列表可以以用于单词图像集合的相似度的降序形成。实验结果表明，拟议的代表方法可以称赞OOV关键词点的任务，优于各种基线和最先进的方法。

著录项

来源
《IEEE International Conference on High Performance Computing and Communications 》|2019年|lxxi705 p. :|共8页
会议地点
作者
Hongxi Wei; Yanke Kang; Hui Zhang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术 ;
关键词
Visualization; Task analysis; Logic gates; Training; Vocabulary; Image segmentation; Decoding;

机译：可视化;任务分析;逻辑门;培训;词汇;图像分割;解码;

相似文献

外文文献
中文文献
专利

1. Querying out-of-vocabulary words in lexicon-based keyword spotting [J] . Puigcerver Joan, Toselli Alejandro H., Vidal Enrique Neural computing & applications . 2017 ,第9期

机译：查询基于词汇的关键字斑点中的词汇单词
2. HMM word graph based keyword spotting in handwritten document images [J] . Toselli Alejandro Hector, Vidal Enrique, Romero Veronica, Information Sciences: An International Journal . 2016 ,第Null期

机译：手写文档图像中基于HMM词图的关键词识别
3. A software program combining sequence motif searches with keywords for finding repeats containing DNA sequences [J] . Mehmet Bilgen, Mehmet Karaca, A. Nad Onus, Bioinformatics . 2004 ,第18期

机译：结合了序列基序搜索和关键字的软件程序，用于查找包含DNA序列的重复序列
4. Word Image Representation Based on Sequence to Sequence Model with Attention Mechanism for Out-of-Vocabulary Keyword Spotting [C] . Hongxi Wei, Yanke Kang, Hui Zhang IEEE International Conference on High Performance Computing and Communications;IEEE International Conference on Smart City;IEEE International Conference on Data Science and Systems . 2019

机译：基于序列到序列模型的单词图像表示法及注意机制
5. Whisper speech processing: Analysis, modeling, and detection with applications to keyword spotting. [D] . Zhang, Chi. 2012

机译：悄悄话语处理：分析，建模和检测，以及关键词发现的应用。
6. An effective content-based image retrieval technique for image visuals representation based on the bag-of-visual-words model [O] . Safia Jabeen, Zahid Mehmood, Toqeer Mahmood, -1

机译：基于视觉袋模型的基于内容的有效图像检索技术
7. Streaming Small-Footprint Keyword Spotting using Sequence-to-Sequence Models [O] . He, Yanzhang, Prabhavalkar, Rohit, Rao, Kanishka, 2017

机译：使用序列到序列流式传输小足迹关键字楷模

Word Image Representation Based on Sequence to Sequence Model with Attention Mechanism for Out-of-Vocabulary Keyword Spotting

摘要

著录项

相似文献

相关主题

期刊订阅