...
首页> 外文期刊>International Journal of Innovative Research in Science, Engineering and Technology >An Advanced Approach for Text Query Searching and Word Spotting In Word Images
【24h】

An Advanced Approach for Text Query Searching and Word Spotting In Word Images

机译:Word图像中文本查询搜索和单词发现的高级方法

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Word-spotting refers to the problem of detecting specific keywords in document images. Here we focus on handwritten word images. Keyword spotting in handwritten image document in the existing work is based upon BLSTM Neural Networks which consist of two parts. First part is preprocessing phase, performed by the neural network. It maps each position of an input sequence to a vector, indicating the probability of each character possibly being written at that position. The second part, called the CTC Token Passing algorithm, takes this sequence of letter probabilities, as well as a dictionary and a language model, as its input and computes a likely sequence of words. By extending this work, the present work proposes Information retrieval and information (text) extraction methods from all handwritten documents of images. In Information retrieval approach the input query is text format .The text is matched with template character then the query image is created from template characters. This proposed approach provides an efficient way of searching text like queries in document images. The text extraction from the images includes thresholding, segmentation, edge detection and text extraction algorithm. The experimental results show the performance of the proposed algorithms achieves higher accuracy rates than existing approaches.
机译:单词发现是指检测文档图像中特定关键字的问题。在这里,我们专注于手写文字图像。现有工作中手写图像文档中的关键词识别是基于BLSTM神经网络的,该网络由两部分组成。第一部分是预处理阶段,由神经网络执行。它将输入序列的每个位置映射到一个向量,指示每个字符可能在该位置写入的概率。第二部分称为CTC令牌传递算法,将字母概率的这种序列以及字典和语言模型作为其输入,并计算可能的单词序列。通过扩展这项工作,本工作提出了从图像的所有手写文档中进行信息检索和信息(文本)提取的方法。在信息检索方法中,输入查询为文本格式。文本与模板字符匹配,然后从模板字符创建查询图像。提出的方法提供了一种有效的方式来搜索文本,例如文档图像中的查询。从图像中提取文本包括阈值化,分割,边缘检测和文本提取算法。实验结果表明,与现有方法相比,所提算法的准确率更高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号