EATEN: Entity-Aware Attention for Single Shot Visual Text Extraction

机译：就餐：单次视觉文本提取中的实体感知注意

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Extracting Text of Interest (ToI) from images is a crucial part of many OCR applications, such as entity recognition of cards, invoices, and receipts. Most of the existing works employ complicated engineering pipeline, which contains OCR and structure information extraction, to fulfill this task. This paper proposes an Entity-aware Attention Text Extraction Network called EATEN, which is an end-to-end trainable system to extract the ToIs without any post-processing. In the proposed framework, each entity is parsed by its corresponding entity-aware decoder, respectively. Moreover, we innovatively introduce a state transition mechanism which further improves the robustness of visual ToI extraction. In consideration of the absence of public benchmarks, we construct a dataset of almost 0.6 million images in three real-world scenarios (train ticket, passport and business card), which is publicly available at https://github.com/beacandler/EATEN. To the best of our knowledge, EATEN is the first single shot method to extract entities from images. Extensive experiments on these benchmarks demonstrate the state-of-the-art performance of EATEN.

机译：从图像中提取感兴趣的文本（TOI）是许多OCR应用程序的重要组成部分，例如卡片，发票和收据的实体识别。大多数现有工程采用复杂的工程管道，其中包含OCR和结构信息提取，以满足这项任务。本文提出了一个名为EATEN的实体感知注意文本提取网络，这是一个端到端的培训系统，可以在没有任何后处理的情况下提取TOIS。在所提出的框架中，每个实体分别由其相应的实体感知解码器解析。此外，我们创新地引入了一种状态转换机制，该机制进一步提高了视觉TOI提取的鲁棒性。考虑到缺乏公共基准，我们在三个现实世界场景（火车票，护照和名片）中构建了近60万个图像的数据集，该数据集在HTTPS://github.com/beacandler/eaten公开提供。据我们所知，Eaten是第一个从图像中提取实体的单一拍摄方法。这些基准的广泛实验证明了食用的最先进的性能。

著录项

来源
《International Conference on Document Analysis and Recognition》|2019年|254-259|共6页
会议地点
作者
He Guo; Xiameng Qin; Jiaming Liu; Junyu Han; Jingtuo Liu; Errui Ding;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Decoding; Feature extraction; Text recognition; Visualization; Image recognition; Business; Optical character recognition software;

机译：解码;特征提取;文本识别;可视化;图像识别;业务;光学字符识别软件;

相似文献

外文文献
中文文献
专利

1. Validation of the TOtal Visual acuity extraction Algorithm (TOVA) for automated extraction of visual acuity and intraocular pressure data from free text clinical records [J] . Baughman Doug, Lee Cecilia, Lee Aaron Y. Investigative ophthalmology & visual science . 2017,第8期

机译：从自由文本临床记录中验证可视敏锐度和眼内压力数据的自动提取敏锐提取算法（TOVA）
2. Validation of the TOtal Visual acuity extraction Algorithm (TOVA) for automated extraction of visual acuity and intraocular pressure data from free text clinical records [J] . Baughman Doug, Lee Cecilia, Lee Aaron Y. Investigative ophthalmology & visual science . 2017,第8期

机译：从自由文本临床记录中验证可视敏锐度和眼内压力数据的自动提取敏锐提取算法（TOVA）
3. A gated piecewise CNN with entity-aware enhancement for distantly supervised relation extraction [J] . Haixu Wen, Xinhua Zhu, Lanfang Zhang, Information Processing & Management . 2020,第6期

机译：具有实体感知增强的门控分段CNN，用于远处监督相关性提取
4. EATEN: Entity-Aware Attention for Single Shot Visual Text Extraction [C] . He Guo, Xiameng Qin, Jiaming Liu, International Conference on Document Analysis and Recognition . 2019

机译：Eaten：单次视觉文本提取的实体感知注意力
5. Reduced working memory capacity leads to attentional capture by an irrelevant color singleton during inefficient visual search. [D] . Burnham, Bryan R. 2007

机译：降低的工作存储容量会导致无效的视觉搜索过程中不相关的颜色单调引起注意。
6. Validation of the Total Visual Acuity Extraction Algorithm (TOVA) for Automated Extraction of Visual Acuity Data From Free Text Unstructured Clinical Records [O] . Douglas M. Baughman, Grace L. Su, Irena Tsui, -1

机译：从自由文本非结构化临床记录中自动提取视敏度数据的总视敏度提取算法（TOVA）的验证
7. EATEN: Entity-Aware Attention for Single Shot Visual Text Extraction [O] . He Guo, Xiameng Qin, Jiaming Liu, 2019

机译：Eaten：单次视觉文本提取的实体感知注意力

EATEN: Entity-Aware Attention for Single Shot Visual Text Extraction

摘要

著录项

相似文献

相关主题

期刊订阅