Detecting figures and part labels in patents: competition-based development of graphics recognition algorithms

Riedl Christoph; Zanibbi Richard; Hearst Marti A.; Zhu Siyu; Menietti Michael; Crusan Jason; Metelsky Ivan; Lakhani Karim R.

首页> 外文期刊>International Journal on Document Analysis and Recognition >Detecting figures and part labels in patents: competition-based development of graphics recognition algorithms

【24h】

Detecting figures and part labels in patents: competition-based development of graphics recognition algorithms

机译：检测专利中的图形和零件标签：基于竞争的图形识别算法开发

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Most United States Patent and Trademark Office (USPTO) patent documents contain drawing pages which describe inventions graphically. By convention and by rule, these drawings contain figures and parts that are annotated with numbered labels but not with text. As a result, readers must scan the document to find the description of a given part label. To make progress toward automatic creation of 'tool-tips' and hyperlinks from part labels to their associated descriptions, the USPTO hosted a monthlong online competition in which participants developed algorithms to detect figures and diagram part labels. The challenge drew 232 teams of two, of which 70 teams (30 %) submitted solutions. An unusual feature was that each patent was represented by a 300-dpi page scan along with an HTML file containing patent text, allowing integration of text processing and graphics recognition in participant algorithms. The design and performance of the top-5 systems are presented along with a system developed after the competition, illustrating that the winning teams produced near state-of-the-art results under strict time and computation constraints. The first place system used the provided HTML text, obtaining a harmonic mean of recall and precision (F-measure) of 88.57 % for figure region detection, 78.81 % for figure regions with correctly recognized figure titles, and 70.98 % for part label detection and recognition. Data and source code for the top-5 systems are available through the online UCI Machine Learning Repository to support follow-on work by others in the document recognition community.

机译：大多数美国专利商标局（USPTO）专利文件都包含以图形方式描述发明的绘图页。按照惯例和规则，这些图形包含带有编号标签但不带有文本的图形和零件。结果，读者必须扫描文档以找到给定零件标签的描述。为了在自动创建“工具提示”和从零件标签到其相关描述的超链接方面取得进展，USPTO举办了为期一个月的在线竞赛，参与者开发了检测图形和图表零件标签的算法。这项挑战吸引了232个团队，每两个团队，其中70个团队（30％）提交了解决方案。一项不寻常的功能是，每项专利均以300 dpi的页面扫描以及包含专利文本的HTML文件表示，从而可以将文本处理和图形识别集成到参与者算法中。展示了前五名系统的设计和性能以及比赛后开发的系统，这说明获胜的团队在严格的时间和计算约束下产生了近乎最新的结果。第一名系统使用提供的HTML文本，对图形区域检测获得88.57％的谐波查全率和精确度（F-measure），对于具有正确识别图形标题的图形区域，获得78.81％，对于部件标签检测和零件，获得70.98％承认。前五名系统的数据和源代码可通过在线UCI机器学习存储库获得，以支持文档识别社区中其他人员的后续工作。

著录项

来源
《International Journal on Document Analysis and Recognition》 |2016年第2期|155-172|共18页
作者
Riedl Christoph; Zanibbi Richard; Hearst Marti A.; Zhu Siyu; Menietti Michael; Crusan Jason; Metelsky Ivan; Lakhani Karim R.;
展开▼
作者单位

Northeastern Univ, DAmore McKim Sch Business, Boston, MA 02115 USA|Northeastern Univ, Coll Comp & Informat Sci, Boston, MA 02115 USA|Harvard Univ, Inst Quantitat Social Sci, Cambridge, MA 02138 USA;

Rochester Inst Technol, Dept Comp Sci, Rochester, NY 14623 USA;

Univ Calif Berkeley, Sch Informat, Berkeley, CA 94720 USA;

Rochester Inst Technol, Ctr Imaging Sci, Rochester, NY 14623 USA;

Harvard Univ, Inst Quantitat Social Sci, Cambridge, MA 02138 USA;

NASA, Adv Explorat Syst Div, Washington, DC 20546 USA;

TopCoder Inc, Glastonbury, CT 06033 USA;

Harvard Univ, Sch Business, Dept Technol & Operat Management, Boston, MA 02134 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Graphics recognition; Text detection; Optical character recognition (OCR); Competitions; Crowdsourcing;

机译：图形识别;文本检测;光学字符识别（OCR）;竞争;众包;

相似文献

外文文献
中文文献
专利

1. Maximal Figure-of-Merit Framework to Detect Multi-Label Phonetic Features for Spoken Language Recognition [J] . Ivan Kukanov, Trung Ngo Trong, Ville Hautamäki, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2020,第期

机译：最大值框架用于检测用于口语语言识别的多标签语音特征
2. A New Gait Recognition System based on Hierarchical Fair Competition-based Parallel Genetic Algorithm and Selective Neural Network Ensemble [J] . Heesung Lee, Heejin Lee, Euntai Kim International Journal of Control, Automation, and Systems . 2014,第1期

机译：基于分层公平竞争的并行遗传算法和选择性神经网络集成的步态识别新系统
3. The changing face of patents in generic pharmaceutical development: Facts and figures [J] . Leighton Howard Journal of generic medicines . 2007,第2期

机译：通用制药发展专利的变化面临：事实和数据
4. Development of Pattern Recognition Algorithms to Detect Intense Convective Storms from Multispectral Satellite Imagery [C] . Konstantin V. Khlopenkov, Kristopher M. Bedka IEEE International Geoscience and Remote Sensing Symposium . 2018

机译：用于从多光谱卫星影像中检测强对流风暴的模式识别算法的开发
5. Antigone Figures: Performativity and Rhythm in the Graphics of the Text A Commentary on Texts by Carol Jacobs, Martin Heidegger, and Jacques Derrida. [D] . Lewis, Melanie. 2011

机译：安提戈涅图形：文本图形中的表现力和节奏Carol Jacobs，Martin Heidegger和Jacques Derrida对文本的评论。
6. Effect of Arrangement of Stick Figures on Estimates of Proportion in Risk Graphics [O] . Jessica S. Ancker, Elke U. Weber, Rita Kukafka -1

机译：棒状物排列对风险图形比例估计的影响
7. Detecting Figures and Part Labels in Patents: Competition-Based Development of Image Processing Algorithms [O] . Riedl, Christoph, Zanibbi, Richard, Hearst, Marti A., 2014

机译：检测专利中的数字和部分标签：基于竞争的图像处理算法的发展
8. Integrated Graphics Operations and Analysis Lab Development of Advanced Computer Graphics Algorithms [R] . Wheaton, I. M. 2011

机译：高级计算机图形算法的集成图形操作和分析实验室开发

Detecting figures and part labels in patents: competition-based development of graphics recognition algorithms

摘要

著录项

相似文献

相关主题

期刊订阅