Features for word spotting in historical manuscripts

机译：历史手稿中的单词发现功能

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

For the transition from traditional to digital libraries, the large number of handwritten manuscripts that exist pose a great challenge. Easy access to such collections requires an index, which is currently created manually at great cost. Because automatic handwriting recognizers fail on historical manuscripts, the word spotting technique has been developed: the words in a collection are matched as images and grouped into clusters which contain all instances of the same word. By annotating "interesting" clusters, an index that links words to the locations where they occur can be built automatically. Due to the noise in historical documents, selecting the right features for matching words is crucial. We analyzed a range of features suitable for matching words using dynamic time warping (DTW), which aligns and compares sets of features extracted from two images. Each feature's individual performance was measured on a test set. With an average precision of 72%, a combination of features outperforms competing techniques in speed and precision.

机译：对于从传统图书馆到数字图书馆的过渡，现有的大量手写手稿构成了巨大的挑战。要轻松访问此类集合，需要一个索引，该索引当前是手动创建的，成本很高。由于自动手写识别器在历史手稿上失败，因此开发了单词识别技术：将集合中的单词作为图像进行匹配，并分组为包含同一单词所有实例的簇。通过注释“有趣的”群集，可以自动建立将单词链接到单词出现位置的索引。由于历史文献中的杂音，选择合适的特征以匹配单词至关重要。我们使用动态时间规整（DTW）分析了适合匹配单词的一系列特征，该特征对齐并比较了从两个图像中提取的特征集。每个功能的个别性能均在测试集上进行了测量。这些功能的组合平均精度为72％，在速度和精度方面均优于同类技术。

著录项

来源
《Industrial Applications of AI (Artificial Intelligence)》|1992年|p.218-222|共5页
会议地点
作者
Rath T.M.; Manmatha R.;
展开▼
作者单位

Center for Intelligent Inf. Retrieval, Massachusetts Univ., Amherst, MA, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
入库时间 2022-08-26 13:52:27

相似文献

外文文献
中文文献
专利

1. Graph-based keyword spotting in historical manuscripts using Hausdorff edit distance [J] . Ameri Mohammad Reza, Stauffer Michael, Riesen Kaspar, Pattern recognition letters . 2019,第APRa期

机译：使用Hausdorff编辑距离在历史手稿中基于图的关键字识别
2. A Word Spotting Method for Arabic Manuscripts Based on Speeded Up Robust Features Technique [J] . Noureddine El Makhfi Advances in Science, Technology and Engineering Systems . 2019,第6期

机译：基于加速鲁棒特征技术的阿拉伯语手稿的单词发现方法
3. A line-based representation for matching words in historical manuscripts [J] . Ethem F. Can, Pinar Duygulu Pattern recognition letters . 2011,第8期

机译：基于行的表示形式，用于匹配历史手稿中的单词
4. Features for word spotting in historical manuscripts [C] . Rath, T.M., Manmatha, . 2003

机译：历史手稿中的单词发现功能
5. Design of Keyword Spotting System Based on Segmental Time Warping of Quantized Features. [D] . Karmacharya, Piush. 2012

机译：基于量化特征分段时间规整的关键词识别系统设计。
6. Hough Transform-Based Angular Features for Learning-Free Handwritten Keyword Spotting [O] . Subhranil Kundu, Samir Malakar, Zong Woo Geem, 2021

机译：基于Hough的转换的角度特征用于无学习手写关键字斑点
7. Features for Word Spotting in Historical Manuscripts [O] . Toni Rath And 2008

机译：历史手稿中的词语识别功能

Features for word spotting in historical manuscripts

摘要

著录项

相似文献

相关主题

期刊订阅