首页> 美国卫生研究院文献>PhytoKeys >The use of Optical Character Recognition (OCR) in the digitisation of herbarium specimen labels

【2h】

The use of Optical Character Recognition (OCR) in the digitisation of herbarium specimen labels

机译：光学字符识别（OCR）在植物标本标签数字化中的使用

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

At the Royal Botanic Garden Edinburgh (RBGE) the use of Optical Character Recognition (OCR) to aid the digitisation process has been investigated. This was tested using a herbarium specimen digitisation process with two stages of data entry. Records were initially batch-processed to add data extracted from the OCR text prior to being sorted based on Collector and/or Country. Using images of the specimens, a team of six digitisers then added data to the specimen records. To investigate whether the data from OCR aid the digitisation process, they completed a series of trials which compared the efficiency of data entry between sorted and unsorted batches of specimens. A survey was carried out to explore the opinion of the digitisation staff to the different sorting options. In total 7,200 specimens were processed.When compared to an unsorted, random set of specimens, those which were sorted based on data added from the OCR were quicker to digitise. Of the methods tested here, the most successful in terms of efficiency used a protocol which required entering data into a limited set of fields and where the records were filtered by Collector and Country. The survey and subsequent discussions with the digitisation staff highlighted their preference for working with sorted specimens, in which label layout, locations and handwriting are likely to be similar, and so a familiarity with the Collector or Country is rapidly established.

机译：在爱丁堡皇家植物园（RBGE），研究了使用光学字符识别（OCR）来辅助数字化过程。使用标本馆数字化过程和两个阶段的数据输入进行了测试。首先对记录进行批处理，以添加从OCR文本中提取的数据，然后再根据收集者和/或国家/地区对它们进行排序。然后，由六个数字化仪团队使用标本图像，将数据添加到标本记录中。为了研究OCR的数据是否对数字化过程有所帮助，他们完成了一系列试验，比较了已分类和未分类样品之间的数据输入效率。进行了一项调查，以探讨数字化人员对不同分类选项的看法。总共处理了7200个样本。与未分类的随机样本集相比，基于OCR添加的数据进行分类的样本更快地数字化。在此处测试的方法中，就效率而言最成功的方法是使用一种协议，该协议要求将数据输入到一组有限的字段中，并且记录由收集器和国家/地区过滤。调查和随后与数字化人员的讨论都强调了他们偏爱使用分类标本，因为标本的布局，位置和笔迹很可能相似，因此对收集者或国家/地区的熟悉迅速建立。

著录项

期刊名称 PhytoKeys
作者
Robyn E. Drinkwater; Robert W. N. Cubey; Elspeth M. Haston;
展开▼
作者单位

展开▼
年(卷),期 2014(-1),38
年度 2014
页码 15–30
总页数 16
原文格式 PDF
正文语种
中图分类
关键词
OCR Digitisation Data entry Specimen Label Herbarium;

机译：OCR;数字化;数据输入;标本;标签;标本室;
入库时间 2022-08-17 12:31:11

相似文献

外文文献
中文文献
专利

1. The use of Optical Character Recognition (OCR) in the digitisation of herbarium specimen labels [J] . Robyn E. Drinkwater, Robert W. N. Cubey, Elspeth M. Haston PhytoKeys . 2014,第38期

机译：光学字符识别（OCR）在植物标本标签数字化中的使用
2. Automated system inspects radioactive medical imaging product labels A contact image sensor (CIS) line scan camera provides clear images of radiotracer labels for optical character recognition and optical character verification tasks. [J] . James Carroll Vision Systems Design . 2019,第10期

机译：自动化系统检查放射性医学成像产品标签接触式图像传感器（CIS）线扫描相机可提供放射性示踪剂标签的清晰图像，以进行光学字符识别和光学字符验证任务。
3. Gocator® 3D smart sensors now support optical character recognition (OCR) and barcode reading [J] . Innovations in processing and packaging . 2020,第50期

机译：Gocator®3D智能传感器现在支持光学字符识别（OCR）和条形码读取
4. Foreground/background segmentation of optical character recognition (OCR) labels by a single-layer recurrent neural network [C] . Lee F. Holeva, United Parcel Service Research Development, New Milford, Applications and Science of Artificial Neural Networks . 1995

机译：单层递归神经网络对光学字符识别（OCR）标签进行前景/背景分割
5. A multimodal fusion approach for automatic postal address recognition system using Optical Character Recognition (OCR) and Automatic Speech Recognition (ASR) techniques. [D] . Singh, Amriteshwar. 2011

机译：一种使用光学字符识别（OCR）和自动语音识别（ASR）技术的自动邮政地址识别系统的多模式融合方法。
6. Quantitative Computed Tomography (QCT) as a Radiology Reporting Tool by Using Optical Character Recognition (OCR) and Macro Program [O] . Young Han Lee, Ho-Taek Song, Jin-Suck Suh 2012

机译：通过使用光学字符识别（OCR）和宏程序将定量计算机断层扫描（QCT）作为放射学报告工具
7. Figure 3 from: Drinkwater R, Cubey R, Haston E (2014) The use of Optical Character Recognition (OCR) in the digitisation of herbarium specimen labels. PhytoKeys 38: 15-30. https://doi.org/10.3897/phytokeys.38.7168 [O] . Drinkwater, Robyn, Cubey, Robert, Haston, Elspeth 2014

机译：图3来自：Drinkwater R，Cubey R，Haston E（2014）在标本馆标本标签的数字化中使用光学字符识别（OCR）。 PhytoKeys 38：15-30。 https://doi.org/10.3897/phytokeys.38.7168
8. Optical Character Recognition (OCR) Inks. Category: Hardware Standard. Subcategory: Character Recognition [R] . Owen, R. K. 1980

机译：光学字符识别（OCR）油墨。类别：硬件标准。子类别：字符识别

The use of Optical Character Recognition (OCR) in the digitisation of herbarium specimen labels

摘要

著录项

相似文献

相关主题

期刊订阅