Greek Alphabet Recognition Technique for Biomedical Documents

机译：生物医学文献的希腊字母识别技术

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Most current commercial optical character recognition (OCR) systems can accurately recognize the text in documents written in a single language. However, when dealing with Greek characters embedded in predominantly English text, these systems do not perform well, and most OCR systems do not recognize the characters as belonging to the Greek alphabet. As a result, the degree of manual review required to validate and correct OCR errors is high. To handle this problem, we propose a new technique based on features calculated from the output of multiple OCR systems, and combined with string pattern matching and document content analysis to improve the recognition of both Greek characters and regular text. Our proposed technique uses two passes of a document page image through OCR systems that use different recognition languages. Experiments carried out on a sample of medical journals show the feasibility of using the proposed technique for Greek character recognition. Preliminary evaluation conducted on a sample of medical journal page images shows that our approach improves the recognition of Greek characters embedded within predominantly English language text.

机译：当前，大多数商业光学字符识别（OCR）系统都可以准确地识别以单一语言编写的文档中的文本。但是，当处理主要嵌入在英语文本中的希腊字符时，这些系统的性能不佳，并且大多数OCR系统都不认为该字符属于希腊字母。结果，验证和纠正OCR错误所需的手动检查程度很高。为了解决这个问题，我们提出了一种新技术，该技术基于从多个OCR系统的输出中计算出的特征，并与字符串模式匹配和文档内容分析相结合，以提高对希腊字符和常规文本的识别能力。我们提出的技术通过使用不同识别语言的OCR系统两次使用文档页面图像。在医学期刊样本上进行的实验表明，使用提议的技术进行希腊字符识别的可行性。对医学期刊页面图像样本进行的初步评估表明，我们的方法提高了对嵌入英语文本为主的希腊字符的识别能力。

著录项

来源
《World Multiconference on Systemics, Cybernetics and Informatics(SCI 2002) v.3: Image, Acoustic, Speech and Signal Processing I; 20020714-20020718; Orlando,FL; US》|2002年|P.86-91|共6页
会议地点 Orlando FL(US);Orlando FL(US)
作者
Daniel X. Le; Scott R. Straughan; George R. Thoma;
展开▼
作者单位

National Library of Medicine 8600 Rockville Pike, Bethesda, MD 20894;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
greek character recognition; optical character recognition; automated document data entry; MEDLINEreg; database; national library of medicine;

机译：希腊字符识别;光学字符识别;自动文档数据输入;MEDLINE®数据库;国家医学图书馆;

相似文献

外文文献
中文文献
专利

1. A Comparison of Techniques for Sign Language Alphabet Recognition Using Armband Wearables [J] . PRAJWAL PAUDYAL, JUNCHYO LEE, AYAN BANERJEE, ACM Transactions on Interactive Intelligent Systems . 2019,第2a3期

机译：使用袖标可穿戴设备进行手语字母识别技术的比较
2. A Comparison of Techniques for Sign Language Alphabet Recognition Using Armband Wearables [J] . PRAJWAL PAUDYAL, JUNCHYO LEE, AYAN BANERJEE, ACM Transactions on Interactive Intelligent Systems . 2019,第2a3期

机译：使用臂带可穿戴物的手语字母识别技术比较
3. Comparative study of classification techniques on biomedical data from hypertext documents [J] . Rashedur M. Rahman, Sazia Salahuddin International journal of knowledge engineering and soft data paradigms . 2013,第1期

机译：超文本文件生物医学数据分类技术的比较研究
4. Greek Alphabet Recognition Technique for Biomedical Documents [C] . Daniel X. Le, Scott R. Straughan, George R. Thoma World Multi-conference on Systemics, Cybernetics and Informatics . 2002

机译：生物医学文件的希腊字母识别技术
5. Artificial neural network and its applications in quality process control, document recognition and biomedical imaging. [D] . Islam, Mohammed Jahirul. 2010

机译：人工神经网络及其在质量过程控制，文档识别和生物医学成像中的应用。
6. Detecting experimental techniques and selecting relevant documents for protein-protein interactions from biomedical literature [O] . Xinglong Wang, Rafal Rak, Angelo Restificar, 2011

机译：从生物医学文献中检测实验技术并选择蛋白质-蛋白质相互作用的相关文献
7. Detecting experimental techniques and selecting relevant documents for protein-protein interactions from biomedical literature [O] . Xinglong Wang, Rafal Rak, Angelo Restificar, 2011

机译：从生物医学文献中检测实验技术并选择蛋白质-蛋白质相互作用的相关文献

Greek Alphabet Recognition Technique for Biomedical Documents

摘要

著录项

相似文献

相关主题

期刊订阅