Word Extraction Method by Generating Multiple Character Hypotheses

机译：生成多个字符假设的单词提取方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

It is necessary to extract precisely words of headers and data for recognizing logical structure of form images. However, word extraction often fails because of layout analysis or character recognition error, which leads correct character hypotheses not to be generated. We propose a word extraction method which generates multiple character hypotheses and extracts their combinations which correspond with the character orders of words. Firstly character hypotheses which overlap with each other are generated by combinatorial recognition of connected components and their combinations which correspond with words are extracted by clique extraction from a graph. And then, character hypotheses are generated by recognition with limited target and their combinations which correspond with words areextracted by matching between lattices based on local optimum, in which variety of recognition results and regular expression of words are considered. We confirmed the effect of our method by the experiment for form images.

机译：为了识别表格图像的逻辑结构，有必要精确地提取标题和数据的单词。但是，由于布局分析或字符识别错误，单词提取通常会失败，从而导致无法生成正确的字符假设。我们提出了一种单词提取方法，该方法生成多个字符假设并提取与单词的字符顺序相对应的组合。首先，通过对所连接的组件进行组合识别来生成彼此重叠的字符假设，并且通过从图上进行集团提取来提取与单词相对应的它们的组合。然后，通过对目标进行有限的识别来生成字符假设，并根据局部最优值通过格间匹配来提取与单词相对应的组合，其中考虑了各种识别结果和单词的正则表达。我们通过表格图像实验确认了我们方法的效果。

著录项

来源
《Document Analysis Systems, DAS, 2008 Eighth IAPR Workshop on》||P.299-306|共8页
会议地点
作者
Takebe Hiroaki; Fujimoto Katsuhito;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类工业技术;
关键词

相似文献

外文文献
中文文献
专利

1. Joint Modeling of Characters, Words, and Conversation Contexts for Microblog Keyphrase Extraction [J] . Yinqyi Zhang, Chengzhi Zhang, Jing Li Journal of the American Society for Information Science . 2020,第5期

机译：字符，单词和会话上下文的联合建模，用于微博客关键词提取
2. An original methodology for the analysis and interpretation of word-count based methods: multiple factor analysis for contingency tables complemented by consensual words. [J] . Kostov B, Becue-Bertaut M, Husson F. Food Quality and Preference . 2014,第Pta1期

机译：用于分析和解释基于单词计数的方法的原始方法：对列联表的多因素分析，并辅以达成共识的单词。
3. Analytical Methods for Observational Data to Generate Hypotheses and Inform Clinical Decisions [J] . DeWees Todd A., Vargas Carlos E., Golafshar Michael A., Seminars in radiation oncology . 2019,第4期

机译：用于产生假设的观测数据的分析方法，并提供临床决策
4. Word Extraction Method by Generating Multiple Character Hypotheses [C] . Takebe Hiroaki, Fujimoto Katsuhito IAPR International Workshop on Document Analysis Systems . 2008

机译：通过生成多个角色假设来提取方法
5. Evolving neural net circuit modules to detect characters of the alphabet and sequences of characters (words) using the cellular automata module-brain machine. [D] . DeCesare, Derek. 2001

机译：不断发展的神经网络电路模块，使用元胞自动机模块-大脑机器来检测字母字符和字符序列（单词）。
6. The Fractal Patterns of Words in a Text: A Method for Automatic Keyword Extraction [O] . Elham Najafi, Amir H. Darooneh -1

机译：文本中词的分形模式：一种自动关键词提取方法
7. ABSTRACT Various body parts or organs can be analysed to identify the different diseases in the human body. Fingernail analysis is one of the ways to identify disease in the human body. Nails are the body part which are farthest from the heart and therefore receive oxygen at last. As a result the nails are the first who show the symptoms of a disease in the human body. Fingernails can be easily captured for diagnosis and there are no heavy equipment or no specific conditions required to use nail image for disease diagnosis, like in other tests and scanning processes. Human nails deliver beneficial information about complaints or any nutritive imbalances in the human body depending upon their shape, texture and colour. In human beings, numerous systemic and skin diseases can be easily analyzed through careful examination of nails of both the limbs. A lot of nail illnesses have been found to be primary signs of numerous underlying systemic illnesses. The colour, texture or shape changes in nails are signs of many diseases mainly affecting nails. Considering all these properties of nails a system is proposed that uses digital image processing (DIP) methods for identifying such changes in the human nail to get more precise results and predict numerous diseases effortlessly. With the emerging Internet of Things (IOT) concept the generated report is made available remotely, this will help users to reduce transportation efforts. As the system has to deal with large and private data, the security of data must be ensured. To keep the data confidential, the Blockchain concept which is one of the most emerging concepts in the field of data management is used. The paper contains the implementation of the digital image processing for feature extraction of nail images, usage of IOT (ThingSpeak cloud) for data storage and implementation of Blockchain to keep the system secured and theft free. KEY WORDS: Int ernet of thin gs (IOT), Image proc essin g, Thin gSpeak, RG B vavalues, Mean pi xel vavalues, Bloc kchain , Hash key. Disease Diagnostic System: Abnormalities in Human Nail [O] . Pranav S. Wazarkar 2020

机译：摘要的各个身体部位或器官可被分析以识别在人体内的不同的疾病。指甲分析来识别人体疾病的方法之一。指甲是身体一部分是离心脏最远，因此在最后接受氧气。作为结果，指甲是第一谁表现出人体疾病的症状。指甲可以容易地捕获用于诊断和没有重装或需要使用指甲图像用于疾病诊断，比如在其他测试和扫描过程没有特定的条件。人的指甲提供有关投诉或取决于它们的形状，纹理和色彩在人体内的任何营养失衡有益的信息。在人类中，许多全身性皮肤疾病是可以很容易地通过两个四肢指甲的仔细检查分析。很多指甲病已发现众多潜在系统性疾病的主要症状。在指甲的颜色，质地和形状的变化是许多疾病主要影响指甲的迹象。考虑到所有的指甲的这些性能的系统被提出，用于识别人指甲这样的变化以获得更精确的结果，并毫不费力预测许多疾病用途的数字图像处理（DIP）方法。随着物联网（IOT）的概念，新兴的互联网将生成的报告提供远程，这将帮助用户降低运输工作。由于系统必须处理大量的私人数据，数据的安全性必须得到保证。为了保持数据的机密性，使用Blockchain的概念，它是在数据管理领域的大多数新兴的概念之一。本文包含了数字图像处理的指甲图像，IOT（ThingSpeak云）的使用为数据存储和执行Blockchain的特征提取的执行，以保持固定的系统和盗窃免费。关键词：诠释薄GS（IOT），图像的ERNET PROC essin克，薄型gSpeak，RG乙vavalues，平均数PI XEL vavalues，阵营kchain，哈希密钥。疾病诊断系统：在人类指甲异常

Word Extraction Method by Generating Multiple Character Hypotheses

摘要

著录项

相似文献

相关主题

期刊订阅