...
首页> 外文期刊>International Journal of Image Processing >Header Based Classification of Journals Using Document Image Segmentation and Extreme Learning Machine
【24h】

Header Based Classification of Journals Using Document Image Segmentation and Extreme Learning Machine

机译:使用文档图像分割和极限学习机的基于标题的期刊分类

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Document image segmentation plays an important role in classification of journals, magazines, newspaper, etc., It is a process of splitting the document into distinct regions. Document layout analysis is a key process of identifying and categorizing the regions of interest in the scanned image of a text document. A reading system requires the segmentation of text zones from non- textual ones and the arrangement in their correct reading order. Detection and labelling of text zones play different logical roles inside the document such as titles, captions, footnotes, etc. This research work proposes a new approach to segment the document and classify the journals based on the header block. Documents are collected from different journals and used as input image. The image is segmented into blocks like heading, header, author name and footer using Particle Swarm optimization algorithm and features are extracted from header block using Gray Level Co-occurrences Matrix. Extreme Learning Machine has been used for classification based on the header blocks and obtained 82.3% accuracy.
机译:文档图像分割在期刊,杂志,报纸等的分类中起着重要作用,这是将文档分为不同区域的过程。文档布局分析是识别和分类文本文档扫描图像中感兴趣区域的关键过程。阅读系统要求将文本区域从非文本区域中分割出来,并按照其正确的阅读顺序进行排列。文本区域的检测和标记在文档中扮演着不同的逻辑角色,例如标题,标题,脚注等。这项研究工作提出了一种新的方法来对文档进行分段,并基于页眉块对期刊进行分类。从不同的期刊收集文档,并将其用作输入图像。使用粒子群优化算法将图像分成标题,标题,作者姓名和页脚等块,并使用灰度共生矩阵从标题块中提取特征。极限学习机已用于基于标题块的分类,并获得82.3%的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号