Optimal feature and classifier selection for text region classification in natural scene images using Weka tool

Soni Rituraj; Kumar Bijendra; Chand Satish

Abstract

Text detection and localization in scene images has remained challenging for researchers because of the diversity of these images, which includes variation in font, size, color, and background. The textual content of such images is useful in many domains, such as assistance for visually impaired people, scene understanding, and intelligent navigation. A natural scene contains non-text objects alongside the relevant text objects, and they must be classified accurately to improve the performance of a detection and localization method. Classifying text regions in scene images depends on selecting an optimal feature set and an optimal classifier, and this work contributes to finding both with the help of the Weka tool. First, we detect candidate text regions with an improved MSER algorithm; then we extract 11 features from these candidate regions. From these 11 features, we choose an optimal feature set for discriminating between text and non-text components using the CfsSubsetEval and BFS options of the Weka tool. Using these optimal features, we trained several classifiers with the Weka tool on the ICDAR 2013 training set and compared their performance empirically by classification accuracy. Based on this comparison, the Naive Bayes classifier, which achieved the highest accuracy of 92.5%, is proposed as the optimal choice for classification.
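The feature-selection step can be illustrated with a small sketch of correlation-based feature subset selection, the idea behind CfsSubsetEval: a subset scores well when its features correlate with the class but not with each other. This is not the authors' code and does not call Weka. It assumes Pearson correlation as the association measure and a plain greedy forward search in place of the search option used in the paper, and the feature names and values are hypothetical toy data, not the 11 features of the paper.

```python
import math
from itertools import combinations


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / math.sqrt(vx * vy)


def cfs_merit(columns, labels, subset):
    """CFS merit of a subset: k * r_cf / sqrt(k + k * (k - 1) * r_ff).

    r_cf is the mean absolute feature-class correlation and r_ff the mean
    absolute feature-feature correlation within the subset.
    """
    k = len(subset)
    if k == 0:
        return 0.0
    r_cf = sum(abs(pearson(columns[f], labels)) for f in subset) / k
    pairs = list(combinations(subset, 2))
    r_ff = 0.0
    if pairs:
        r_ff = sum(abs(pearson(columns[a], columns[b])) for a, b in pairs) / len(pairs)
    return k * r_cf / math.sqrt(k + k * (k - 1) * r_ff)


def greedy_cfs(columns, labels):
    """Greedy forward search: add the feature that most improves the merit, stop when none does."""
    selected, best = [], 0.0
    remaining = set(columns)
    while remaining:
        candidate, candidate_merit = None, best
        for f in sorted(remaining):
            m = cfs_merit(columns, labels, selected + [f])
            if m > candidate_merit:
                candidate, candidate_merit = f, m
        if candidate is None:
            break
        selected.append(candidate)
        remaining.remove(candidate)
        best = candidate_merit
    return selected, best


# Hypothetical toy data: 1 = text region, 0 = non-text region.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
columns = {
    "f_strong": [4, 5, 4, 5, 1, 2, 1, 2],     # separates the two classes well
    "f_redundant": [4, 5, 4, 2, 1, 2, 1, 5],  # weaker, overlaps with f_strong
    "f_noise": [1, 2, 1, 2, 1, 2, 1, 2],      # uncorrelated with the class
}

selected, merit = greedy_cfs(columns, labels)
print(selected, round(merit, 4))
```

On this toy data the search keeps only `f_strong`: adding either of the other two features lowers the merit, because they contribute little class correlation relative to the redundancy they introduce.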


Bibliographic details

Source: Multimedia Tools and Applications, 2019, issue 22, pp. 31757-31791 (35 pages)
Authors: Soni Rituraj; Kumar Bijendra; Chand Satish
Affiliations: NSIT Dept Comp Engn New Delhi India; JNU Sch Comp & Syst Sci New Delhi India
Original format: PDF
Language: English
Keywords: Extraction of text regions; MSER; Feature selection and extraction; Classification; Weka tool

