首页> 外文会议>International Symposium on Advances in Visual Computing >A Robust Two Level Classification Algorithm for Text Localization in Documents
【24h】

A Robust Two Level Classification Algorithm for Text Localization in Documents

机译:一种强大的两级分类算法,用于文档中的文本本地化

获取原文

摘要

This paper describes a two level classification algorithm to discriminate the handwritten elements from the printed text in a printed document. The proposed technique is independent of size, slant, orientation, translation and other variations in handwritten text. At the first level of classification, we use two classifiers and present a comparison between the nearest neighbour classifier and Support Vector Machines (SVM) classifier to localize the handwritten text. The features that are extracted from the document are seven invariant central moments and based on these features, we classify the text as hand-written. At the second level, we use Delaunay triangulation to reclassify the misclassified elements. When Delaunay triangulation is imposed on the centroid points of the connected components, we extract features based on the triangles and reclassify the misclassified elements. We remove the noise components in the document as part of the pre-processing step.
机译:本文介绍了一种两个级别的分类算法,可以在打印文档中区分手写元素。所提出的技术与手写文本的大小,倾斜,方向,翻译和其他变体无关。在第一级分类中,我们使用两个分类器并在最近的邻居分类器和支持向量机(SVM)分类器之间呈现比较以本地化手写文本。从文档中提取的功能是七个不变的中央瞬间,并根据这些功能,将文本分类为手写。在第二级,我们使用Delaunay三角测量来重新分类错误分类的元素。当施加到连接组件的质心指数上施加了Delaunay三角测量时,我们基于三角形提取特征,并重新分类错误分类的元素。我们将文档中的噪声组件作为预处理步骤的一部分删除。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号