首页> 外文期刊>IEEE Transactions on Image Processing >Stochastic language models for style-directed layout analysis of document images
【24h】

Stochastic language models for style-directed layout analysis of document images

机译:随机语言模型,用于文档图像的样式导向布局分析

获取原文
获取原文并翻译 | 示例

摘要

Image segmentation is an important component of any document image analysis system. While many segmentation algorithms exist in the literature, very few i) allow users to specify the physical style, and ii) incorporate user-specified style information into the algorithm's objective function that is to be minimized. We describe a segmentation algorithm that models a document's physical structure as a hierarchical structure where each node describes a region of the document using a stochastic regular grammar. The exact form of the hierarchy and the stochastic language is specified by the user, while the probabilities associated with the transitions are estimated from groundtruth data. We demonstrate the segmentation algorithm on images of bilingual dictionaries.
机译:图像分割是任何文档图像分析系统的重要组成部分。尽管文献中存在许多分割算法,但极少有i)允许用户指定物理样式,ii)将用户指定的样式信息合并到要最小化的算法目标函数中。我们描述了一种分割算法,该算法将文档的物理结构建模为分层结构,其中每个节点都使用随机规则语法描述文档的区域。层次结构和随机语言的确切形式由用户指定,而与过渡相关的概率则根据地面数据进行估算。我们展示了双语词典图像的分割算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号