首页> 美国政府科技报告 >Statistical, Nonparametric Methodology for Document Degradation Model Validation
【24h】

Statistical, Nonparametric Methodology for Document Degradation Model Validation

机译:文献退化模型验证的统计,非参数方法

获取原文

摘要

Printing, photocopying and scanning processes degrade the image quality of a document. Statistical models of these degradation processes are crucial for document image understanding research. Models allow us to predict system performance; conduct controlled experiments to study the break-down points of the systems; create large multi-lingual data sets with ground truth for training classifiers; design optimal noise removal algorithms; choose values for the free parameters of the algorithms; and so on. Although research in document understanding started many decades ago, only two document degradation models have been proposed this far. Furthermore, no attempts have been to statistically validate these models. In this paper we present a statistical methodology that can be used to validate local degradation models. This method is based on a non-parametric, two-sample permutation test. Another standard statistical device - the power function - is then used to choose between algorithm variables such as distance functions. Since the validation and the power function procedures are independent of the model, they can be used to validate any other degradation model. A method for comparing any two models is also described. It uses p-values associated with the estimated models to select the model that is closer to the real world.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号