Journal: The Computer Journal

The NoisyOffice Database: A Corpus To Train Supervised Machine Learning Filters For Image Processing

Abstract

This paper presents the 'NoisyOffice' database. It consists of images of printed text documents with noise caused mainly by the kinds of soiling found in a generic office, such as coffee stains and footprints on documents, or folded and wrinkled sheets with degraded printed text. The corpus is intended for training and evaluating supervised learning methods for cleaning, binarization and enhancement of noisy grayscale images of text documents. As an example, several image enhancement and binarization experiments using deep learning techniques are presented. Double-resolution images are also provided for testing super-resolution methods. The corpus is freely available at the UCI Machine Learning Repository. Finally, a challenge organized by Kaggle Inc. to denoise images using the database is described to show its suitability for benchmarking image processing systems.
