An Impact of Parts Of Speech Analysis on Automatic Classification of OCR texts


Abstract

Automatic classification of Optical Character Reader (OCR) texts is important in applications such as institutional repositories and information retrieval. However, current OCR technology cannot recognize all characters with 100% accuracy. Furthermore, it is not known whether part-of-speech (POS) analysis contributes to the representation of OCR texts in a discriminative way. In this paper, we experimentally evaluated POS analysis on OCR texts to formulate an informative feature set. Empirical results indicate that selecting suitable POS improved the classification performance of OCR texts.
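
The abstract does not say which tagger, tag set, or classifier was used, so the following is only a minimal sketch of the general idea of building a feature set from selected parts of speech. The lexicon `TOY_POS_LEXICON`, the helper names, and the choice to keep nouns and verbs are all hypothetical and are not taken from the paper; a real system would use a trained POS tagger.

```python
from collections import Counter

# Hypothetical toy POS lexicon, for illustration only.
# A real pipeline would use a trained tagger instead of a lookup table.
TOY_POS_LEXICON = {
    "the": "DET",
    "digital": "ADJ",
    "library": "NOUN",
    "stores": "VERB",
    "documents": "NOUN",
    "quickly": "ADV",
}


def tag_tokens(tokens):
    """Tag each token; unknown tokens (e.g. OCR misreads) get the tag 'X'."""
    return [(tok, TOY_POS_LEXICON.get(tok.lower(), "X")) for tok in tokens]


def pos_filtered_features(text, keep_pos):
    """Return bag-of-words counts restricted to tokens whose tag is in keep_pos."""
    tagged = tag_tokens(text.split())
    return Counter(tok.lower() for tok, pos in tagged if pos in keep_pos)


# "qu1ckly" imitates an OCR error: it is not in the lexicon, so it is tagged 'X'.
features = pos_filtered_features(
    "The digital library stores the documents qu1ckly",
    keep_pos={"NOUN", "VERB"},  # assumed selection, not the paper's
)
print(features)
```

In this sketch, restricting the features to a chosen set of tags also discards tokens the lexicon cannot tag, which is one plausible way a POS filter could reduce noise from misrecognized characters. Whether that matches the mechanism studied in the paper is not stated in the abstract.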
