Conference: International Conference on Agents and Artificial Intelligence

Enhance Text Recognition by Image Pre-Processing to Facilitate Library Services by Mobile Devices

Abstract

Faced with the popularity of web searching, libraries have over the past decades continuously invested in online search services and refurbished their physical facilities to attract users. In this study, we assessed the technical feasibility of facilitating library services by applying a novel image pre-processing technique that enhances the performance of OCR on mobile devices. In the binarization stage, a grayscale image is usually converted to binary using a single global threshold value, which is unsuitable for some scenarios, such as non-uniform lighting and complicated backgrounds. Instead of dividing the grayscale image into many regions as other studies do, our approach partitions an image into only three equal-sized horizontal segments, identifies a local threshold value for each segment, and then reassembles the three segments into the original layout. The experimental results show that the proposed method efficiently and effectively improves text recognition. Across all test images, the accuracy rate rose from 17.7% to 72.05%. Excluding eight unrecognizable images, the average accuracy rate of our treatment reaches 90.06%. To compare with other studies, we conducted a second evaluation to examine the validity of our approach. The results showed that our treatment outperforms most of the other studies, achieving 74.6% precision and 80.2% recall. We are confident that this design will not only make libraries more convenient for users but also help library staff and businesspeople manage the status of books.
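The three-segment binarization described in the abstract can be illustrated with a minimal sketch. The abstract does not say how each segment's local threshold is chosen, so this example assumes Otsu's method; it also assumes that "horizontal segments" means three bands stacked from top to bottom. The function names `otsu_threshold` and `binarize_three_bands` are hypothetical and are not taken from the paper.

```python
import numpy as np


def otsu_threshold(gray: np.ndarray) -> int:
    """Return an Otsu threshold (0-255) for an 8-bit grayscale array.

    Assumption: the paper does not name its thresholding rule; Otsu's
    method is used here only as a common stand-in.
    """
    hist = np.bincount(gray.ravel(), minlength=256).astype(np.float64)
    levels = np.arange(256)
    weight_bg = np.cumsum(hist)               # pixels with value <= t
    weight_fg = hist.sum() - weight_bg        # pixels with value > t
    cum_sum = np.cumsum(hist * levels)
    mean_bg = cum_sum / np.maximum(weight_bg, 1)
    mean_fg = (cum_sum[-1] - cum_sum) / np.maximum(weight_fg, 1)
    # Between-class variance; the first maximum is taken as the threshold.
    between_var = weight_bg * weight_fg * (mean_bg - mean_fg) ** 2
    return int(np.argmax(between_var))


def binarize_three_bands(gray: np.ndarray) -> np.ndarray:
    """Binarize a uint8 grayscale image band by band, then reassemble it.

    Assumption: the three segments are bands stacked top to bottom.
    np.array_split lets band heights differ by one row when the image
    height is not divisible by three.
    """
    bands = np.array_split(gray, 3, axis=0)
    binary_bands = [
        np.where(band > otsu_threshold(band), 255, 0).astype(np.uint8)
        for band in bands
    ]
    # Stack the bands back in their original order.
    return np.vstack(binary_bands)
```

The input is expected to be a 2-D `uint8` array; the returned array has the same shape and contains only 0 and 255, so it can be handed to an OCR engine in place of a globally thresholded image. Because each band gets its own threshold, text in a dimly lit band is not lost to a cutoff tuned for a brighter band.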
