首页> 外文会议>2012 IEEE International Conference on Emerging Signal Processing Applications >Improved parcel sorting by combining automatic speech and character recognition
【24h】

Improved parcel sorting by combining automatic speech and character recognition

机译:通过结合自动语音和字符识别功能来改进包裹分拣

获取原文
获取原文并翻译 | 示例

摘要

Automatic postal sorting systems have traditionally relied on optical character recognition (OCR) technology. While OCR systems perform well for flat mail items such as envelopes, the performance deteriorates for parcels. In this study, we propose a new multimodal solution for parcel sorting which combines automatic speech recognition (ASR) technology with OCR in order to deliver better performance. Our multimodal approach is based on estimating OCR output confidence, and then optionally using ASR system output when OCR results show low confidence. Particularly, we proposed a Levenshtein edit distance (LED) based measure to compute OCR confidence. Based on the OCR confidence measure, a dynamic fusion strategy is developed that forms its final decision based on (i) OCR output alone, (ii) ASR output alone, and (iii) combination of ASR and OCR outputs. The proposed system is evaluated on speech and image data collected in real-world conditions. Our experiments show that the proposed multimodal solution achieves an overall zip code recognition rate of 90.2%, which is a substantial improvement over ASR alone (81%) and OCR alone (80.6%) systems. This advancement represents an important contribution that leverages OCR and ASR technologies to improve address recognition in parcels.
机译:传统上,自动邮政分拣系统依靠光学字符识别(OCR)技术。尽管OCR系统在诸如信封之类的扁平邮件项目中表现良好,但对于包裹来说,性能却下降了。在这项研究中,我们提出了一种新的用于包裹分拣的多模式解决方案,该解决方案将自动语音识别(ASR)技术与OCR相结合,以提供更好的性能。我们的多模式方法基于估计OCR输出的置信度,然后在OCR结果显示低置信度时选择使用ASR系统输出。特别是,我们提出了一种基于Levenshtein编辑距离(LED)的方法来计算OCR置信度。基于OCR置信度,开发了一种动态融合策略,该动态融合策略基于(i)仅OCR输出,(ii)仅ASR输出以及(iii)ASR和OCR输出的组合来形成最终决策。所提出的系统是根据在现实条件下收集的语音和图像数据进行评估的。我们的实验表明,提出的多模式解决方案实现了90.2%的总体邮政编码识别率,这相对于仅使用ASR(81%)和仅使用OCR(80.6%)的系统来说是一个重大的改进。这一进步代表了利用OCR和ASR技术改善包裹中地址识别的重要贡献。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号