首页> 外文会议>International Conference for Convergence in Technology >Handwriting Recognition for Medical Prescriptions using a CNN-Bi-LSTM Model
【24h】

Handwriting Recognition for Medical Prescriptions using a CNN-Bi-LSTM Model

机译:使用CNN-BI-LSTM模型的医疗处方手写识别

获取原文

摘要

It is commonly seen that it is tough to read the handwritten text from medical prescriptions. It is mostly due to the different style of handwriting and the use of Latin abbreviations for medical terms which is usually unknown to the general public. This can make it difficult for both patients and even pharmacists to read the prescription, which can have negative or even fatal consequences if read incorrectly. This paper demonstrates the use of a CNN-Bi-LSTM model along with Connectionist Temporal Classification. The prescribed model consists of three components, the convolutional layers for feature extraction, the Bi-LSTM network for making predictions for each frame of the context vector and the final decoding to translate each character in the recognized sequence by LSTM layers into an alphabetic character using the CTC loss function. A linear layer is added after the bi-LSTM layer to compute the final probabilities, which will be decoded. We also built a corpus manually containing the terms widely used in the medical domain, commonly used in prescriptions. We then use string matching algorithms, and string distance functions to find the nearest word in the corpus, so that bias is given to medical terms for increasing accuracy of the predicted output.
机译:通常可以看出,从医疗处方阅读手写文本很难。它主要是由于普通公众通常未知的医疗术语的手写风格和拉丁语缩写的使用。这可能使患者甚至药剂师们都难以阅读处方,这是如果读错了,这可能会产生负面甚至致命的后果。本文演示了使用CNN-BI-LSTM模型以及连接人的时间分类。规定模型由三个组件组成,用于特征提取的卷积层,用于对上下文向量的每个帧进行预测的Bi-LSTM网络以及最终解码以将LSTM层将识别的序列中的每个字符转换为字母字符CTC损耗功能。在Bi-LSTM层之后添加线性层以计算最终概率,这将被解码。我们还在手动内建立了一个常用于医疗领域的术语的语料库,通常用于处方。然后,我们使用字符串匹配算法和字符串距离函数来查找语料库中最接近的单词,从而给予偏差以增加预测输出的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号