首页> 外文会议>International Conference on Frontiers in Handwriting Recognition >Ontology-Based Information Extraction from Handwritten Documents
【24h】

Ontology-Based Information Extraction from Handwritten Documents

机译:基于本体的信息提取文献

获取原文

摘要

In this paper we introduce a new layer for the task of handwriting recognition. We add semantic information by means of ontologies. The task of our recognizer therefore is not only to recognize the ASCII transcription of the handwritten document, but also to identify the semantic concepts which appear in the text. This task is called ontology-based information extraction (OBIE), which has been applied to electronic documents recently. OBIE methods first segment the text into tokens, then identify their values and their corresponding instances of the ontology, and finally try to generate new facts based on the text. To the authors’ knowledge, in this paper OBIE is proposed for the first time in handwriting literature. In our experiments we have evaluated the process up to the instantiation. We have found that using not only the top alternative, but also the k-best alternatives increases the performance of information extraction. Furthermore, the use of an ontology-based lexicon results in another performance increase.
机译:在本文中,我们为手写识别的任务介绍了一个新图层。我们通过本体添加语义信息。因此,我们的认识器的任务不仅要识别手写文档的ASCII转录,还不仅要识别文本中出现的语义概念。此任务称为基于本体的信息提取(obie),最近已应用于电子文件。 obie方法首先将文本分段为令牌,然后识别它们的值和它们的本体的相应实例,最后尝试基于文本生成新的事实。对于作者的知识,在本文中,在手写文学中首次提出了奥格。在我们的实验中,我们已经评估了对实例化的过程。我们发现不仅使用顶级替代方案,而且使用K-Best替代方案增加了信息提取的性能。此外,使用基于本体的词汇导致另一种性能增加。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号