首页> 外文会议>International Conference on Big Data, Small Data, Linked Data and Open Data >From Linguistic Resources to Medical Entity Recognition: a Supervised Morpho-syntactic Approach
【24h】

From Linguistic Resources to Medical Entity Recognition: a Supervised Morpho-syntactic Approach

机译:从语言资源到医学实体识别:监督的态度句法方法

获取原文

摘要

Due to the importance of the information it conveys, Medical Entity Recognition is one of the most investigated tasks in Natural Language Processing. Many researches have been aiming at solving the issue of Text Extraction, also in order to develop Decision Support Systems in the field of Health Care. In this paper, we propose a Lexicon-grammar method for the automatic extraction from raw texts of the semantic information referring to medical entities and, furthermore, for the identification of the semantic categories that describe the located entities. Our work is grounded on an electronic dictionary of neoclassical formative elements of the medical domain, an electronic dictionary of nouns indicating drugs, body parts and internal body parts and a grammar network composed of morphological and syntactical rules in the form of Finite-State Automata. The outcome of our research is an Extensible Markup Language (XML) annotated corpus of medical reports with information pertaining to the medical Diseases, Treatments, Tests, Symptoms and Medical Branches, which can be reused by any kind of machine learning tool in the medical domain.
机译:由于信息的重要性,医学实体识别是自然语言处理中最多的调查任务之一。许多研究旨在解决文本提取问题,也为了制定医疗保健领域的决策支持系统。在本文中,我们提出了一种词汇 - 语法方法,用于从指的语义信息的原始文本中提取对医学实体的原始文本,而且为了识别描述所在实体的语义类别。我们的作品基于电子域的新古典主义形成元素的电子词典,是名词的电子词典,指示药物,身体部位和内部身体部位以及由有限状态自动机形式的形态和句法规则组成的语法网络。我们的研究结果是一种可扩展的标记语言(XML)注释的医疗报告语料库,其信息与医学疾病,治疗,测试,症状和医疗分支有关,可以通过医学领域的任何类型的机器学习工具重复使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号