...
首页> 外文期刊>Journal of biomedical informatics. >Biomedical named entity recognition using two-phase model based on SVMs.
【24h】

Biomedical named entity recognition using two-phase model based on SVMs.

机译:使用基于SVM的两阶段模型进行生物医学命名实体识别。

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Named entity (NE) recognition has become one of the most fundamental tasks in biomedical knowledge acquisition. In this paper, we present a two-phase named entity recognizer based on SVMs, which consists of a boundary identification phase and a semantic classification phase of named entities. When adapting SVMs to named entity recognition, the multi-class problem and the unbalanced class distribution problem become very serious in terms of training cost and performance. We try to solve these problems by separating the NE recognition task into two subtasks, where we use appropriate SVM classifiers and relevant features for each subtask. In addition, by employing a hierarchical classification method based on ontology, we effectively solve the multi-class problem concerning semantic classification. The experimental results on the GENIA corpus show that the proposed method is effective not only in reducing computational cost but also in improving performance. The F-score (beta=1) for the boundary identification is 74.8 and the F-score for the semantic classification is 66.7.
机译:命名实体(NE)的识别已成为生物医学知识获取中最基本的任务之一。在本文中,我们提出了一种基于支持向量机的两阶段命名实体识别器,它由命名实体的边界识别阶段和语义分类阶段组成。当使支持向量机适应命名实体识别时,在训练成本和性能方面,多类别问题和不平衡类别分布问题变得非常严重。我们尝试通过将NE识别任务分为两个子任务来解决这些问题,其中我们对每个子任务使用适当的SVM分类器和相关功能。另外,通过采用基于本体的层次分类方法,有效解决了语义分类中的多类问题。在GENIA语料库上的实验结果表明,该方法不仅有效降低了计算成本,而且提高了性能。边界标识的F分数(beta = 1)为74.8,语义分类的F分数为66.7。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号