首页> 外文会议>International Conference on Advances in Computing, Communications and Informatics >A comparative study of segment representation for biomedical named entity recognition
【24h】

A comparative study of segment representation for biomedical named entity recognition

机译:生物医学名称实体识别分部代表的比较研究

获取原文

摘要

Biomedical Named Entity Recognition (Bio-NER) is an important subtask of Biomedical Text Mining (BioTM), where the performance of further tasks, such as relation extraction, protein-protein interaction and hypothesis generation depend on the performance of Bio-NER. Bio-NER involves determining the biomedical named entities, such as DNA, RNA, cell types, gene and protein present in the biomedical research articles. Annotating the dataset for training the classifier to recognize and classify named entities is the crucial task in BioNER. Segment representation (SR) is an efficient way of annotating Biomedical Named Entities (BioNEs) within a sentence to differentiate them from non-BioNEs. In this paper, we have used Support Vector Machines (SVMs) and Conditional Random fields (CRFs) to train different BioNER models with the benchmark JNLPBA 2004 and i2b2 2010 shared task dataset using different SRs. The performance of SR models shows that more complex the model worse performance of f-score.
机译:生物医学命名实体识别(Bio-ner)是生物医学文本挖掘(Biotm)的重要子任务,其中进一步任务的性能,例如关系提取,蛋白质 - 蛋白质相互作用和假设产生取决于生物内的性能。 Bio-ner涉及确定生物医学研究制品中存在的DNA,RNA,细胞类型,基因和蛋白质的生物医学命名实体。注释用于训练分类器以识别并对命名实体进行分类的数据集是均质中的重要任务。段表示(SR)是在句子中注释生物医学命名实体(二极管)的有效方式,以将它们与非直接区分化。在本文中,我们使用了使用不同SRS与基准JNLPBA 2004和I2B2 2010共享任务数据集培训不同的均衡器模型和条件随机字段(CRFS)和有条件的随机字段(CRFS)。 SR模型的性能显示,更复杂的模型对F分的性能更糟。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号