首页> 外文会议>International Conference on Intelligent Systems for Molecular biology >GENIA corpus--a semantically annotated com for bio-textmining
【24h】

GENIA corpus--a semantically annotated com for bio-textmining

机译:Genia Corpus - 用于生物教学的语义注释COM

获取原文

摘要

Motivation: Natural language processing (NLP) methods are regarded as being useful to raise the potential of text mining from biological literature. The lack of an extensively annotated corpus of this literature, however, causes a major bottleneck forapplying NLP techniques. GENIA corpus is being developed to provide reference materials to let NLP techniques work for bio-textmining. Results: GENIA corpus version 3.0 consisting of 2000 MEDLINE abstracts has been released with more than 400000 words and almost 100000 annotations for biological terms.
机译:动机:自然语言处理(NLP)方法被认为是有助于提高生物文学中的文本挖掘的潜力。然而,缺乏这种文献的广泛注释的语料库,导致主要的瓶颈面向NLP技术。正在开发Genia Corpus以提供参考资料,以便让NLP技术为生物教育工作。结果:Genia Corpus版本3.0由2000个Medline摘要组成,已释放超过40万字,近100000个生物术语注释。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号