首页> 外文会议>Workshop on building and evaluating resources for biomedical text mining >A Corpus of Tables in Full-Text Biomedical Research Publications
【24h】

A Corpus of Tables in Full-Text Biomedical Research Publications

机译:全文生物医学研究出版物中的表格

获取原文

摘要

The development of text mining techniques for biomedical research literature has received increased attention in recent times. However, most of these techniques focus on prose, while much important biomedical data reside in tables. In this paper, we present a corpus created to serve as a gold standard for the development and evaluation of techniques for the automatic extraction of information from biomedical tables. We describe the guidelines used for corpus annotation and the manner in which they were developed. The high inter-annotator agreement achieved on the corpus, and the generic nature of our annotation approach, suggest that the developed guidelines can serve as a general framework for table annotation in biomedical and other scientific domains.
机译:生物医学研究文献的文本挖掘技术的发展在近期得到了更多的关注。然而,这些技术中的大多数都侧重于散文,而许多重要的生物医学数据驻留在表中。在本文中,我们提出了一种创建的语料库,作为一种用于开发和评估从生物医学表的自动提取信息的技术的黄金标准。我们描述了用于语料库注释的指导方针以及它们的开发方式。在语料库上实现的高度注释协议以及我们的注释方法的通用性表明,开发的指导方针可以作为生物医学和其他科学域中的表注释的一般框架。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号