首页> 外文会议>Workshop on building and evaluating resources for biomedical text mining >A Corpus of Tables in Full-Text Biomedical Research Publications

【24h】

A Corpus of Tables in Full-Text Biomedical Research Publications

机译：全文生物医学研究出版物中的表格

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The development of text mining techniques for biomedical research literature has received increased attention in recent times. However, most of these techniques focus on prose, while much important biomedical data reside in tables. In this paper, we present a corpus created to serve as a gold standard for the development and evaluation of techniques for the automatic extraction of information from biomedical tables. We describe the guidelines used for corpus annotation and the manner in which they were developed. The high inter-annotator agreement achieved on the corpus, and the generic nature of our annotation approach, suggest that the developed guidelines can serve as a general framework for table annotation in biomedical and other scientific domains.

机译：生物医学研究文献的文本挖掘技术的发展在近期得到了更多的关注。然而，这些技术中的大多数都侧重于散文，而许多重要的生物医学数据驻留在表中。在本文中，我们提出了一种创建的语料库，作为一种用于开发和评估从生物医学表的自动提取信息的技术的黄金标准。我们描述了用于语料库注释的指导方针以及它们的开发方式。在语料库上实现的高度注释协议以及我们的注释方法的通用性表明，开发的指导方针可以作为生物医学和其他科学域中的表注释的一般框架。

著录项

来源
《Workshop on building and evaluating resources for biomedical text mining 》|2016年|xi 142 p.|共10页
会议地点
作者
Tatyana Shmanina; Ingrid Zukerman; Ai Lee Cheam; Thomas Bochynek; Lawrence Cavedon;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程 ;
关键词

相似文献

外文文献
中文文献
专利

1. GNI Corpus Version 1.0: Annotated Full-Text Corpus of Genomics & Informatics to Support Biomedical Information Extraction [J] . So-Yeon Oh, Ji-Hyeon Kim, Seo-Jin Kim, Genomics & Informatics . 2018 ,第3期

机译：GNI语料库版本1.0：带有基因组学和信息学的注释全文语料库，以支持生物医学信息提取
2. A corpus of full-text journal articles is a robust evaluation tool for revealing differences in performance of biomedical natural language processing tools [J] . Karin Verspoor, Kevin B Cohen, Arrick Lanfranchi, BMC Bioinformatics . 2012 ,第1期

机译：全文期刊文章集是一种强大的评估工具，可揭示生物医学自然语言处理工具的性能差异
3. Distribution of information in biomedical abstracts and full-text publications [J] . Schuemie MJ, Weeber M, Schijvenaars BJA, Bioinformatics . 2004 ,第16期

机译：在生物医学摘要和全文出版物中分发信息
4. A Corpus of Tables in Full-Text Biomedical Research Publications [C] . Tatyana Shmanina, Ingrid Zukerman, Ai Lee Cheam, Fifth workshop on building and evaluating resources for biomedical text mining . 2016

机译：全文生物医学研究出版物中的表集
5. Annotating a corpus of biomedical research texts: Two models of rhetorical analysis. [D] . White, Barbara Ellen. 2010

机译：注释生物医学研究文献集：修辞分析的两种模型。
6. GNI Corpus Version 1.0: Annotated Full-Text Corpus of Genomics Informatics to Support Biomedical Information Extraction [O] . So-Yeon Oh, Ji-Hyeon Kim, Seo-Jin Kim, 2018

机译：GNI语料库版本1.0：带注释的基因组学和信息学全文语料库支持生物医学信息提取
7. A corpus of full-text journal articles is a robust evaluation tool for revealing differences in performance of biomedical natural language processing tools [O] . Karin Verspoor, Kevin Cohen, Arrick Lanfranchi, 2012

机译：全文期刊文章集是一种强大的评估工具，可揭示生物医学自然语言处理工具的性能差异
8. Scholarly Electronic Full-Text Publications via the Internet: Issues and Impacts [R] . Kosmin, Linda J. 1999

机译：通过互联网的学术电子全文出版物：问题和影响

A Corpus of Tables in Full-Text Biomedical Research Publications

摘要

著录项

相似文献

相关主题

期刊订阅