Sentence Role Identification in Medline Abstracts: Training Classifier with Structured Abstracts

机译：Medline文摘中的句子角色识别：使用结构化文摘训练分类器

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The abstract of a scientific paper typically consists of sentences describing the background of study, its objective, experimental method and results, and conclusions. We discuss the task of identifying which of these "structural roles" each sentence in abstracts plays, with a particular focus on its application in building a literature retrieval system. By annotating sentences in an abstract collection with role labels, we can build a literature retrieval system in which users can specify the roles of the sentences in which query terms should be sought. We argue that this facility enables more goal-oriented search, and also makes it easier to narrow down search results when adding extra query terms does not work. To build such a system, two issues need to be addressed: (1) how we should determine the set of structural roles presented to users from which they can choose the target search area, and (2) how we should classify each sentence in abstracts by their structural roles, without relying too much on human supervision. We view the task of role identification as that of text classification based on supervised machine learning. Our approach is characterized by the use of structured abstracts for building training data. In structured abstracts, which is a format of abstracts popular in biomedical domains, sections are explicitly marked with headings indicating their structural roles, and hence they provide us with an inexpensive way to collect training data for sentence classifiers. Statistics on the structured abstracts contained in Medline give an insight on determining the set of sections to be presented to users as well.

机译：科学论文的摘要通常由描述研究背景，研究目的，实验方法和结果以及结论的句子组成。我们讨论了确定摘要中每个句子在这些“结构性角色”中扮演哪个角色的任务，并特别关注其在构建文献检索系统中的应用。通过用角色标签注释抽象集合中的句子，我们可以构建一个文献检索系统，用户可以在其中指定应在其中查询查询词的句子的角色。我们认为，此功能可以实现更多面向目标的搜索，并且在添加额外的查询词不起作用时，还可以更轻松地缩小搜索范围。要构建这样的系统，需要解决两个问题：（1）我们应该如何确定呈现给用户的结构角色集，以便他们可以从中选择目标搜索区域；（2）我们应该如何对摘要中的每个句子进行分类通过它们的结构作用，而无需过多地依靠人工监督。我们将角色识别的任务视为基于监督机器学习的文本分类任务。我们的方法的特点是使用结构化摘要来构建培训数据。在结构化摘要中，该摘要是在生物医学领域中流行的摘要格式，在各节中明确标有表明其结构作用的标题，因此它们为我们提供了一种廉价的方式来收集句子分类器的训练数据。 Medline中包含的结构化摘要的统计信息还可以帮助您确定要呈现给用户的部分集。

著录项

来源
《International Workshop on Active Mining(AM 2003); 20031028; Maebashi(JP)》|2003年|P.236-254|共19页
会议地点 Maebashi(JP)
作者
Masashi Shimbo; Takahiro Yamasaki; Yuji Matsumoto;
展开▼
作者单位

Graduate School of Information Science, Nara Institute of Science and Technology, 8916-5 Takayama, Ikoma, Nara 630-0192, Japan;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类数据处理、数据处理系统;
关键词
medline; structured abstracts; information retrieval; text classification;

机译：医学，结构化摘要，信息检索，文本分类;

相似文献

外文文献
中文文献
专利

1. A deep learning classifier for sentence classification in biomedical and computer science abstracts [J] . Neural computing & applications . 2020,第11期

机译：生物医学与计算机科学句子句子分类的深层学习分类器
2. A retrospective cohort study of structured abstracts in MEDLINE, 1992–2006 [J] . Anna M Ripple Bulletin of the Medical Library Association. . 2011,第2期

机译：1992–2006年在MEDLINE中对结构性摘要进行的回顾性队列研究
3. A retrospective cohort study of structured abstracts in MEDLINE, 1992-2006. [J] . Ripple AM, Mork JG, Knecht LS, Journal of the Medical Library Association : . 2011,第2期

机译：1992-2006年在MEDLINE中对结构性摘要进行的回顾性队列研究。
4. Sentence Role Identification in Medline Abstracts: Training Classifier with Structured Abstracts [C] . Masashi Shimbo, Takahiro Yamasaki, Yuji Matsumoto International Workshop on Active Mining . 2005

机译：Medline摘要中的句子角色识别：带有结构化摘要的培训分类器
5. The role of abstract lexical structure in first language attrition: Germans in America. [D] . Gross, Steven. 2000

机译：抽象词汇结构在母语损耗中的作用：美国的德国人。
6. Automatic Summarization of Mouse Gene Information by Clustering and Sentence Extraction from MEDLINE Abstracts [O] . Jianji Yang, Aaron M. Cohen, William Hersh 2007

机译：通过MEDLINE摘要的聚类和句子提取自动总结小鼠基因信息
7. MINING MEDLINE: ABSTRACTS, SENTENCES, OR PHRASES? [O] . J. Ding A, D. Berleant A, D. Nettleton B, 2013

机译：采矿mEDLINE：摘要，句子或短语？
8. Finding Functionally Related Genes by Local and Global Analysis of MEDLINE Abstracts. [R] . Nakken, S., Kauffman, C., Karypis, G. 2004

机译：通过mEDLINE摘要的局部和全局分析寻找功能相关基因。

Sentence Role Identification in Medline Abstracts: Training Classifier with Structured Abstracts

摘要

著录项

相似文献

相关主题

期刊订阅