首页> 外文期刊>Journal of Biomedical Semantics >Linguistic scope-based and biological event-based speculation and negation annotations in the BioScope and Genia Event corpora
【24h】

Linguistic scope-based and biological event-based speculation and negation annotations in the BioScope and Genia Event corpora

机译:BioScope和Genia Event语料库中基于语言范围和基于生物事件的推测和否定注释

获取原文
           

摘要

Background The treatment of negation and hedging in natural language processing has received much interest recently, especially in the biomedical domain. However, open access corpora annotated for negation and/or speculation are hardly available for training and testing applications, and even if they are, they sometimes follow different design principles. In this paper, the annotation principles of the two largest corpora containing annotation for negation and speculation – BioScope and Genia Event – are compared. BioScope marks linguistic cues and their scopes for negation and hedging while in Genia biological events are marked for uncertainty and/or negation. Results Differences among the annotations of the two corpora are thematically categorized and the frequency of each category is estimated. We found that the largest amount of differences is due to the issue that scopes – which cover text spans – deal with the key events and each argument (including events within events) of these events is under the scope as well. In contrast, Genia deals with the modality of events within events independently. Conclusions The analysis of multiple layers of annotation (linguistic scopes and biological events) showed that the detection of negation/hedge keywords and their scopes can contribute to determining the modality of key events (denoted by the main predicate). On the other hand, for the detection of the negation and speculation status of events within events, additional syntax-based rules investigating the dependency path between the modality cue and the event cue have to be employed.
机译:背景技术最近,在自然语言处理中对否定和套期的处理引起了人们的极大兴趣,特别是在生物医学领域。但是,标注为否定和/或推测的开放式访问语料库几乎无法用于培训和测试应用程序,即使存在,它们有时也会遵循不同的设计原则。在本文中,比较了两个包含否定和推测注释的最大语料库的注释原理– BioScope和Genia Event –。 BioScope标记了语言暗示及其否定和对冲的范围,而在Genia中,生物学事件则被标记为不确定和/或否定。结果对两个语料库的注释之间的差异进行了主题分类,并估计了每个类别的频率。我们发现,差异最大的原因是作用域(涵盖文本范围)处理关键事件,而这些事件的每个自​​变量(包括事件内的事件)也在范围内。相反,Genia独立地处理事件内事件的形式。结论对注释的多层层次(语言范围和生物事件)的分析表明,否定/对冲关键字及其范围的检测可以有助于确定关键事件的方式(由主谓词表示)。另一方面,为了检测事件中事件的否定和推测状态,必须采用调查句法提示与事件提示之间的依赖路径的其他基于语法的规则。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号