首页> 外文会议>LREC-2012 >Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions
【24h】

Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions

机译:用细粒度语义功能构建无限制的无限用具的语料

获取原文

摘要

Natural languages possess a wealth of indefinite forms that typically differ in distribution and interpretation. Although formal semanticists have strived to develop precise meaning representations for different indefinite functions, to date there has hardly been any corpus work on the topic. In this paper, we present the results of a small corpus study where English indefinite forms any and some were labelled with fine-grained semantic functions well-motivated by typological studies. We developed annotation guidelines that could be used by non-expert annotators and calculated inter-annotator agreement amongst several coders. The results show that the annotation task is hard, with agreement scores ranging from 52% to 62% depending on the number of functions considered, but also that each of the independent annotations is in accordance with theoretical predictions regarding the possible distributions of indefinite functions. The resulting annotated corpus is available upon request and can be accessed through a searchable online database.
机译:自然语言具有丰富的无限形式,通常在分配和解释方面差异。迄今为止,正式的语义主人竞争发展不同的无限职能意义表征,迄今为止几乎没有任何关于该主题的语料库。在本文中,我们介绍了一个小语料库研究的结果,其中英语无限期形式的任何形式,有些是用类型学研究的细粒度的语义功能标记。我们开发了非专家注释器可以使用的注释指南,并在几个编码器之间计算了共注入者协议。结果表明,注释任务很难,协议分数范围从52%到62%,具体取决于所考虑的职能数量,但也是每个独立的注释都符合关于可能的无限职能分布的理论预测。生成的注释语料库可根据要求提供,可以通过可搜索的在线数据库访问。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号