Conference: Pacific Asia Conference on Language, Information and Computation

Building a Diverse Document Leads Corpus Annotated with Semantic Relations

Abstract

In recent years, semantic analysis has been actively studied in natural language processing. Corpora with semantic annotations are essential to this research. Although such corpora exist for newspaper articles, text comes in many other genres and styles, which include linguistic expressions not found in newspaper articles. In this paper, we build a diverse document leads corpus annotated with semantic relations. To reduce the workload of annotators and to annotate as wide a variety of documents as possible, we restrict the annotation target of each document to its first three sentences. We have completed a corpus of 1,000 documents and report its statistics.