Building a Diverse Document Leads Corpus Annotated with Semantic Relations

机译：建立带有语义关系的多样化文档线索语料库

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In these days, semantic analysis has been actively studied in natural language processing. For the study of semantic analysis, corpora with semantic annotations are essential. Although there are such corpora annotated on newspaper articles, there are various genres and styles, including linguistic expressions that are not found in newspaper articles. In this paper, we build a diverse document leads corpus annotated with semantic relations. To reduce the workload of annotators and annotate as many various documents as possible, we restrict the annotation target of each document to only the first three sentences. We have completed building a corpus of 1,000 documents and report the statistics of this corpus.

机译：如今，语义分析已在自然语言处理中得到了积极的研究。对于语义分析的研究，带有语义注释的语料库是必不可少的。尽管在报纸文章上标注了这样的语料库，但是存在各种流派和样式，包括报纸文章中没有的语言表达。在本文中，我们构建了一个带有语义关系注解的多样化文档线索语料库。为了减少注释者的工作量并尽可能多地注释各种文档，我们将每个文档的注释目标限制为仅前三个句子。我们已完成构建1000个文档的语料库，并报告该语料库的统计信息。

著录项

来源
《Pacific Asia Conference on Language, Information and Computation》|2012年|535-544|共10页
会议地点
作者
Masatsugu Hangyo; Daisuke Kawahara; Sadao Kurohashi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Building a semantically annotated corpus for chronic disease complications using two document types [J] . Noha Alnazzawi PLoS One . 2021,第3期

机译：使用两种文档类型构建用于慢性疾病并发症的语义注释的语料
2. A French clinical corpus with comprehensive semantic annotations: development of the Medical Entity and Relation LIMSI annotated Text corpus (MERLOT) [J] . Campillos Leonardo, Deleger Louise, Grouin Cyril, Language Resources and Evaluation . 2018,第2期

机译：具有全面语义注释的法语临床语料库：医学实体和关系LIMSI注释文本语料库（MERLOT）的开发
3. Building semantically annotated corpus for text classification of Indian defence news articles [J] . aurabh A. Kanekar, Alind Sharma, Gaurang S. Patkar, International Journal of Information Technology . 2021,第4期

机译：建立语义注释的印度国防新闻文本分类语料库
4. A Multi-level Annotated Corpus of Scientific Papers for Scientific Document Summarization and Cross-document Relation Discovery [C] . Ahmed AbuRaed, Horacio Saggion, Luis Chiruzzo International Conference on Language Resources and Evaluation . 2020

机译：科学文件摘要和跨文档关系发现的科学论文的多级注释语料库
5. Building High-frequency Word Lists for the Semantic Domain of ?āINA (‘land’) Using a Raw Corpus of Spoken ?ōlelo Hawai?i [D] . Brockway, Catherine Elizabeth Lee. 2021

机译：使用原始语料库的语义域构建高频词列表？lelo hawai？我
6. Building a semantically annotated corpus for chronic disease complications using two document types [O] . Noha Alnazzawi 2021

机译：使用两种文件类型构建用于慢性疾病并发症的语义注释的语料
7. Building a Diverse Document Leads Corpus Annotated with Semantic Relations [O] . Hangyo Masatsugu, Kawahara Daisuke, Kurohashi Sadao 2012

机译：建立带有语义关系的多样化文档线索语料库

Building a Diverse Document Leads Corpus Annotated with Semantic Relations

摘要

著录项

相似文献

相关主题

期刊订阅