首页> 外文会议>LREC-2012 >Corpus+WordNet Thesaurus Generation for Ontology Enriching
【24h】

Corpus+WordNet Thesaurus Generation for Ontology Enriching

机译:语料库+ Wordnet词库生成本体丰富

获取原文

摘要

This paper presents a model to enrich an ontology with a thesaurus based on a domain corpus and WordNet. The model is applied to the data privacy domain and the initial domain resources comprise a data privacy ontology, a corpus of privacy laws, regulations and guidelines for projects. Based on these resources, a thesaurus is automatically generated. The thesaurus seeds are composed by the ontology concepts. For these seeds similar terms are extracted from the corpus using known thesaurus generation methods. A filtering process searches for semantic relations between seeds and similar terms within WordNet. As a result, these semantic relations are used to expand the ontology with relations between them and related terms in the corpus. The resulting resource is a hierarchical structure that can help on the ontology investigation and maintenance. The results allow the investigation of the domain knowledge with the support of semantic relations not present on the original ontology.
机译:本文提出了一种基于域语法和Wordnet的中征与叙词的本体的模型。该模型应用于数据隐私域,初始域资源包括数据隐私本体,隐私法,项目条例和项目指南。基于这些资源,自动生成一个词库。词库种子由本体概念组成。对于这些种子,使用已知的词库生成方法,从语料库中提取类似术语。过滤过程搜索种子和WordNet中类似术语之间的语义关系。因此,这些语义关系用于扩展本体论与语料库之间的关系和相关术语的关系。生成的资源是一个层次结构,可以帮助本体调查和维护。结果允许通过对原始本体上不存在的语义关系的支持来调查域知识。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号