首页> 外文期刊>Science of Computer Programming >A domain-independent process for automatic ontology population from text
【24h】

A domain-independent process for automatic ontology population from text

机译:来自文本的自动本体填充的域独立过程

获取原文
获取原文并翻译 | 示例
           

摘要

Ontology Population looks for instantiating the constituent elements of an ontology, like properties and non-taxonomic relationships. Manual population by domain experts and knowledge engineers is an expensive and time consuming task. Fast ontology population is critical for the success of knowledge-based applications. Thus, automatic or semi-automatic approaches are needed. This work proposes a generic process approaching the Automatic Ontology Population problem by specifying its phases and the techniques used to perform the activities on each phase. The main contribution of the work here described is a domain-independent process for the automatic population of ontologies from text that applies natural language processing and information extraction techniques to acquire and classify ontology instances. This is a new approach for automatic ontology population that uses an ontology to automatically generate rules to extract instances from text and classify them in ontology classes. These rules can be generated from ontologies of any domain, making the proposed process domain-independent and therefore, allowing the instantiation of ontologies quickly and at a low cost Four experiments using a legal and a tourism corpora were conducted in order to evaluate the proposed process. Results indicate that this approach can extract and classify instances with high effectiveness with the additional advantage of domain independence. Some techniques representing the state of the art of this field are also described along with the solutions they adopt for each phase of the Automatic Ontology Population process with their advantages and limitations.
机译:本体论人口寻求实例化本体论的组成元素,例如属性和非分类关系。领域专家和知识工程师进行手动填充是一项昂贵且耗时的任务。快速的本体填充对于基于知识的应用程序的成功至关重要。因此,需要自动或半自动方法。这项工作提出了一个通用流程,通过指定其阶段以及在每个阶段执行活动的技术来解决自动本体人口问题。这里描述的工作的主要贡献是从文本自动填充本体的领域独立过程,该过程应用自然语言处理和信息提取技术来获取和分类本体实例。这是一种用于自动本体填充的新方法,该方法使用本体自动生成规则以从文本中提取实例并将其分类为本体类。这些规则可以从任何领域的本体中生成,从而使所提出的过程与领域无关,因此可以低成本快速地实例化本体进行了四个使用法律和旅游语料库的实验,以评估所提出的过程。结果表明,该方法可以有效地提取和分类实例,并具有域独立性的其他优势。还描述了一些代表该领域最新技术的技术,以及它们为自动本体填充过程的每个阶段所采用的解决方案,以及它们的优点和局限性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号