首页> 外文期刊>Computational linguistics >OntoLearn Reloaded: A Graph-Based Algorithm for Taxonomy Induction
【24h】

OntoLearn Reloaded: A Graph-Based Algorithm for Taxonomy Induction

机译:重新加载了OntoLearn:分类学归纳的基于图的算法

获取原文
           

摘要

In 2004 we published in this journal an article describing OntoLearn, one of the first systems to automatically induce a taxonomy from documents and Web sites. Since then, OntoLearn has continued to be an active area of research in our group and has become a reference work within the community. In this paper we describe our next-generation taxonomy learning methodology, which we name OntoLearn Reloaded. Unlike many taxonomy learning approaches in the literature, our novel algorithm learns both concepts and relations entirely from scratch via the automated extraction of terms, definitions, and hypernyms. This results in a very dense, cyclic and potentially disconnected hypernym graph. The algorithm then induces a taxonomy from this graph via optimal branching and a novel weighting policy. Our experiments show that we obtain high-quality results, both when building brand-new taxonomies and when reconstructing sub-hierarchies of existing taxonomies.
机译:2004年,我们在该期刊上发表了一篇文章,描述了OntoLearn,这是从文档和网站自动引入分类法的首批系统之一。从那时起,OntoLearn一直是我们小组中活跃的研究领域,并已成为社区中的参考工作。在本文中,我们描述了下一代分类法学习方法,我们将其命名为OntoLearn Reloaded。与文献中的许多分类法学习方法不同,我们的新颖算法通过自动提取术语,定义和上位词从头开始学习概念和关系。这将导致非常密集,循环且可能断开的上位图。然后,该算法通过最佳分支和新颖的加权策略从该图中得出分类法。我们的实验表明,无论是构建全新的分类标准还是重构现有分类标准的子层次结构,我们都可以获得高质量的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号