首页> 外文期刊>Natural Computing >A hybrid approach for learning concept hierarchy from Malay text using artificial immune network
【24h】

A hybrid approach for learning concept hierarchy from Malay text using artificial immune network

机译:一种使用人工免疫网络从马来语文本中学习概念层次的混合方法

获取原文
获取原文并翻译 | 示例
       

摘要

A concept hierarchy is an integral part of an ontology but it is expensive and time consuming to build. Motivated by this, many unsupervised learning methods have been proposed to (semi-) automatically develop a concept hierarchy. A significant work is the Guided Agglomerative Hierarchical Clustering (GAHC) which relies on linguistic patterns (i.e., hypernyms) to guide the clustering process. However, GAHC still relies on contextual features to build the concept hierarchy, thus data sparsity still remains an issue in GAHC. Artificial Immune Systems are known for robustness, noise tolerance and adaptability. Thus, an extension to the GAHC is proposed by hybridizing it with Artificial Immune Network (aiNet) which we call Guided Clustering and aiNet for Learning Concept Hierarchy (GCAINY). In this paper, we have tested GCAINY using two parameter settings. The first parameter setting is obtained from the literature as a baseline parameter setting and second is by automatic parameter tuning using Particle Swarm Optimization (PSO). The effectiveness of the GCAINY is evaluated on three data sets. For further validations, a comparison between GCAINY and GAHC has been conducted and with statistical tests showing that GCAINY increases the quality of the induced concept hierarchy. The results reveal that the parameters value found by using PSO significant produce better concept hierarchy than the vanilla parameter. Thus it can be concluded th the proposed approach has greater ability to be used in the field of ontology learning.
机译:概念层次结构是本体的有机组成部分,但是构建起来既昂贵又耗时。因此,提出了许多无监督的学习方法来(半)自动开发概念层次。指导性聚集层次聚类(GAHC)是一项重要的工作,它依靠语言模式(即上位词)来指导聚类过程。但是,GAHC仍然依靠上下文功能来构建概念层次结构,因此数据稀疏性仍然是GAHC中的一个问题。人工免疫系统以其坚固性,噪声耐受性和适应性而闻名。因此,通过将GAHC与人工免疫网络(aiNet)混合来提出对GAHC的扩展,我们将其称为引导聚类和用于学习概念层次结构的aiNet(GCAINY)。在本文中,我们使用两个参数设置测试了GCAINY。第一个参数设置是从文献中获得的基线参数设置,第二个是使用粒子群优化(PSO)通过自动参数调整获得的。 GCAINY的有效性通过三个数据集进行评估。为了进行进一步的验证,已经对GCAINY和GAHC进行了比较,并进行了统计测试,结果表明GCAINY提高了所引入概念层次的质量。结果表明,使用PSO显着性得到的参数值比普通参数产生更好的概念层次。因此可以得出结论,所提出的方法具有更大的能力用于本体学习领域。

著录项

  • 来源
    《Natural Computing》 |2011年第1期|p.275-304|共30页
  • 作者单位

    Data Mining and Optimization Research Group, Centre for Artificial Information Technology,Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia,43650 Bangi, Selangor, Malaysia,Soft Computing Research Group, Faculty of Computer Science and Information System,Universiti Teknologi Malaysia, 81100 Skudai, Johor, Malaysia;

    Soft Computing Research Group, Faculty of Computer Science and Information System,Universiti Teknologi Malaysia, 81100 Skudai, Johor, Malaysia;

    Data Mining and Optimization Research Group, Centre for Artificial Information Technology,Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia,43650 Bangi, Selangor, Malaysia;

    Data Mining and Optimization Research Group, Centre for Artificial Information Technology,Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia,43650 Bangi, Selangor, Malaysia;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    artificial immune system; immune network; machine learning; malay language; ontology learning; automatic taxonomy induction;

    机译:人工免疫系统;免疫网络机器学习马来语本体学习;自动分类学归纳;
  • 入库时间 2022-08-17 13:35:22

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号