首页> 外文期刊>International Journal of Nuclear Knowledge Management >A semi-automatic method for extracting a taxonomy for nuclear knowledge using hierarchical document clustering based on concept sets
【24h】

A semi-automatic method for extracting a taxonomy for nuclear knowledge using hierarchical document clustering based on concept sets

机译:使用基于概念集的分层文档聚类提取核知识分类的半自动方法

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we present a text mining approach for the semiautomatic extraction of taxonomy of concepts for nuclear knowledge and evaluate the achievable results. Taxonomies are a fundamental part of any knowledge management strategy or framework. We propose a method for hierarchical document clustering based on the notion of frequent concept sets. Most clustering algorithms treat documents as a bag of words and bypass the important relationships between words, such as synonyms. In this method, we consider the semantic relationship between words and use a domain thesaurus (ETDE/INIS) to identify concepts. To validate the method, we conducted a case study in which we implemented a prototype, generating a taxonomy for nuclear knowledge with the goal of conceptually mapping the scientific production of the Brazilian Nuclear Energy Commission (CNEN).
机译:在本文中,我们提出了一种文本挖掘方法,用于半自动提取核知识概念的分类法,并评估可实现的结果。分类法是任何知识管理策略或框架的基本组成部分。我们提出了一种基于频繁概念集概念的分层文档聚类方法。大多数聚类算法将文档视为一袋单词,并绕过单词之间的重要关系,例如同义词。在这种方法中,我们考虑单词之间的语义关系,并使用领域词库(ETDE / INIS)来识别概念。为了验证该方法,我们进行了一个案例研究,其中实施了一个原型,生成了核知识分类法,目的是从概念上绘制巴西核能委员会(CNEN)的科学生产图。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号