首页> 外文期刊>Education for Information >Building concept maps by adapting semantic distance metrics to Wikipedia
【24h】

Building concept maps by adapting semantic distance metrics to Wikipedia

机译:通过使语义距离度量适应维基百科来构建概念图

获取原文
获取原文并翻译 | 示例
       

摘要

Building and checking concept maps is an active research topic in visual learning. Concept maps are intended to show visual representations of interrelated concepts in educational and professional settings. For the last decades, numerous formulas have been proposed to compute the semantic proximity between any pair of concepts in the map. A review of the employment of semantic distances in concept map construction shows the lack of a clear criterion to select a suitable formula. Traditional metrics can be basically grouped depending on the representation of their knowledge source: statistic approaches based on co-occurrence of words in big corpora; path-based methods using lexical structures, like taxonomies; and multi-source methods which combine statistic approaches and path-based methods. On the one hand, path-based measures give better results than corpora-based metrics, but they cannot be used to process specific concepts or proper nouns due to the limited vocabulary of the taxonomies used. On the other side, information obtained from big corpora – including the World Wide Web – is not organized in a specific way and natural language processing techniques are usually needed in order to obtain acceptable results. In this research Wikipedia is proposed since it does not have such limitations. This article defines an approach to adapt path-based semantic similarity measures to Wikipedia for building concept maps. Experimental evaluation with a well-known set of human similarity judgments shows that the Wikipedia adapted metrics obtains equal or even better results when compared with the non-adapted approaches.
机译:建立和检查概念图是视觉学习中活跃的研究主题。概念图旨在显示教育和专业环境中相互关联的概念的视觉表示。在过去的几十年中,已经提出了许多公式来计算地图中任何一对概念之间的语义接近度。对概念图构造中语义距离使用的回顾表明,缺乏选择合适公式的明确标准。传统的度量标准基本上可以根据其知识来源的表示进行分组:基于大语料库中单词共现的统计方法;使用词汇结构(例如分类法)的基于路径的方法;以及将统计方法与基于路径的方法相结合的多源方法。一方面,基于路径的度量比基于语料库的度量具有更好的结果,但是由于所用分类法的词汇量有限,因此它们不能用于处理特定概念或专有名词。另一方面,从大型语料库(包括万维网)获得的信息不是以特定的方式组织的,通常需要自然语言处理技术才能获得可接受的结果。这项研究提出了维基百科,因为它没有这种限制。本文定义了一种方法,可以使基于路径的语义相似性度量适应Wikipedia来构建概念图。使用一组著名的人类相似性判断进行的实验评估表明,与未经适应的方法相比,采用Wikipedia的度量标准可以获得相同甚至更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号