首页> 外文会议>International Conference on Web Research >A semantic ontology-based document organizer to cluster elearning documents
【24h】

A semantic ontology-based document organizer to cluster elearning documents

机译:基于语义本体的文档管理器,用于群集电子学习文档

获取原文

摘要

Document clustering is a useful technique to organize large sets of documents into meaningful groups. The usefulness is appreciated by labeling the clusters with relevant words that describe their associated documents. The traditional approach for document clustering, i.e. bag-of-words representation, often ignores the semantic relations between terms. Hence, ontology-based document clustering is proposed. In the context of e-Learning, the richer annotation of learning materials, via the use of appropriate ontologies is a way to deal with the reusability and remix of learning objects. Through providing a semantic infrastructure that will explicitly declare the semantics and relations between concepts used in labeling learning objects, the desired quality in the learning offer can be ensured. This paper proposes an ontology-based document clustering approach based on two-step clustering algorithm and compares its performance with the conventional clustering. Ontology is introduced through defining a weighting scheme that integrates traditional scheme, i.e. co-occurrences of words, with weights of relations between words in ontology. Our experimental evaluations are performed on ICVL (International Conference on Virtual Learning) paper collection as dataset with e-Learning domain ontology as the background knowledge. The ontology was implemented by us through a different research. The results show that inclusion of ontology increases the clustering quality.
机译:文档群集是将大集文档组织成有意义的群体的有用技术。通过将群集标记具有描述其相关文档的相关单词来欣赏的有用性。文档群集的传统方法,即文字袋式表示,通常忽略术语之间的语义关系。因此,提出了基于本体的文档聚类。在电子学习的背景下,通过使用适当的本体的学习材料的更丰富的注释是一种处理学习对象的可重用性和混音的方法。通过提供一个语义基础设施,将明确声明标记学习对象中使用的概念之间的语义和关系,可以确保学习报价中所需的质量。本文提出了一种基于两步聚类算法的本体文档聚类方法,并将其性能与传统聚类进行比较。通过定义整合传统方案的加权方案来引入本体论,即单词的共同发生,具有本体中文字之间的关系的重量。我们的实验评估在ICVL(虚拟学习国际会议)纸质集合中进行了与电子学习领域本体作为背景知识的数据集。本体学由我们通过不同的研究实施。结果表明,包含本体的含量增加了聚类质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号