首页> 外文会议>Intelligent tools for building a scientific information platform >Semantic Clustering of Scientific Articles with Use of DBpedia Knowledge Base
【24h】

Semantic Clustering of Scientific Articles with Use of DBpedia Knowledge Base

机译:利用DBpedia知识库对科学文章进行语义聚类

获取原文
获取原文并翻译 | 示例

摘要

A case study of semantic clustering of scientific articles related to Rough Sets is presented. The proposed method groups the documents on the basis of their content and with assistance of DBpedia knowledge base. The text corpus is first treated with Natural Language Processing tools in order to produce vector representations of the content and then matched against a collection of concepts retrieved from DBpedia. As a result, a new representation is constructed that better reflects the semantics of the texts. With this new representation, the documents are hierarchically clustered in order to form partition of papers that share semantic relatedness. The steps in textual data preparation, utilization of DBpedia and clustering are explained and illustrated with experimental results. Assessment of clustering quality by human experts and by comparison to traditional approach is presented.
机译:提出了与粗糙集相关的科学文章语义聚类的案例研究。所提出的方法根据文档的内容并在DBpedia知识库的帮助下对文档进行分组。首先使用自然语言处理工具处理文本语料库,以产生内容的矢量表示,然后将其与从DBpedia检索的概念集合进行匹配。结果,构建了一个新的表示形式,可以更好地反映文本的语义。通过这种新的表示形式,文档可以按层次结构进行聚类,以形成共享语义相关性的论文分区。实验结果说明并说明了文本数据准备,DBpedia利用和聚类的步骤。介绍了人类专家对聚类质量的评估,并与传统方法进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号