【24h】

RDF Data Clustering based on Resource and Predicate Embeddings

机译:基于资源和谓词嵌入的RDF数据群集

获取原文

摘要

With the increasing amount of Linked Data on the Web in the past decade, there is a growing desire for machine learning community to bring this type of data into the fold. However, while Linked Data and Machine Learning have seen an explosive growth in popularity, relatively little attention has been paid in the literature to the possible union of both Linked Data and Machine Learning. The best way to collaborate these two fields is to focus on RDF data. After a thorough overview of Machine learning pipeline on RDF data, the paper presents an unsupervised feature extraction technique named Walks and two language modeling approaches, namely Word2vec and Doc2vec. In order to adapt the RDF graph to the clustering mechanism, we first applied the Walks technique on several sequences of entities by combining it with the Word2Vec approach. However, the application of the Doc2vec approach to a set of walks gives better results on two different datasets.
机译:随着在过去十年内的网络上越来越多的联系数据,对机器学习界的渴望越来越大,将这种类型的数据带入折叠中。然而,虽然联系数据和机器学习已经看到了普及的爆炸性增长,但在与联系数据和机器学习的可能联盟中,文献中的重视相对较少。协作这两个字段的最佳方法是专注于RDF数据。在RDF数据上彻底概述机器学习管道后,本文提出了一个名为Walks和两种语言建模方法的无监督功能提取技术,即Word2Vec和Doc2Vec。为了使RDF图进行调整到聚类机制,我们首先通过将其与Word2Vec方法组合来应用于几个实体序列上的步行技术。但是,将DOC2VEC方法应用于一组散步,在两个不同的数据集中提供更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号