RDF Data Clustering based on Resource and Predicate Embeddings

机译：基于资源和谓词嵌入的RDF数据群集

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the increasing amount of Linked Data on the Web in the past decade, there is a growing desire for machine learning community to bring this type of data into the fold. However, while Linked Data and Machine Learning have seen an explosive growth in popularity, relatively little attention has been paid in the literature to the possible union of both Linked Data and Machine Learning. The best way to collaborate these two fields is to focus on RDF data. After a thorough overview of Machine learning pipeline on RDF data, the paper presents an unsupervised feature extraction technique named Walks and two language modeling approaches, namely Word2vec and Doc2vec. In order to adapt the RDF graph to the clustering mechanism, we first applied the Walks technique on several sequences of entities by combining it with the Word2Vec approach. However, the application of the Doc2vec approach to a set of walks gives better results on two different datasets.

机译：随着在过去十年内的网络上越来越多的联系数据，对机器学习界的渴望越来越大，将这种类型的数据带入折叠中。然而，虽然联系数据和机器学习已经看到了普及的爆炸性增长，但在与联系数据和机器学习的可能联盟中，文献中的重视相对较少。协作这两个字段的最佳方法是专注于RDF数据。在RDF数据上彻底概述机器学习管道后，本文提出了一个名为Walks和两种语言建模方法的无监督功能提取技术，即Word2Vec和Doc2Vec。为了使RDF图进行调整到聚类机制，我们首先通过将其与Word2Vec方法组合来应用于几个实体序列上的步行技术。但是，将DOC2VEC方法应用于一组散步，在两个不同的数据集中提供更好的结果。

著录项

来源
《International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management》|2018年|1(CD-ROM)|共7页
会议地点
作者
Siham Eddamiri; El Moukhtar Zemmouri; Asmaa Benghabrit;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 G354-53;
关键词
Machine learning; Linked data; RDF; Clustering; Word2vec; Doc2vec; K-means;

机译：机器学习;链接数据;RDF;聚类;word2vec;doc2vec;K-means.;

相似文献

外文文献
中文文献
专利

1. Q-PD: query graph extension framework using predicate-based RDF on linked open data [J] . Jongmo Kim, Kunyoung Kim, Mye Sohn, International journal of web and grid services . 2020,第2期

机译：Q-PD：使用基于谓词的RDF在链接的开放数据上查询图形扩展框架
2. Towards a semantic medical Web: HealthCyberMap’s tool for building an RDF metadata base of health information resources based on the Qualified Dublin Core Metadata Set [J] . Maged N. Kamel Boulos, Abdul V. Roudsari, Ewart R. Carson Medical science monitor : . 2002,第9期

机译：迈向语义医学网络：HealthCyberMap的工具，用于基于合格的都柏林核心元数据集构建健康信息资源的RDF元数据库
3. Interactive search over Web scale RDF data using predicates as constraints [J] . Teng Mingyan, Zhu Guangtian Journal of Intelligent Information Systems . 2015,第3期

机译：使用谓词作为约束条件的Web规模RDF数据的交互式搜索
4. RDF Data Clustering based on Resource and Predicate Embeddings [C] . Siham Eddamiri, El Moukhtar Zemmouri, Asmaa Benghabrit International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management . 2018

机译：基于资源和谓词嵌入的RDF数据群集
5. Comparison of clustered RDF data stores [D] . Patchigolla, Venkata N. Ramarekha 2011

机译：集群RDF数据存储的比较
6. NCBI2RDF: Enabling Full RDF-Based Access to NCBI Databases [O] . Alberto Anguita, Miguel García-Remesal, Diana de la Iglesia, 2006

机译：NCBI2RDF：启用对NCBI数据库的基于RDF的完全访问
7. RDF2Vec: RDF graph embeddings for data mining [O] . Ristoski Petar, Paulheim Heiko 2016

机译：RDF2Vec：用于数据挖掘的RDF图嵌入
8. West Virginia US Department of Energy experimental program to stimulate competitive research. Section 2: Human resource development; Section 3: Carbon-based structural materials research cluster; Section 3: Data parallel algorithms for scientific computing [R] . 1994

机译：西弗吉尼亚州美国能源部实验计划，以刺激竞争研究。第2节：人力资源开发;第3节：碳基结构材料研究集群;第3节：科学计算的数据并行算法

RDF Data Clustering based on Resource and Predicate Embeddings

摘要

著录项

相似文献

相关主题

期刊订阅