Unsupervised Learning of Link Discovery Configuration

Abstract

Discovering links between overlapping datasets on the Web is generally realised through the use of fuzzy similarity measures. Configuring such measures is often a non-trivial task that depends on the domain, ontological schemas, and formatting conventions in data. Existing solutions either rely on the user's knowledge of the data and the domain or on the use of machine learning to discover these parameters based on training data. In this paper, we present a novel approach to tackle the issue of data linking which relies on the unsupervised discovery of the required similarity parameters. Instead of using labeled data, the method takes into account several desired properties which the distribution of output similarity values should satisfy. The method incorporates these properties into a fitness criterion used in a genetic algorithm to establish similarity parameters that maximise the quality of the resulting linkset according to the considered properties. We show in experiments using benchmarks as well as real-world datasets that such an unsupervised method can reach the same levels of performance as manually engineered methods, and we examine how the different parameters of the genetic algorithm and the fitness criterion affect the results for different datasets.
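
The idea in the abstract (a genetic algorithm searching for similarity parameters, scored by a fitness criterion that needs no labeled links) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the toy records, the weighted string similarity, the GA operators, and in particular the fitness function, which assumes that a good linkset is close to one-to-one and covers most records. The names `pseudo_precision` and `pseudo_recall` are hypothetical labels for those two proxy measures.

```python
import random
from difflib import SequenceMatcher

# Toy records (name, year); hypothetical data, not from the paper.
SOURCE = [("Andriy Nikolov", "2012"), ("Enrico Motta", "2011"),
          ("Mathieu d'Aquin", "2010")]
TARGET = [("A. Nikolov", "2012"), ("E. Motta", "2011"),
          ("M. d'Aquin", "2010"), ("E. Motta", "1999")]


def string_sim(a, b):
    """Fuzzy string similarity in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def similarity(src, tgt, weights):
    """Weighted average of per-field similarities."""
    total = sum(weights)
    if total == 0:
        return 0.0
    return sum(w * string_sim(s, t)
               for w, s, t in zip(weights, src, tgt)) / total


def links(config):
    """config = [field weights..., threshold]; returns accepted (i, j) pairs."""
    *weights, threshold = config
    return [(i, j)
            for i, s in enumerate(SOURCE)
            for j, t in enumerate(TARGET)
            if similarity(s, t, weights) >= threshold]


def fitness(config):
    """Unsupervised proxy score (an assumption of this sketch):
    reward linksets that are close to one-to-one and cover many records."""
    found = links(config)
    if not found:
        return 0.0
    matched = min(len({i for i, _ in found}), len({j for _, j in found}))
    pseudo_precision = matched / len(found)
    pseudo_recall = matched / min(len(SOURCE), len(TARGET))
    return (2 * pseudo_precision * pseudo_recall
            / (pseudo_precision + pseudo_recall))


def evolve(generations=30, pop_size=20, mutation_rate=0.3, seed=0):
    """Simple elitist GA with one-point crossover and Gaussian mutation."""
    rng = random.Random(seed)
    n_genes = len(SOURCE[0]) + 1  # one weight per field plus the threshold
    population = [[rng.random() for _ in range(n_genes)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        elite = sorted(population, key=fitness, reverse=True)[: pop_size // 2]
        children = []
        while len(elite) + len(children) < pop_size:
            mum, dad = rng.sample(elite, 2)
            cut = rng.randrange(1, n_genes)
            child = mum[:cut] + dad[cut:]
            if rng.random() < mutation_rate:
                k = rng.randrange(n_genes)
                child[k] = min(1.0, max(0.0, child[k] + rng.gauss(0, 0.1)))
            children.append(child)
        population = elite + children
    return max(population, key=fitness)


if __name__ == "__main__":
    best = evolve()
    print("config:", best)
    print("fitness:", fitness(best))
    print("links:", links(best))
```

No ground-truth links appear anywhere in the sketch: the search is driven only by properties of the produced linkset, which is the point the abstract makes. The specific properties and their combination in the paper may differ from the proxy used here.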

Bibliographic details

Source
The semantic web: research and applications. 2012, pp. 119-133 (15 pages)
Conference location: Heraklion (GR)
Authors
Andriy Nikolov; Mathieu d'Aquin; Enrico Motta
Affiliation

Knowledge Media Institute, The Open University, Milton Keynes, UK;

Original format: PDF
Language: English
Chinese Library Classification: Computer networks

Similar literature

1. Recognition of polymer configurations by unsupervised learning [J]. Xin Xu, Qianshi Wei, Huaping Li. Physical Review E. 2019, issue 4aPta2
2. Unsupervised Learning of Polychronous Wavefront Computation Configurations for Pattern Recognition [J]. Fred Highland. Procedia Computer Science. 2018, issue 1
3. Unsupervised Learning of Patterns Using Multilayer Reverberating Configurations of Polychronous Wavefront Computation [J]. Fred Highland, Corey Hart. Procedia Computer Science. 2016, issue 1
4. Unsupervised Learning of Link Discovery Configuration [C]. Andriy Nikolov, Mathieu d'Aquin, Enrico Motta. Extended Semantic Web Conference. 2012
5. Discovery of Visual Semantics by Unsupervised and Self-Supervised Representation Learning [D]. Larsson, Gustav Martin. 2017
6. Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery [O]. Nora K. Speicher, Nico Pfeifer
7. Unsupervised learning of link discovery configuration [O]. Nikolov Andriy, d'Aquin Mathieu, Motta Enrico. 2012
8. Unsupervised Group Discovery and Link Prediction in Relational Datasets: A Nonparametric Bayesian Approach [R]. Koutsourelakis, P. S. 2007
