NSCaching: Simple and Efficient Negative Sampling for Knowledge Graph Embedding

机译：NSCACHING：知识图形嵌入的简单有效的负面抽样

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Knowledge graph (KG) embedding is a fundamental problem in data mining research with many real-world applications. It aims to encode the entities and relations in the graph into low dimensional vector space, which can be used for subsequent algorithms. Negative sampling, which samples negative triplets from non-observed ones in the training data, is an important step in KG embedding. Recently, generative adversarial network (GAN), has been introduced in negative sampling. By sampling negative triplets with large scores, these methods avoid the problem of vanishing gradient and thus obtain better performance. However, using GAN makes the original model more complex and harder to train, where reinforcement learning must be used. In this paper, motivated by the observation that negative triplets with large scores are important but rare, we propose to directly keep track of them with cache. However, how to sample from and update the cache are two important questions. We carefully design the solutions, which are not only efficient but also achieve good balance between exploration and exploitation. In this way, our method acts as a "distilled" version of previous GAN-based methods, which does not waste training time on additional parameters to fit the full distribution of negative triplets. The extensive experiments show that our method can gain significant improvement on various KG embedding models, and outperform the state-of-the-arts negative sampling methods based on GAN.

机译：知识图（千克）嵌入是数据挖掘研究中的基本问题，具有许多现实世界应用。它旨在将图中的实体和关系编码为低维矢量空间，其可用于后续算法。消极采样，其在训练数据中从未观察到的非观察到的阴性三胞胎，是KG嵌入的重要步骤。最近，生成的对抗性网络（GaN）已被引入负面采样。通过具有大得分的负三胞胎，这些方法避免了梯度消失的问题，从而获得更好的性能。然而，使用GaN使原始模型更复杂，更难训练，必须使用加固学习。在本文中，通过观察到具有大分数的负三胞胎很重要但罕见，我们建议直接通过缓存跟踪它们。但是，如何从和更新缓存是两个重要问题。我们仔细设计了解决方案，这不仅有效，而且在勘探和剥削之间实现了良好的平衡。通过这种方式，我们的方法充当了先前GaN的方法的“蒸馏”版本，它不会在附加参数上浪费训练时间以适应负三胞胎的全部分布。广泛的实验表明，我们的方法可以对各种KG嵌入模型进行显着改进，并且优于基于GaN的最先进的负采样方法。

著录项

来源
《IEEE International Conference on Data Engineering》|2019年|721p|共12页
会议地点
作者
Yongqi Zhang; Quanming Yao; Yingxia Shao; Lei Chen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类数据处理、数据处理系统;
关键词
Training; Generative adversarial networks; Gallium nitride; Task analysis; Optimization; Semantics; Sampling methods;

机译：培训;生成的对抗网络;氮化镓;任务分析;优化;语义;抽样方法;

相似文献

外文文献
中文文献
专利

1. Simple and automated negative sampling for knowledge graph embedding [J] . Zhang Yongqi, Yao Quanming, Chen Lei The VLDB journal . 2021,第2期

机译：知识图形嵌入的简单和自动化的负面抽样
2. Understanding Negative Sampling in Knowledge Graph Embedding [J] . Jing Qian, Gangmin Li, Katie Atkinson, International Journal of Artificial Intelligence & Applications (IJAIA) . 2021,第1期

机译：了解知识图嵌入中的负面抽样
3. A simple and highly efficient counter-current chromatography method for the isolation of concentrated fractions of compounds based on the sequential sample loading technique: Comparative theoretical study of conventional multiple and intermittent sample loading counter-current chromatography separations [J] . Kostanyan Artak E. Journal of chromatography, A: Including electrophoresis and other separation methods . 2021,第1期

机译：一种简单且高效的逆流色谱法，用于基于顺序样品加载技术分离化合物的浓缩级分：常规多发性样品加载逆流色谱分离的比较理论研究
4. NSCaching: Simple and Efficient Negative Sampling for Knowledge Graph Embedding [C] . Yongqi Zhang, Quanming Yao, Yingxia Shao, IEEE International Conference on Data Engineering . 2019

机译：NSCaching：知识图嵌入的简单有效的负采样
5. On the efficiency of ranked set sampling relative to simple random sampling for estimating the ordinary least squares parameters of the simple linear regression model. [D] . Murff, Elizabeth J Tipton. 2001

机译：关于估计简单线性回归模型的普通最小二乘法参数的排序集抽样相对于简单随机抽样的效率。
6. Efficient and Exact Sampling of Simple Graphs with Given Arbitrary Degree Sequence [O] . Charo I. Del Genio, Hyunju Kim, Zoltán Toroczkai, 2010

机译：给定任意度数序列的简单图的有效和精确采样
7. Understanding Negative Sampling in Knowledge Graph Embedding [O] . Jing Qian, Gangmin Li, Katie Atkinson, 2021

机译：了解知识图形嵌入中的负面抽样
8. Mass Spectral Investigations on Toxins. 2. Simultaneous Detection and Quantification of Ultra-Trace Levels of Simple Trichothecenes in Environmental and Fermentation Samples by Gas Chromatographic/Negative Ion Chemical Ionization- [R] . Krishnamurthy, T., Wasserman, M. B., Sarver, E. W. 1987

机译：毒素的质谱研究。 2.通过气相色谱/负离子化学电离同时检测和定量环境和发酵样品中超痕量单纯的单端孢菌素 -

NSCaching: Simple and Efficient Negative Sampling for Knowledge Graph Embedding

摘要

著录项

相似文献

相关主题

期刊订阅