首页> 外文会议>Conference on Neural Information Processing Systems >Exact sampling of determinantal point processes with sublinear time preprocessing
【24h】

Exact sampling of determinantal point processes with sublinear time preprocessing

机译:具有载于Sublinear Time预处理的决定性点过程的精确抽样

获取原文

摘要

We study the complexity of sampling from a distribution over all index subsets of the set {1, ..., n} with the probability of a subset S proportional to the determinant of the submatrix L_S of some n × n positive semidefinite matrix L, where L_S corresponds to the entries of L indexed by S. Known as a determinantal point process (DPP), this distribution is used in machine learning to induce diversity in subset selection. When sampling from DDPs, we often wish to sample multiple subsets S with small expected size k Δ= E[|S|] n from a very large matrix L, so it is important to minimize the preprocessing cost of the procedure (performed once) as well as the sampling cost (performed repeatedly). For this purpose we provide DPP-VFX, a new algorithm which, given access only to L, samples exactly from a determinantal point process while satisfying the following two properties: (1) its preprocessing cost is n · poly(k), i.e., sublinear in the size of L, and (2) its sampling cost is poly(k), i.e., independent of the size of L. Prior to our results, state-of-the-art exact samplers required O(n~3) preprocessing time and sampling time linear in n or dependent on the spectral properties of L. We furthermore give a reduction which allows using our algorithm for exact sampling from cardinality constrained determinantal point processes with n · poly(k) time preprocessing. Our implementation of DPP-VFX is provided at https://github.com/guilgautier/DPPy/.
机译:我们研究了SET {1,...,n}的所有指数子集的分布中采样的复杂性,其中子集的概率与某些n×n正半纤维素矩阵L的子峰L_s的决定簇成比例的概率,其中L_S对应于由S的L索引的L索引的参数称为确定静电点处理(DPP),该分布用于机器学习中以引起子集选择的多样性。从DDPS采样时,我们经常希望使用非常大的矩阵L采样具有小预期大小kΔ= e [| s |] n的多个子集s,因此重要的是最小化程序的预处理成本(执行一旦)以及采样成本(重复执行)。为此目的,我们提供DPP-VFX,一种新的算法,该算法,它仅给予L的访问权限,从确定静电点过程中准确地样本,同时满足以下两个属性:(1)其预处理成本是n·poly(k),即, Lublinear大小的L,(2)其采样成本是聚(k),即独立于L的大小。在我们的结果之前,所需的最先进的精确采样器O(n〜3)在N中的预处理时间和采样时间线性或取决于L的光谱性能。我们还提供了一种降低,其允许使用我们的算法从基数受约束的决定性点过程中具有N·poly(k)时间预处理的精确采样。我们在https://github.com/guilgautier/dppy/提供了DPP-VFX的实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号