首页> 外文会议>Annual ACM-SIAM Symposium on Discrete Algorithms >A Faster Subquadratic Algorithm for Finding Outlier Correlations
【24h】

A Faster Subquadratic Algorithm for Finding Outlier Correlations

机译:用于查找异常相关性的更快的子相关算法

获取原文

摘要

We study the problem of detecting outlier pairs of strongly correlated variables among a collection of n variables with otherwise weak pairwise correlations. After normalization, this task amounts to the geometric task where we are given as input a set of n vectors with unit Euclidean norm and dimension d, and we are asked to find all the outlier pairs of vectors whose inner product is at least ρ in absolute value, subject to the promise that all but at most q pairs of vectors have inner product at most τ in absolute value for some constants 0 < τ < ρ < 1. Improving on an algorithm of G. Valiant [FOCS 2012; J.ACM 2015], we present a randomized algorithm that for Boolean inputs ({-1, 1}-valued data normalized to unit Euclidean length) runs in time O(n~(max {1-γ+M(Δγ,γ), M(1-γ,2Δγ)}) + qdn~(2γ)), where 0 < γ < 1 is a constant tradeoff parameter and M(μ, ν) is the exponent to multiply an [n~μ] × [n~ν] matrix with an [n~ν] × [n~μ] matrix and Δ = 1/(1 - log_τ ρ). As corollaries we obtain randomized algorithms that run in time O(n(2ω)/(3-log_τ ρ) + qdn(2(1-log_τ ρ))/(3-log_τ ρ)) and in time O(n4/(2+α(1 - log_τ ρ)) + qdn(2α(1 - log_τ ρ))/(2+α(1 - log_τ ρ))), where 2 ≤ ω < 2.38 is the exponent for square matrix multiplication and 0.3 < α ≤ 1 is the exponent for rectangular matrix multiplication. We present further corollaries for the light bulb problem and for learning sparse Boolean functions. (The notation O (·) hides polylogarithmic factors in n and d whose degree may depend on ρ and τ.)
机译:我们研究了在N个变量的集合中检测到强烈相关变量的异常与较弱的成对相关性的问题。归一化之后,此任务金额为几何任务,其中我们被提供为具有单位欧几里德规范和维度D的一组N向量,并且我们被要求找到内部产品至少为绝对ρ的所有异常对向量。值得的是,经过的承诺,除了大多数Q对向量的所有情况下都有大多数τ的内部产品,对于某些常数0 <τ<ρ<1的绝对值。关于G. Valant算法的改进[Focs 2012; J.AcM 2015,我们介绍了一种随机算法,即用于布尔输入({-1,1}归一化到单位欧几里德长度的数据)在时间o(n〜(max {1-γ+ m(Δγ,γ ),M(1-γ,2Δγ)})+ QDN〜(2γ)),其中0 <γ<1是恒定的折衷参数,M(μ,ν)是乘以[nμ]×的指数[n〜ν]矩阵,具有[n〜ν]×[nμ]矩阵和δ= 1 /(1 - log_τρ)。作为冠状体,我们获得了在时间O(n(2Ω)/(3-log_τρ)+ qdn(2(1-log_τρ))/(3-log_τρ))和时间o(n4 /(n4 /))的随机化算法2 +α(1 - log_τρ))+ QDN(2α(1 - log_τρ))/(2 +α(1 - log_τρ))),其中2≤Ω<2.38是方矩阵乘法的指数和0.3 <α≤1是矩形矩阵乘法的指数。我们为灯泡问题提供了进一步的推论,并用于学习稀疏布尔函数。 (符号O(·)隐藏在尺寸可以取决于ρ和τ的N和D中的聚动力学因子。)

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号