首页> 外文学位 >Convex Optimization with Applications in Sparse Multivariate Statistics.
【24h】

Convex Optimization with Applications in Sparse Multivariate Statistics.

机译:凸优化及其在稀疏多元统计中的应用。

获取原文
获取原文并翻译 | 示例

摘要

The main focus of this thesis is to build sparse statistical models central to several machine learning tasks. Parsimonious modeling in statistics seeks to balance the tradeoff between solution accuracy and the curse of dimensionality. From an optimization perspective, challenges emanate from the inherent non-convexity in these problems and the computational bottlenecks in traditional algorithms.;We first focus on capturing dependence relationships between variables in as sparse a manner as possible. Covariance selection seeks to estimate a covariance matrix by maximum likelihood while restricting the number of nonzero inverse covariance matrix coefficients. A single penalty parameter usually controls the tradeoff between log likelihood and sparsity in the inverse matrix. We describe an efficient algorithm for computing a full regularization path of solutions to this problem.;We next derive a semidefinite relaxation for the problem of computing generalized eigenvalues with a constraint on the cardinality of the corresponding eigenvector. We first use this result to produce a sparse variant of linear discriminant analysis and compare classification performance with greedy and thresholding approaches. We then use this relaxation to produce lower performance bounds on the subset selection problem.;Finally, we derive approximation algorithms for the nonnegative matrix factorization problem, i.e. the problem of factorizing a matrix as the product of two matrices with nonnegative coefficients. We form convex approximations of this problem which can be solved efficiently via first-order algorithms.
机译:本文的主要重点是建立稀疏的统计模型,该模型对于若干机器学习任务至关重要。统计中的简约建模试图在求解精度和维数诅咒之间取得平衡。从优化的角度来看,挑战来自于这些问题固有的非凸性以及传统算法的计算瓶颈。我们首先关注于以尽可能稀疏的方式捕获变量之间的依赖关系。协方差选择旨在通过最大似然来估计协方差矩阵,同时限制非零逆协方差矩阵系数的数量。单个惩罚参数通常控制逆矩阵中对数似然和稀疏性之间的折衷。我们描述了一种用于计算该问题的完整正则化路径的有效算法。;接下来,我们对计算广义特征值的问题(对应特征向量的基数有约束)推导了半确定松弛。我们首先使用此结果来生成线性判别分析的稀疏变体,然后将分类性能与贪婪和阈值方法进行比较。然后,我们使用这种松弛来在子集选择问题上产生较低的性能界限。最后,我们导出了非负矩阵分解问题的近似算法,即将矩阵分解为两个具有非负系数的矩阵乘积的问题。我们形成这个问题的凸近似,可以通过一阶算法有效地解决。

著录项

  • 作者

    Krishnamurthy, Vijay Kumar.;

  • 作者单位

    Princeton University.;

  • 授予单位 Princeton University.;
  • 学科 Statistics.;Economics Finance.;Operations Research.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 122 p.
  • 总页数 122
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号