首页> 外文期刊>Computational statistics & data analysis >Primal path algorithm for compositional data analysis
【24h】

Primal path algorithm for compositional data analysis

机译:组成数据分析的原始路径算法

获取原文
获取原文并翻译 | 示例
           

摘要

We consider the LASSO estimator for compositional data in which covariates are nonnegative, and their sum is always one. Due to the linear constraint of the regression coefficients caused by the sum to one condition, standard algorithms for LASSO cannot be applied directly to compositional data. Hence, a specific regularized regression model with linear constraints is commonly used. However, linear constraints incur additional computational time, which becomes severe in high-dimensional cases. Additionally, the exact computation for the regression is not investigated under existing methods. In this paper, we first propose an exact solution path algorithm for a l(1) regularized regression with high-dimensional compositional data and extend to a classification model. We also compare its computational speed with that of previously developed algorithms and then apply the proposed algorithm to analyzing income inequality data in economics and human gut microbiome data in biology. By analyzing simulated and real data sets, we illustrate that our specialized algorithm is significantly more efficient than the generalized LASSO algorithm for compositional data. (C) 2020 Elsevier B.V. All rights reserved.
机译:我们考虑套索估算器的组成数据,其中协调数据是非负面的,并且它们的总和总是一个。由于由总和引起的回归系数的线性约束,Lasso的标准算法不能直接应用于组成数据。因此,通常使用具有线性约束的特定正则化回归模型。然而,线性约束导致额外的计算时间,这在高维例中变得严重。此外,未在现有方法下研究回归的确切计算。在本文中,我们首先提出了一种具有高维成分数据的L(1)正则化回归的精确解决方案路径算法,并延伸到分类模型。我们还将其计算速度与先前开发的算法的计算速度进行比较,然后应用所提出的算法来分析生物学中经济学和人体肠道微生物数据中的收入不等式数据。通过分析模拟和真实数据集,我们说明我们的专用算法比用于组建数据的广义套索算法明显更有效。 (c)2020 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号