Journal: 计算机科学与探索 (Journal of Frontiers of Computer Science and Technology)

Tensor-Based Regularized Multilinear Regression Algorithm and Its Application

     

Abstract

Conventional regression algorithms such as LASSO (least absolute shrinkage and selection operator) operate on vectorized data. Vectorization, however, destroys the original structure and intrinsic correlations of the data and ignores its high-order dependencies. It also inflates the data dimensionality, which increases computational cost and storage requirements. This paper therefore proposes a tensor-based regularized multilinear regression algorithm, multilinear LASSO (mLASSO), which extends LASSO to tensor space. The algorithm first applies mode products between the tensor and weight vectors, mapping the tensor space to a vector space. It then runs LASSO in that space to regress the target value and obtain the weight vector for the corresponding mode, using alternating iteration to optimize the weight vector of each mode in turn. Finally, it applies mode products between the optimal weight vectors of all modes and the tensor data to obtain the predicted value. The algorithm has two main advantages: (1) it makes full use of the structural information in the data; (2) because LASSO has feature selection built in, it improves the generalization ability of the model. Experimental results show that the method performs well on multilinear data.
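The alternating procedure described in the abstract can be illustrated with a minimal sketch for second-order tensors (matrices), where the prediction for a sample X_i is u^T X_i v. This is an illustration under stated assumptions, not the paper's implementation: the function names (`lasso_cd`, `mlasso_fit`, `mlasso_predict`) are hypothetical, the LASSO subproblem is solved with a plain coordinate-descent routine, there is no intercept (data are assumed centered), and a fixed number of outer iterations stands in for whatever stopping rule the paper uses.

```python
import numpy as np

def soft_threshold(z, t):
    """Soft-thresholding operator used by the LASSO coordinate update."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def lasso_cd(Z, y, lam, n_iter=100):
    """Coordinate descent for (1/(2n)) * ||y - Z w||^2 + lam * ||w||_1."""
    n, d = Z.shape
    w = np.zeros(d)
    col_sq = (Z ** 2).sum(axis=0) / n   # per-feature curvature
    r = y - Z @ w                       # current residual
    for _ in range(n_iter):
        for j in range(d):
            if col_sq[j] == 0.0:
                continue
            r += Z[:, j] * w[j]         # remove feature j's contribution
            rho = Z[:, j] @ r / n
            w[j] = soft_threshold(rho, lam) / col_sq[j]
            r -= Z[:, j] * w[j]         # add the updated contribution back
    return w

def mlasso_fit(X, y, lam=0.01, n_outer=20):
    """Alternating LASSO for y_i ~ u^T X_i v, with X of shape (n, p, q).

    Assumption: fixed-count outer loop and constant initialization;
    the paper's actual initialization and convergence test are not known.
    """
    n, p, q = X.shape
    u = np.ones(p) / np.sqrt(p)
    v = np.ones(q) / np.sqrt(q)
    for _ in range(n_outer):
        # Fix v: the mode product X_i v gives a p-dimensional feature vector.
        Zu = np.einsum('ipq,q->ip', X, v)
        u = lasso_cd(Zu, y, lam)
        # Fix u: the mode product X_i^T u gives a q-dimensional feature vector.
        Zv = np.einsum('ipq,p->iq', X, u)
        v = lasso_cd(Zv, y, lam)
    return u, v

def mlasso_predict(X, u, v):
    """Predict by contracting each sample with both weight vectors."""
    return np.einsum('ipq,p,q->i', X, u, v)
```

Each inner problem has only p or q unknowns instead of the p*q unknowns that LASSO would face on the vectorized sample, which is the dimensionality saving the abstract refers to. Note that u and v are only identified up to a reciprocal scaling, so the individual vectors should be interpreted together rather than separately.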
