首页> 美国卫生研究院文献>other >Multiple Response Regression for Gaussian Mixture Models with Known Labels
【2h】

Multiple Response Regression for Gaussian Mixture Models with Known Labels

机译:具有已知标签的高斯混合模型的多重响应回归

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Multiple response regression is a useful regression technique to model multiple response variables using the same set of predictor variables. Most existing methods for multiple response regression are designed for modeling homogeneous data. In many applications, however, one may have heterogeneous data where the samples are divided into multiple groups. Our motivating example is a cancer dataset where the samples belong to multiple cancer subtypes. In this paper, we consider modeling the data coming from a mixture of several Gaussian distributions with known group labels. A naive approach is to split the data into several groups according to the labels and model each group separately. Although it is simple, this approach ignores potential common structures across different groups. We propose new penalized methods to model all groups jointly in which the common and unique structures can be identified. The proposed methods estimate the regression coefficient matrix, as well as the conditional inverse covariance matrix of response variables. Asymptotic properties of the proposed methods are explored. Through numerical examples, we demonstrate that both estimation and prediction can be improved by modeling all groups jointly using the proposed methods. An application to a glioblastoma cancer dataset reveals some interesting common and unique gene relationships across different cancer subtypes.
机译:多重响应回归是一种有用的回归技术,可使用同一组预测变量对多个响应变量进行建模。多数现有的用于多响应回归的方法都用于对同类数据进行建模。但是,在许多应用中,可能会有异类数据,其中样本被分为多个组。我们的激励示例是癌症数据集,其中样本属于多种癌症亚型。在本文中,我们考虑对来自具有已知组标签的几种高斯分布的混合数据进行建模。天真的方法是根据标签将数据分为几组,并分别对每组建模。尽管很简单,但是这种方法忽略了不同群体之间潜在的共同结构。我们提出了一种新的惩罚方法,可以共同对所有组进行建模,从而可以确定公共结构和唯一结构。所提出的方法估计了回归系数矩阵以及响应变量的条件逆协方差矩阵。探索了所提出方法的渐近性质。通过数值示例,我们证明,通过使用提出的方法联合对所有组进行建模,可以同时提高估计和预测。胶质母细胞瘤癌症数据集的一项应用揭示了不同癌症亚型之间一些有趣的共同且独特的基因关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号