首页> 外文期刊>Genetic epidemiology. >Inferring gene regulatory relationships with a high-dimensional robust approach
【24h】

Inferring gene regulatory relationships with a high-dimensional robust approach

机译:用高维鲁棒方法推断基因监管关系

获取原文
获取原文并翻译 | 示例
           

摘要

Gene expression (GE) levels have important biological and clinical implications. They are regulated by copy number alterations (CNAs). Modeling the regulatory relationships between GEs and CNAs facilitates understanding disease biology and can also have values in translational medicine. The expression level of a gene can be regulated by its cis-acting as well as trans-acting CNAs, and the set of trans-acting CNAs is usually not known, which poses a high-dimensional selection and estimation problem. Most of the existing studies share a common limitation in that they cannot accommodate long-tailed distributions or contamination of GE data. In this study, we develop a high-dimensional robust regression approach to infer the regulatory relationships between GEs and CNAs. A high-dimensional regression model is used to accommodate the effects of both cis-acting and trans-acting CNAs. A density power divergence loss function is used to accommodate long-tailed GE distributions and contamination. Penalization is adopted for regularized estimation and selection of relevant CNAs. The proposed approach is effectively realized using a coordinate descent algorithm. Simulation shows that it has competitive performance compared to the nonrobust benchmark and the robust LAD (least absolute deviation) approach. We analyze TCGA (The Cancer Genome Atlas) data on cutaneous melanoma and study GE-CNA regulations in the RAP (regulation of apoptosis) pathway, which further demonstrates the satisfactory performance of the proposed approach.
机译:基因表达(GE)水平具有重要的生物学和临床意义。它们是通过拷贝数改变(CNA)的监管。建模GES和CNA之间的监管关系有助于了解疾病生物学,也可以具有翻译药物的价值。基因的表达水平可以通过其顺式作用以及反式作用CNA来调节,并且通常未知的转效CNA一组,其姿势是高维选择和估计问题。大多数现有研究共享共同的限制,因为它们无法容纳长尾的分布或GE数据的污染。在这项研究中,我们开发了一种高维鲁棒回归方法来推断GES和CNA之间的监管关系。高维回归模型用于适应CIS作用和反式作用CNA的效果。密度功率发散损耗功能用于容纳长尾的GE分布和污染。通过正规化估计和联系选择的惩罚。使用坐标阶级算法有效地实现了所提出的方法。仿真结果表明,与非粗糙基准和强大的LAD(最小绝对偏差)方法相比,它具有竞争性能。我们分析皮肤黑素瘤的TCGA(癌症基因组ATLAS)数据,并研究GE-CNA规定在RAP(细胞凋亡的调节)途径中,进一步展示了所提出的方法的令人满意的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号