首页> 外文期刊>International journal of knowledge discovery in bioinformatics >Improved Feature Selection by Incorporating Gene Similarity into the LASSO
【24h】

Improved Feature Selection by Incorporating Gene Similarity into the LASSO

机译:通过将基因相似性纳入LASSO,改进了特征选择

获取原文
获取原文并翻译 | 示例
       

摘要

Personalized medicine is customizing treatments to a patient's genetic profile and has the potential to revolutionize medical practice. An important process used in personalized medicine is gene expression profiling. Analyzing gene expression profiles is difficult, because there are usually few patients and thousands of genes, leading to the curse of dimensionality. To combat this problem, researchers suggest using prior knowledge to enhance feature selection for supervised learning algorithms. The authors propose an enhancement to the LASSO, a shrinkage and selection technique that induces parameter sparsity by penalizing a model's objective function. Their enhancement gives preference to the selection of genes that are involved in similar biological processes. The authors 'modified LASSO selects similar genes by penalizing interaction terms between genes. They devise a coordinate descent algorithm to minimize the corresponding objective function. To evaluate their method, the authors created simulation data where they compared their model to the standard LASSO model and an interaction LASSO model. The authors' model outperformed both the standard and interaction LASSO models in terms of detecting important genes and gene interactions for a reasonable number of training samples. They also demonstrated the performance of their method on a real gene expression data set from lung cancer cell lines.
机译:个性化医学正在根据患者的遗传特征定制治疗方法,并且有可能改变医学实践。个性化医学中使用的重要过程是基因表达谱分析。分析基因表达谱非常困难,因为通常只有很少的病人和成千上万的基因,这导致了维度的诅咒。为了解决这个问题,研究人员建议使用先验知识来增强监督学习算法的特征选择。作者提出了对LASSO的增强功能,这是一种通过惩罚模型的目标函数而导致参数稀疏的收缩和选择技术。它们的增强优先选择相似生物学过程中涉及的基因。作者修改后的LASSO通过惩罚基因之间的相互作用项来选择相似的基因。他们设计了一种坐标下降算法来最小化相应的目标函数。为了评估他们的方法,作者创建了仿真数据,将他们的模型与标准LASSO模型和交互LASSO模型进行了比较。在检测合理数量的训练样本的重要基因和基因相互作用方面,作者的模型优于标准和相互作用的LASSO模型。他们还证明了他们的方法在肺癌细胞系真实基因表达数据集上的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号