首页> 外文期刊>Journal of theoretical & computational chemistry >Improving the classification performance with group lasso-based ranking method in high dimensional correlated data
【24h】

Improving the classification performance with group lasso-based ranking method in high dimensional correlated data

机译:基于组的基于卢斯的排名方法在高维相关数据中提高分类性能

获取原文
获取原文并翻译 | 示例
       

摘要

The high-throughput correlated DNA methylation (DNAmeth) dataset generated from Illumina Infinium Human Methylation 27 (IIHM 27K) BeadChip assay. In the DNAmeth data, there are several CpG sites for every gene, and these grouped CpG sites are highly correlated. Most of the current filtering-based ranking (FBR) methods do not consider the group correlation structures. Obtaining the significant features with the FBR methods and applying these features to the classifiers to attain the best classification accuracy in highly correlated DNAmeth data is a challenging task. In this research, we introduce a resampling of group least absolute shrinkage and selection operator (glasso) FBR method capable of ignoring the unrelated features in the data considering the group correlation among the features. The various classifiers, such as random forests (RF), Naive Bayes (NB), and support vector machines (SVM) with the significant CpGs obtained from the proposed resampling of group lasso-based ranking (RGLR) method helped to boost the classification accuracy. Through simulated and experimental prostate DNAmeth data, we showed that higher performance of accuracy, sensitivity, specificity, and geometric mean is achieved by ignoring the unimportant CpG sites through the RGLR method.
机译:从Illumina人甲基化27(IIHM 27K)珠芯片测定产生的高通量相关的DNA甲基化(DNAMeth)数据集。在D污姓数据中,每个基因有几个CPG站点,这些分组的CPG站点具有高度相关性。大多数基于过滤的排名(FBR)方法不考虑组相关结构。获得具有FBR方法的重要特征并将这些特征应用于分类器以获得最佳分类准确性,在高度相关的DNamet数据中是一个具有挑战性的任务。在这项研究中,我们介绍了能够忽略数据中的数据中的无关特征的基团最小绝对收缩和选择操作员(Glasso)FBR方法的重新采样。各种分类器,例如随机森林(RF),朴素贝叶斯(NB)和支持向量机(SVM),其具有从基于卢斯为基于卢斯的排名(RGLR)方法的提出的重采样获得的重要CPG有助于提高分类精度。通过模拟和实验前列腺DNAMETH数据,我们表明通过RGLR方法忽略不重要的CPG位点来实现更高的精度,灵敏度,特异性和几何平均值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号