首页> 美国卫生研究院文献>other >Identification and Analysis of Driver Missense Mutations Using Rotation Forest with Feature Selection
【2h】

Identification and Analysis of Driver Missense Mutations Using Rotation Forest with Feature Selection

机译:基于特征选择的旋转森林驾驶员遗漏突变识别与分析

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Identifying cancer-associated mutations (driver mutations) is critical for understanding the cellular function of cancer genome that leads to activation of oncogenes or inactivation of tumor suppressor genes. Many approaches are proposed which use supervised machine learning techniques for prediction with features obtained by some databases. However, often we do not know which features are important for driver mutations prediction. In this study, we propose a novel feature selection method (called DX) from 126 candidate features' set. In order to obtain the best performance, rotation forest algorithm was adopted to perform the experiment. On the train dataset which was collected from COSMIC and Swiss-Prot databases, we are able to obtain high prediction performance with 88.03% accuracy, 93.9% precision, and 81.35% recall when the 11 top-ranked features were used. Comparison with other various techniques in the TP53, EGFR, and Cosmic2plus datasets shows the generality of our method.
机译:识别与癌症相关的突变(驱动程序突变)对于理解导致癌基因激活或抑癌基因失活的癌症基因组的细胞功能至关重要。提出了许多使用监督的机器学习技术进行预测的方法,这些方法具有一些数据库获得的特征。但是,我们通常不知道哪些功能对于驾驶员突变预测很重要。在这项研究中,我们从126个候选特征集中提出了一种新颖的特征选择方法(称为DX)。为了获得最佳性能,采用旋转森林算法进行实验。在从COSMIC和Swiss-Prot数据库收集的火车数据集上,当使用11个排名最高的功能时,我们能够以88.03%的准确性,93.9%的准确性和81.35%的召回率获得较高的预测性能。与TP53,EGFR和Cosmic2plus数据集中的其他各种技术进行比较,可以看出我们方法的普遍性。

著录项

  • 期刊名称 other
  • 作者

    Xiuquan Du; Jiaxing Cheng;

  • 作者单位
  • 年(卷),期 -1(2014),-1
  • 年度 -1
  • 页码 905951
  • 总页数 7
  • 原文格式 PDF
  • 正文语种
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号