...
首页> 外文期刊>Journal of computational biology: A journal of computational molecular cell biology >Driver Missense Mutation Identification Using Feature Selection and Model Fusion
【24h】

Driver Missense Mutation Identification Using Feature Selection and Model Fusion

机译:基于特征选择和模型融合的驾驶员错义突变识别

获取原文
获取原文并翻译 | 示例

摘要

Driver mutations propel oncogenesis and occur much less frequently than passenger mutations. The need for automatic and accurate identification of driver mutations has increased dramatically with the exponential growth of mutation data. Current computational solutions to identify driver mutations rely on sequence homology. Here we construct a machine learning-based framework that does not rely on sequence homology or domain knowledge to predict driver missense mutations. A windowing approach to represent the local environment of the sequence around the mutation point as a mutation sample is applied, followed by extraction of three sequence-level features from each sample. After selecting the most significant features, the support vector machine and multimodal fusion strategies are employed to give final predictions. The proposed framework achieves relatively high performance and outperforms current state-of-the-art algorithms. The ease of deploying the proposed framework and the relatively accurate performance make this solution applicable to large-scale mutation data analyses.
机译:驱动基因突变促进了肿瘤的发生,其发生频率远低于乘客基因突变。随着突变数据的指数增长,对自动和准确识别驱动程序突变的需求已大大增加。用于识别驱动程序突变的当前计算解决方案依赖于序列同源性。在这里,我们构建了一个基于机器学习的框架,该框架不依赖序列同源性或领域知识来预测驾驶员错义突变。应用突变样本时,采用开窗方法代表突变点周围序列的局部环境,然后从每个样本中提取三个序列级特征。选择最重要的特征后,将使用支持向量机和多模式融合策略进行最终预测。所提出的框架实现了相对较高的性能,并且优于当前的最新算法。部署建议框架的简便性和相对准确的性能使该解决方案适用于大规模突变数据分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号