首页> 外文期刊>International journal of medical engineering and informatics >Integrating domain knowledge in supervised machine learning to assess the risk of breast cancer
【24h】

Integrating domain knowledge in supervised machine learning to assess the risk of breast cancer

机译:将领域知识整合到有监督的机器学习中以评估乳腺癌的风险

获取原文
获取原文并翻译 | 示例
       

摘要

We used various supervised machine learning and data mining techniques to generate a model for predicting risk of breast cancer in post menopausal women using genomic data, family history, and age. In this paper, we propose an approach to select nine best SNPs using various feature selection algorithms and evaluate binary classifiers performance. We have also designed an algorithm to incorporate domain knowledge into our machine learning model. Our observations revealed that the machine learning model generated using both the domain knowledge and the feature selection technique performed better compared to the naive approach of classification. It is also interesting to note that, in addition to selecting nine best SNPs, feature selection resulted in removing age from the set of features to be used for cancer risk assessment.
机译:我们使用各种监督的机器学习和数据挖掘技术来生成一个模型,该模型使用基因组数据,家族史和年龄来预测绝经后妇女的乳腺癌风险。在本文中,我们提出了一种使用各种特征选择算法选择九种最佳SNP并评估二进制分类器性能的方法。我们还设计了一种算法,将领域知识整合到我们的机器学习模型中。我们的观察表明,与单纯的分类方法相比,使用领域知识和特征选择技术生成的机器学习模型的性能更好。还值得注意的是,除了选择9个最佳SNP之外,特征选择还导致从用于癌症风险评估的特征集中消除了年龄。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号