...
首页> 外文期刊>Information Sciences: An International Journal >Using sensitivity analysis and visualization techniques to open black box data mining models
【24h】

Using sensitivity analysis and visualization techniques to open black box data mining models

机译:使用敏感性分析和可视化技术打开黑匣子数据挖掘模型

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we propose a new visualization approach based on a Sensitivity Analysis (SA) to extract human understandable knowledge from supervised learning black box data mining models, such as Neural Networks (NNs), Support Vector Machines (SVMs) and ensembles, including Random Forests (RFs). Five SA methods (three of which are purely new) and four measures of input importance (one novel) are presented. Also, the SA approach is adapted to handle discrete variables and to aggregate multiple sensitivity responses. Moreover, several visualizations for the SA results are introduced, such as input pair importance color matrix and variable effect characteristic surface. A wide range of experiments was performed in order to test the SA methods and measures by fitting four well-known models (NN, SVM, RF and decision trees) to synthetic datasets (five regression and five classification tasks). In addition, the visualization capabilities of the SA are demonstrated using four real-world datasets (e.g., bank direct marketing and white wine quality).
机译:在本文中,我们提出了一种基于敏感性分析(SA)的新可视化方法,该方法可从有监督的学习黑匣子数据挖掘模型(例如神经网络(NN),支持向量机(SVM)和集成)中提取人类可理解的知识,包括随机森林(RF)。介绍了五种SA方法(其中三种是全新方法)和四种输入重要性度量(一种新颖方法)。而且,SA方法适用于处理离散变量并聚合多个灵敏度响应。此外,还介绍了几种SA结果的可视化效果,例如输入对重要性颜色矩阵和可变效果特征表面。通过将四个众所周知的模型(NN,SVM,RF和决策树)拟合到综合数据集(五个回归和五个分类任务),进行了广泛的实验以测试SA方法和度量。此外,使用四个真实世界的数据集(例如,银行直销和白葡萄酒质量)展示了SA的可视化功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号