【24h】

High Dimensional Visual Data Classification

机译:高维视觉数据分类

获取原文
获取原文并翻译 | 示例

摘要

We present new visual data mining algorithms for interactive decision tree construction with large datasets. The size of data stored in the world is constantly increasing but the limits of current visual data mining (and visualization) methods concerning the number of items and dimensions of the dataset treated are well known (even with pixellisation methods). One solution to improve these methods is to use a higher-level representation of the data, for example a symbolic data representation. Our new interactive decision tree construction algorithms deal with interval and taxonomical data. With such a representation, we are able to deal with potentially very large datasets because we do not use the original data but higher-level data representation. Interactive algorithms are examples of new data mining approach aiming at involving more intensively the user in the process. The main advantages of this user-centered approach are the increased confidence and comprehensibility of the obtained model, because the user was involved in its construction and the possible use of human pattern recognition capabilities. We present some results we obtained on very large datasets.
机译:我们提出了用于大型数据集的交互式决策树构建的新视觉数据挖掘算法。世界上存储的数据量在不断增加,但是众所周知,有关处理的数据集的项目数量和维度的当前可视数据挖掘(和可视化)方法的局限性(即使采用像素化方法也是如此)。改善这些方法的一种解决方案是使用数据的高级表示形式,例如符号数据表示形式。我们新的交互式决策树构造算法处理区间和分类数据。通过这种表示,我们能够处理潜在的非常大的数据集,因为我们不使用原始数据,而是使用更高级别的数据表示。交互式算法是新数据挖掘方法的示例,旨在使用户更深入地参与该过程。这种以用户为中心的方法的主要优点是,所获得模型的置信度和可理解性增强,因为用户参与了其模型的构建以及可能使用的人类模式识别功能。我们介绍一些在非常大的数据集上获得的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号