首页> 美国政府科技报告 >Tri-Plots: Scalable Tools for Multidimensional Data Mining
【24h】

Tri-Plots: Scalable Tools for Multidimensional Data Mining

机译:Tri-plots:用于多维数据挖掘的可扩展工具

获取原文

摘要

We focus on the problem of finding patterns across two large, multidimensional datasets. For example, given feature vectors of healthy and of non-healthy patients, we want to answer the following questions: Are the two clouds of points separable. What is the smallest/largest pair-wise distance across the two datasets. Which of the two clouds does a new point (feature vector) come from. We propose a new tool, the tri-plot, and its generalization, the pq-plot, which help us answer the above questions. We provide a set of rules on how to interpret a tri-plot, and we apply these rules on synthetic and real datasets. We also show how to use our tool for classification, when traditional methods (nearest neighbor, classification trees) may fail.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号