A tree-based summarization framework for differences between two data sets.

Abstract

This work addresses the problem of describing the difference between two data sets. A framework is developed to quantify that difference, on the assumption that it arises because the two data sets follow different statistical distributions. Beyond quantification, the framework also gives an intuitive explanation of the difference: a decision-tree-like structure is built to interpret its interesting point(s). A dynamic programming algorithm is developed that finds the globally optimal solution, but it has high computational complexity. To improve efficiency, a greedy algorithm is proposed. Both algorithms are tested on synthetic and real data sets.
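
The abstract does not give the scoring function or the tree construction used in the thesis, so the following is only a minimal sketch of what a greedy, decision-tree-like summary of the difference between two data sets might look like. Everything in it is an assumption made for illustration: the names `Node`, `split_score`, `best_split`, and `build_tree`, the axis-aligned threshold splits, and the score itself (the gap between the shares of each data set that fall on the left side of a split). It is not the author's algorithm, and it does not attempt the dynamic programming variant.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple

Row = Sequence[float]


@dataclass
class Node:
    """One region of the tree, with the number of rows from each data set."""
    n_a: int
    n_b: int
    feature: Optional[int] = None      # index of the split feature (None for a leaf)
    threshold: Optional[float] = None  # rows with value <= threshold go left
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def split_score(a_left: int, a_total: int, b_left: int, b_total: int) -> float:
    """Hypothetical score: gap between the shares of A and B that go left."""
    if a_total == 0 or b_total == 0:
        return 0.0
    return abs(a_left / a_total - b_left / b_total)


def best_split(a: List[Row], b: List[Row]) -> Optional[Tuple[float, int, float]]:
    """Exhaustively try every (feature, threshold) pair; keep the best score."""
    best: Optional[Tuple[float, int, float]] = None
    n_features = len((a or b)[0])
    for f in range(n_features):
        for t in sorted({row[f] for row in a + b}):
            a_left = sum(1 for row in a if row[f] <= t)
            b_left = sum(1 for row in b if row[f] <= t)
            score = split_score(a_left, len(a), b_left, len(b))
            if best is None or score > best[0]:
                best = (score, f, t)
    return best


def build_tree(a: List[Row], b: List[Row], depth: int = 2,
               min_score: float = 0.1) -> Node:
    """Greedy construction: take the locally best split, then recurse."""
    node = Node(n_a=len(a), n_b=len(b))
    if depth == 0 or not a or not b:
        return node
    found = best_split(a, b)
    if found is None or found[0] < min_score:
        return node  # the two data sets look alike in this region
    _, f, t = found
    node.feature, node.threshold = f, t
    node.left = build_tree([r for r in a if r[f] <= t],
                           [r for r in b if r[f] <= t], depth - 1, min_score)
    node.right = build_tree([r for r in a if r[f] > t],
                            [r for r in b if r[f] > t], depth - 1, min_score)
    return node
```

Calling `build_tree(a, b)` on two lists of equal-width numeric rows returns a root `Node`; leaves whose `n_a` and `n_b` counts are lopsided mark regions where the two data sets differ most under this assumed score. Because each split is chosen locally, the result may be worse than what an exhaustive or dynamic programming search would find, which is the trade-off the abstract describes.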

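Bibliographic record

  • Author

    Wang, Dong

  • Author affiliation

    Kent State University

  • Degree-granting institution: Kent State University
  • Subject: Computer Science
  • Degree: M.S.
  • Year: 2009
  • Pages: 51 p.
  • Total pages: 51
  • Original format: PDF
  • Text language: eng
  • Chinese Library Classification: Automation technology, computer technology
  • Keywords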
