首页> 外文学位 >Instance selection for model-based classifiers
【24h】

Instance selection for model-based classifiers

机译:基于模型的分类器的实例选择

获取原文
获取原文并翻译 | 示例

摘要

Aspects of a classifier's training dataset can often make building a helpful and high accuracy classifier difficult. Instance selection addresses some of the issues in a dataset by selecting a subset of the data in such a way that learning from the reduced dataset leads to a better classifier. This work introduces an integer programming formulation of instance selection that relies on column generation techniques to obtain a good solution to the problem. Experimental results show that instance selection improves the usefulness of some classifiers by optimizing the training data so that that the training dataset has easier to learn boundaries between class values. Also included in this paper are two case studies from the Surveillance, Epidemiology, and End Results (SEER) database that further confirm the benefit of instance selection. Overall, results indicate that performing instance selection for a classifier is a competitive classification approach. However, it should be noted that instance selection might overfit classifiers that have already achieved a good fit to the dataset.
机译:分类器的训练数据集的各个方面通常会使构建有用的高精度分类器变得困难。实例选择通过选择数据的子集来解决数据集中的一些问题,这样从简化的数据集中学习可以得到更好的分类器。这项工作介绍了实例选择的整数编程公式,该公式依赖列生成技术来获得问题的良好解决方案。实验结果表明,实例选择通过优化训练数据来提高某些分类器的实用性,从而使训练数据集更易于学习类值之间的边界。本文还包括来自监视,流行病学和最终结果(SEER)数据库的两个案例研究,它们进一步证实了实例选择的好处。总体而言,结果表明为分类器执行实例选择是一种竞争性分类方法。但是,应注意的是,实例选择可能会过度拟合已经非常适合数据集的分类器。

著录项

  • 作者

    Bennette, Walter Dean.;

  • 作者单位

    Iowa State University.;

  • 授予单位 Iowa State University.;
  • 学科 Industrial engineering.;Computer science.
  • 学位 Ph.D.
  • 年度 2014
  • 页码 109 p.
  • 总页数 109
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号