首页> 外文会议> >An Extension of Iterative Scaling for Joint Decision-Level and Feature-Level Fusion in Ensemble Classification
【24h】

An Extension of Iterative Scaling for Joint Decision-Level and Feature-Level Fusion in Ensemble Classification

机译:集合分类中联合决策级和特征级融合的迭代缩放扩展。

获取原文

摘要

Improved iterative scaling (IIS) is a simple, powerful algorithm for learning maximum entropy (ME) conditional probability models that has found great utility in natural language processing and related applications. In nearly all prior work on IIS, one considers discrete-valued feature functions, depending on the data observations and class label, and encodes statistical constraints on these discrete-valued random variables. Moreover, most significantly for our purposes, the (ground-truth) constraints are measured from frequency counts, based on hard (0-1) training set instances of feature values. Here, we extend IIS for the case where the training (and test) set consists of instances of probability mass functions on the features, rather than instances of hard feature values. We show that the IIS methodology extends in a natural way for this case. This extension has applications 1) to ME aggregation of soft classifier outputs in ensemble classification and 2) to ME classification on mixed discrete-continuous feature spaces. Moreover, we combine these methods, yielding an ME method that jointly performs (soft) decision-level fusion and feature-level fusion in making ensemble decisions. We demonstrate favorable comparisons against both standard boosting and bagging on UC Irvine benchmark data sets. We also discuss some of our continuing research directions.
机译:改进的迭代缩放(IIS)是一种简单,强大的算法,用于学习最大熵(ME)条件概率模型,该概率模型在自然语言处理和相关应用中找到了很大的实用性。在几乎所有先前的IIS上工作都考虑了离散值的特征功能,具体取决于数据观察和类标签,并在这些离散值随机变量上编码统计约束。此外,对于我们的目的而言,最重要的是,基于特征值的硬(0-1)训练设置实例,从频率计数测量(地基)约束。在这里,我们为培训(和测试)集由特征上的概率质量功能的实例而不是硬特征值的实例来扩展IIS。我们表明IIS方法以自然的方式延伸到这种情况。此扩展名为Application 1)在Ensemble分类中的软分类器输出的聚合和2)对混合离散连续特征空间的分类。此外,我们结合了这些方法,产生了一个关于制作合并决策的(软)决策级融合和特征级融合的方法。我们展示了对UC IRVINE基准数据集的标准升压和袋装的有利比较。我们还讨论了我们的一些持续的研究方向。

著录项

  • 来源
    《》|2005年|P.61-66|共6页
  • 会议地点
  • 作者

    Miller; D.J.; Pal; S.;

  • 作者单位
  • 会议组织
  • 原文格式 PDF
  • 正文语种
  • 中图分类 工业技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号