【24h】

A Progressive Feature Selection Algorithm for Ultra Large Feature Spaces

机译:超大特征空间的渐进特征选择算法

获取原文
获取原文并翻译 | 示例

摘要

Recent developments in statistical modeling of various linguistic phenomena have shown that additional features give consistent performance improvements. Quite often, improvements are limited by the number of features a system is able to explore. This paper describes a novel progressive training algorithm that selects features from virtually unlimited feature spaces for conditional maximum entropy (CME) modeling. Experimental results in edit region identification demonstrate the benefits of the progressive feature selection (PFS) algorithm: the PFS algorithm maintains the same accuracy performance as previous CME feature selection algorithms (e.g., Zhou et al., 2003) when the same feature spaces are used. When additional features and their combinations are used, the PFS gives 17.66% relative improvement over the previously reported best result in edit region identification on Switchboard corpus (Kahn et al., 2005), which leads to a 20% relative error reduction in parsing the Switchboard corpus when gold edits are used as the upper bound.
机译:各种语言现象的统计建模的最新发展表明,附加功能可提供一致的性能改进。通常,改进受到系统能够探索的功能数量的限制。本文介绍了一种新颖的渐进式训练算法,该算法可从几乎无限的特征空间中选择特征进行条件最大熵(CME)建模。编辑区域识别的实验结果证明了渐进特征选择(PFS)算法的好处:当使用相同的特征空间时,PFS算法与以前的CME特征选择算法(例如Zhou等,2003)保持相同的准确性。 。当使用附加功能及其组合时,PFS相对于先前报告的关于配电板语料库的编辑区域识别的最佳结果提供了17.66%的相对改进(Kahn等人,2005年),这使得解析PFS时的相对误差降低了20%。黄金编辑用作上限时的总机语料库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号