【24h】

A Maximal Figure-of-Merit Learning Approach to Text Categorization

机译:文本分类的最大品质因数学习方法

获取原文
获取原文并翻译 | 示例

摘要

A novel maximal figure-of-merit (MFoM) learning approach to text categorization is proposed. Different from the conventional techniques, the proposed MFoM method attempts to integrate any performance metric of interest (e.g. accuracy, recall, precision, or F_1 measure) into the design of any classifier. The corresponding classifier parameters are learned by optimizing an overall objective function of interest. To solve this highly nonlinear optimization problem, we use a generalized probabilistic descent algorithm. The MFoM learning framework is evaluated on the Reuters-21578 task with LSI-based feature extraction and a binary tree classifier. Experimental results indicate that the MFoM classifier gives improved F_1 and enhanced robustness over the conventional one. It also outperforms the popular SVM method in micro-averaging F_1. Other extensions to design discriminative multiple-category MFoM classifiers for application scenarios with new performance metrics could be envisioned too.
机译:提出了一种新颖的最大品质因数(MFoM)学习方法,用于文本分类。与传统技术不同,提出的MFoM方法尝试将任何感兴趣的性能指标(例如准确性,查全率,准确性或F_1度量)集成到任何分类器的设计中。通过优化感兴趣的总体目标函数,可以学习相应的分类器参数。为了解决这个高度非线性的优化问题,我们使用了广义概率下降算法。通过基于LSI的特征提取和二叉树分类器,对Reuters-21578任务评估MFoM学习框架。实验结果表明,与传统的分类器相比,MFoM分类器具有改进的F_1和增强的鲁棒性。在微平均F_1中,它也优于流行的SVM方法。也可以设想为具有新性能指标的应用场景设计区分性多类别MFoM分类器的其他扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号