【24h】

Power set kernel for feature combination: data mining approach for its fast classifiers

机译:用于功能组合的Power Set内核:快速分类器的数据挖掘方法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

The kernel method (e.g., Support Vector Machines) attracts a great deal of attention recently. The merit of the kernel method is that the effective feature combination, which has been manually selected in the previous approaches, is implicitly expanded without loss of generality and computational cost. However, the kernel-based approach is usually too slow to classify large-scale test data. In this paper, we fist formulate a Power Set Kernel which gives a dot product of two sets. Then, we extend the Basket Mining algorithm to convert a kernel-based classifier into a simple and fast linear classifier. Experimental results on Japanese Word Segmentation and Japanese Dependency Parsing show that our new classifier is about 30-280 times faster than the standard kernel-based classifier.
机译:内核方法(例如,支持向量机)最近引起了很多关注。核方法的优点在于,隐含地扩展了在先前方法中手动选择的有效特征组合,而不会损失一般性和计算成本。但是,基于内核的方法通常太慢,无法对大规模测试数据进行分类。在本文中,我们首先制定了一个Power Set内核,该内核给出了两个集合的点积。然后,我们扩展了Basket Mining算法,将基于内核的分类器转换为简单快速的线性分类器。日语分词和日语依赖性解析的实验结果表明,我们的新分类器比基于标准内核的分类器快30-280倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号