【24h】

The Rocchio Classifier and Second Generation Wavelets

机译:Rocchio分类器和第二代小波

获取原文
获取原文并翻译 | 示例

摘要

Classification and characterization of text is of ever growing importance in defense and national security. The text classification task is an instance of classification using sparse features residing in a high dimensional feature space. Two standard (of a wide selection of available) algorithms for this task are the naive Bayes classifier and the Rocchio linear classifier. Naive Bayes classifiers are widely applied; the Rocchio algorithm is primarily used in document classification and information retrieval. Both these classifiers are popular because of their simplicity and ease of application, computational speed and reasonable performance. One aspect of the Rocchio approach, inherited from its information retrieval origin, is that it explicitly uses both positive and negative models. Parameters have been introduced which make it adaptive to the particulars of the corpora of interest and thereby improve its performance. The ideas inherent in these classifiers and in second generation wavelets can be recombined into new algorithms for classification. An example is a classifier using second generation wavelet-like functions for class probes that mimic the Rocchio positive template - negative template approach.
机译:文本的分类和特征在国防和国家安全中越来越重要。文本分类任务是使用驻留在高维特征空间中的稀疏特征进行分类的一个实例。朴素的贝叶斯分类器和Rocchio线性分类器是用于此任务的两种标准(可供选择的算法)算法。朴素贝叶斯分类器得到了广泛的应用。 Rocchio算法主要用于文档分类和信息检索。这两种分类器之所以受欢迎,是因为它们简单易用,计算速度快且性能合理。 Rocchio方法的一个方面是从其信息检索源继承而来的,它明确地使用了肯定和否定模型。已经引入参数,使其适应感兴趣的语料库的细节,从而提高其性能。这些分类器和第二代小波中固有的思想可以重组为新的分类算法。一个示例是使用类似于第二代小波函数的分类器对类探针进行模仿Rocchio阳性模板-阴性模板方法的分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号