首页> 外文会议>Symposium on multispectral image processing and pattern recognition >Drug-related webpages classification using images and text information based on multi-kernel learning
【24h】

Drug-related webpages classification using images and text information based on multi-kernel learning

机译:基于多核学习的基于图像和文本信息的毒品相关网页分类

获取原文

摘要

In this paper, multi-kernel learning(MKL) is used for drug-related webpages classification. First, body text and image-label text are extracted through HTML parsing, and valid images are chosen by the FOCARSS algorithm. Second, text-based BOW model is used to generate text representation, and image-based BOW model is used to generate images representation. Last, text and images representation are fused with a few methods. Experimental results demonstrate that the classification accuracy of MKL is higher than those of all other fusion methods in decision level and feature level, and much higher than the accuracy of single-modal classification.
机译:本文将多核学习(MKL)用于毒品相关网页的分类。首先,通过HTML解析提取正文和图像标签文本,并通过FOCARSS算法选择有效图像。其次,基于文本的BOW模型用于生成文本表示,而基于图像的BOW模型用于生成图像表示。最后,将文本和图像表示与几种方法融合在一起。实验结果表明,在决策水平和特征水平上,MKL的分类精度高于所有其他融合方法,并且比单模态分类的精度高得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号