首页> 外文会议>IEEE Latin American Conference on Computational Intelligence >Latent semantic indexing and convolutional neural network for multi-label and multi-class text classification
【24h】

Latent semantic indexing and convolutional neural network for multi-label and multi-class text classification

机译:潜在语义索引和卷积神经网络用于多标签和多类文本分类

获取原文

摘要

The classification of a real text should not be necessarily treated as a binary or multi-class classification, since the text may belong to one or more labels. This type of problem is called multi-label classification. In this paper, we propose the use of latent semantic indexing to text representation, convolutional neural networks to feature extraction and a single multi layer perceptron for multi-label classification in real text data. The experiments show that the model outperforms state of the art techniques when the dataset has long documents, and we observe that the precision is poor when the size of the texts is small.
机译:由于文本可以属于一个或多个标签,因此不应将实际文本的分类视为二进制或多级分类。这种类型的问题称为多标签分类。在本文中,我们提出了使用潜在语义索引来文本表示,卷积神经网络,以特征提取和用于实际文本数据中的多标签分类的单层Perceptron。实验表明,当数据集具有长文档时,模型优于现有技术的状态,并且当文本的大小很小时,我们观察到精度差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号