首页> 外文会议>International Conference on Electronics, Computer and Computation >A novel higher-order semantic kernel for text classification
【24h】

A novel higher-order semantic kernel for text classification

机译:一种新颖的文本分类高阶语义核

获取原文

摘要

In conventional text categorization algorithms, documents are symbolized as “bag of words” (BOW) with the fact that documents are supposed to be independent from each other. While this approach simplifies the models, it ignores the semantic information between terms of each document. In this study, we develop a novel method to measure semantic similarity based on higher-order dependencies between documents. We propose a kernel for Support Vector Machines (SVM) algorithm using these dependencies which is called Higher-Order Semantic Kernel. With the aim of presenting comparative performance of Higher-Order Semantic Kernel we performed many experiments not only with our algorithm but also with existing traditional first-order kernels such as Polynomial Kernel, Radial Basis Function Kernel, and Linear Kernel. The experiments using Higher-Order Semantic Kernel on several well-known datasets show that classification performance improves significantly over the first-order methods.
机译:在传统的文本分类算法中,文档被假定为“单词袋”(BOW),而文档被认为是彼此独立的。尽管此方法简化了模型,但它忽略了每个文档的术语之间的语义信息。在这项研究中,我们开发了一种基于文档之间更高阶依赖性的度量语义相似性的新方法。我们使用这些依赖关系为支持向量机(SVM)算法提出了一个内核,称为“高阶语义内核”。为了展示高阶语义内核的比较性能,我们不仅使用我们的算法,而且还使用多项式内核,径向基函数内核和线性内核等现有的传统一阶内核进行了许多实验。在几个著名的数据集上使用高阶语义内核进行的实验表明,分类性能比一阶方法显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号