...
首页> 外文期刊>Fuzzy Systems, IEEE Transactions on >Multilabel Text Categorization Based on Fuzzy Relevance Clustering
【24h】

Multilabel Text Categorization Based on Fuzzy Relevance Clustering

机译:基于模糊关联聚类的多标签文本分类

获取原文
获取原文并翻译 | 示例
           

摘要

We propose a fuzzy based method for multilabel text classification in which a document can belong to one or more than one category. In text categorization, the number of the involved features is usually huge, causing the curse of the dimensionality problem. Besides, a category can be a nonconvex region, which is a union of several overlapping or disjoint subregions. An automatic classification system, thus, may suffer from large memory requirements or poor performance. By incorporating fuzzy techniques, our proposed method can overcome these issues. A fuzzy relevance measure is adopted to transform high-dimensional documents to low-dimensional fuzzy relevance vectors to avoid the curse of dimensionality problem. A clustering technique is used to divide the relevance space into a collection of subregions which are then combined to make up individual categories. This allows complex and nonconvex regions to be created. A number of experiments are presented to show the effectiveness of the proposed method in both performance and speed.
机译:我们提出了一种基于模糊的多标签文本分类方法,其中文档可以属于一个或多个类别。在文本分类中,所涉及特征的数量通常很大,从而引起了尺寸问题的诅咒。此外,类别可以是非凸区域,它是几个重叠或不相交的子区域的并集。因此,自动分类系统可能会遇到内存需求大或性能差的问题。通过结合模糊技术,我们提出的方法可以克服这些问题。采用模糊关联度量将高维文档转换为低维模糊关联向量,避免了维数问题的困扰。使用聚类技术将相关空间划分为子区域的集合,然后将这些子区域组合起来以构成各个类别。这允许创建复杂和非凸区域。大量实验表明,该方法在性能和速度上均有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号