首页> 外文会议> >Discriminative category matching: efficient text classification for huge document collections
【24h】

Discriminative category matching: efficient text classification for huge document collections

机译:区分性类别匹配:高效的文本分类功能,可处理大量文档

获取原文

摘要

With the rapid growth of textual information available on the Internet, having a good model for classifying and managing documents automatically is undoubtedly important. When more documents are archived, new terms, new concepts and concept-drift will frequently appear Without a doubt, updating the classification model frequently, rather than using the old model for a very long period is absolutely essential. Here, the challenges are: a) obtain a high accuracy classification model; b) consume low computational time for both model training and operation; and c) occupy low storage space. However, none of the existing classification approaches could achieve all of these requirements. In this paper, we propose a novel text classification approach, called discriminative category matching, which could achieve all of the stated characteristics. Extensive experiments using two benchmarks and a large real-life collection are conducted. The encouraging results indicated that our approach is highly feasible.
机译:随着Internet上文本信息的迅速增长,拥有一个自动分类和管理文档的良好模型无疑是很重要的。当归档更多文档时,新术语,新概念和概念漂移将经常出现。毫无疑问,必须经常更新分类模型,而不是长时间使用旧模型,这绝对是必不可少的。这里的挑战是:a)获得高精度的分类模型; b)在模型训练和操作上都消耗较少的计算时间; c)占用较少的存储空间。但是,现有分类方法都无法满足所有这些要求。在本文中,我们提出了一种新颖的文本分类方法,称为判别类别匹配,它可以实现所有陈述的特征。进行了使用两个基准和大量现实生活的广泛实验。令人鼓舞的结果表明,我们的方法是高度可行的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号