首页> 外文期刊>Information Sciences: An International Journal >Automatic text categorization based on content analysis with cognitive situation models
【24h】

Automatic text categorization based on content analysis with cognitive situation models

机译:基于内容分析和认知情况模型的自动文本分类

获取原文
获取原文并翻译 | 示例
           

摘要

Text categorization is an important research area of text mining. The original purpose of text categorization is to recognize, understand and organize different types of texts or documents. The general categorization approaches are treated as supervised learning, which infers similarity among a collection of categorized texts for training purposes. The existing categorization approaches are obviously not content-oriented and constrained at single word level. This paper introduces an innovative content-oriented text categorization approach named as CogCate. Inspired by cognitive situation models, CogCate exploits a human cognitive procedure in categorizing texts. In addition to traditional statistical analysis at word level, CogCate also applies lexical/semantical analysis. which ensures the accuracy of categorization. The evaluation experiments have testified the performance of CogCate. Meanwhile, CogCate remarkably reduces the time and effort spent on software training and maintenance of text collections. Our research work attests that interdisciplinary research efforts benefit text categorization.
机译:文本分类是文本挖掘的重要研究领域。文本分类的最初目的是识别,理解和组织不同类型的文本或文档。一般的分类方法被视为监督学习,可以推断出出于培训目的的分类文本集合之间的相似性。现有的分类方法显然不是面向内容的,并且仅限于单个单词级别。本文介绍了一种创新的面向内容的文本分类方法,称为CogCate。受认知情况模型的启发,CogCate在文本分类中利用了人类的认知程序。除了传统的词级统计分析外,CogCate还应用词法/语义分析。这样可以确保分类的准确性。评估实验证明了CogCate的性能。同时,CogCate显着减少了软件培训和文本收集维护所花费的时间和精力。我们的研究工作证明,跨学科的研究工作有益于文本分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号