首页> 外文会议>Knowledge-Based Intelligent Information and Engineering Systems pt.3; Lecture Notes in Artificial Intelligence; 4253 >Evolutionary ANNs for Improving Accuracy and Efficiency in Document Classification Methods
【24h】

Evolutionary ANNs for Improving Accuracy and Efficiency in Document Classification Methods

机译:进化型人工神经网络,用于提高文档分类方法的准确性和效率

获取原文
获取原文并翻译 | 示例

摘要

Approaches to document classification belong to two major families: similarity-based (crisp) classification methods and neural networks (gradual) ones. For gradual techniques, a major open issue is controlling search space dimension. While similarity-based methods identify clusters based on the same number of variables used for document encoding, neural networks automatically identify variables that cause distinctions among clusters. Therefore, the variables' number may vary depending on the documents structure and content, and is difficult to estimate it a priori. This paper proposes a hybrid classification method suitable for heterogeneous document bases like the ones commonly encountered in business and knowledge management applications. Our method is based on an evolutionary algorithm for tuning both neural network's structure and weights. While searching the optimal neural network's configuration it is possible to determine the minimal number of variables to be used in order to classify the given set of documents.
机译:文档分类的方法属于两个主要家族:基于相似性的(清晰)分类方法和基于神经网络的(渐进式)分类方法。对于渐进式技术,一个主要的开放问题是控制搜索空间尺寸。尽管基于相似度的方法基于用于文档编码的变量数量相同的数量来识别聚类,但是神经网络会自动识别导致聚类之间存在差异的变量。因此,变量的数量可能会根据文档的结构和内容而有所不同,并且很难进行先验估计。本文提出了一种适用于异构文档库的混合分类方法,例如在业务和知识管理应用程序中经常遇到的文档库。我们的方法基于可同时调整神经网络结构和权重的进化算法。在搜索最佳神经网络的配置时,可以确定要使用的变量的最小数量,以便对给定的文档集进行分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号