【24h】

Using Wavelets to Classify Documents

机译:使用小波来对文档进行分类

获取原文

摘要

Currently, Fourier and cosine discrete transformations are used to classify documents. This article proposes a new strategy that uses wavelets in the representation and reduction of data text. Wavelets have been extensively used for dimensionality reduction in the field of signal processing. In this work, we show that a text document, after being subjected to a simple process of reorganization of its terms, can be treated like a signal and analyzed by signal processing tools. We demonstrate that this new representation is able to describe the most relevant features of documents in a synthetic representation and this new perspective improves the performance of the classification algorithm.
机译:目前,傅里叶和余弦离散转换用于对文档进行分类。本文提出了一种新的策略,它在表示和减少数据文本中使用小波。小波已广泛用于信号处理领域的维度降低。在这项工作中,我们显示文本文档,在经过简单的重组过程之后,可以像信号一样对待并通过信号处理工具进行分析。我们证明,这种新的代表能够在合成表示中描述文档的最相关的功能,并且这种新的视角提高了分类算法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号