A Text Classification Method with an Effective Feature Extraction Based on Category Analysis



Abstract

Text classification refers to determining the class of an unknown text, according to its content, within a given classification system. To express as much of the information in a text as possible with as few features as possible, the paper first analyzes the statistical properties of the various features and extracts global features according to Zipf's law. Then, based on a statistical analysis of the features' classification information, effective features are extracted by computing the contribution of each feature. After that, the traditional TF-IDF formula is improved using category frequencies; the resulting formula, named TF-IDF-CF, is used to calculate feature weights. Finally, the text classification method is proposed. The experimental results indicate that the feature extraction methods proposed in the paper are effective and that the TF-IDF-CF formula for calculating feature weights yields higher classification accuracy.
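The abstract names TF-IDF-CF but does not state its formula. As a rough illustration of how a category-frequency factor could extend TF-IDF, the Python sketch below assumes a weight of the form log(tf + 1) * log((N + 1) / df) * (df_in_class / docs_in_class). That form, the function name tf_idf_cf, and the toy documents are all assumptions made for illustration and are not taken from the abstract.

```python
import math
from collections import Counter


def tf_idf_cf(docs, labels):
    """Return one {term: weight} dict per tokenized document.

    Assumed weight (hypothetical form, not given in the abstract):
        log(tf + 1) * log((N + 1) / df) * (df_in_class / docs_in_class)
    """
    n_docs = len(docs)
    class_sizes = Counter(labels)  # label -> number of documents in that class
    df = Counter()                 # term -> number of documents containing it
    class_df = Counter()           # (label, term) -> documents of that class containing it
    for tokens, label in zip(docs, labels):
        for term in set(tokens):
            df[term] += 1
            class_df[(label, term)] += 1

    weights = []
    for tokens, label in zip(docs, labels):
        tf = Counter(tokens)       # raw term counts within this document
        weights.append({
            term: math.log(count + 1.0)
            * math.log((n_docs + 1.0) / df[term])
            * class_df[(label, term)] / class_sizes[label]
            for term, count in tf.items()
        })
    return weights


# Toy corpus (illustrative only): two "sport" documents and one "finance" document.
docs = [
    ["goal", "match", "team"],
    ["team", "goal", "goal"],
    ["stock", "market", "team"],
]
labels = ["sport", "sport", "finance"]
weights = tf_idf_cf(docs, labels)
```

In this sketch a term such as "goal", which appears in every document of its own class and in no other class, keeps its full TF-IDF weight, while a term that is rare within its class is scaled down by the category-frequency factor.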


Bibliographic record

Source: Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09 | 2009 | pp. 95-99 | 5 pages
Authors: Li Yun; Sheng Yan; Luan Luan; Chen Ling
Original format: PDF
Chinese Library Classification: Artificial intelligence theory
Keywords: Category Frequency; Feature Extraction; Feature Weight; Zipf Law

Similar literature

1. A Novel Feature Selection Method Based on Category Information Analysis for Class Prejudging in Text Classification [J]. Qiang Wang, Yi Guan, XiaoLong Wang, Zhiming Xu. International Journal of Computer Science and Network Security. 2006, No. 1A
2. Comparing multiple categories of feature selection methods for text classification [J]. Zheng Wanwan, Jin Mingzhe. Trends in Ecology & Evolution. 2020, No. 1
3. Comparing multiple categories of feature selection methods for text classification [J]. Zheng Wanwan, Jin Mingzhe. Digital Scholarship in the Humanities. 2020, No. 1
4. A Text Classification Method with an Effective Feature Extraction based on Category Analysis [C]. Yun Li, Yan Sheng, Luan Luan. International Conference on Fuzzy Systems and Knowledge Discovery. 2009
5. New covariance-based feature extraction methods for classification and prediction of high-dimensional data [D]. Sofolahan, Mopelola A. 2013
6. Sentimental text mining based on an additional features method for text classification [O]. Ching-Hsue Cheng, Hsien-Hsiu Chen
7. Effective and Extensible Feature Extraction Method Using Genetic Algorithm-Based Frequency-Domain Feature Search for Epileptic EEG Multi-classification [O]. Wen, Tingxi; Zhang, Zhongnan. 2017
