A Method of Text Feature Extraction Based on Weighted Scatter Difference

Abstract

Feature reduction is one of the core techniques in automatic text categorization. The scatter difference criterion categorizes poorly when the between-class distance is small and the class density is high. To address this problem, the paper proposes a weighting method based on the sample distribution: the between-class and within-class scatter matrices that exhibit poor scatter are weighted, which strengthens classification ability after dimensionality reduction and improves the dimensionality-reduction performance of linear feature extraction based on scatter difference. Experiments show that the method outperforms the original maximum scatter difference method in both precision and recall.
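
To make the criterion concrete, the following is a minimal sketch of scatter-difference feature extraction, assuming the usual formulation in which the projection directions are the leading eigenvectors of S_b - C * S_w. The abstract does not give the paper's weighting formula, so `class_weights` is a hypothetical per-class placeholder and not the authors' scheme; the function names and the balance parameter `C` are likewise illustrative assumptions.

```python
import numpy as np


def scatter_matrices(X, y, class_weights=None):
    """Return (S_b, S_w) for samples X (n x d) and class labels y.

    class_weights is a hypothetical {label: weight} map standing in for a
    distribution-based weighting; None means every class has weight 1,
    which yields the ordinary (unweighted) scatter matrices.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    n, d = X.shape
    mean_all = X.mean(axis=0)
    S_b = np.zeros((d, d))
    S_w = np.zeros((d, d))
    for c in np.unique(y):
        Xc = X[y == c]
        w = 1.0 if class_weights is None else class_weights.get(c, 1.0)
        # Between-class term: class prior times outer product of mean offsets.
        diff = (Xc.mean(axis=0) - mean_all)[:, None]
        S_b += w * (len(Xc) / n) * (diff @ diff.T)
        # Within-class term: scatter of samples around their class mean.
        centered = Xc - Xc.mean(axis=0)
        S_w += w * (centered.T @ centered) / n
    return S_b, S_w


def msd_projection(X, y, C=1.0, k=1, class_weights=None):
    """Return a d x k projection matrix from the scatter difference criterion.

    The columns are the eigenvectors of S_b - C * S_w with the k largest
    eigenvalues. No matrix inverse is needed, so a singular S_w is not a problem.
    """
    S_b, S_w = scatter_matrices(X, y, class_weights)
    eigvals, eigvecs = np.linalg.eigh(S_b - C * S_w)  # symmetric input
    order = np.argsort(eigvals)[::-1][:k]
    return eigvecs[:, order]
```

Reduced features would then be obtained as `X @ msd_projection(X, y, k=...)`. Raising the weight of a class whose samples are poorly separated increases its influence on both scatter matrices, which is the general idea the abstract describes; how the paper derives those weights from the sample distribution is not stated here.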

Bibliographic details

Source
2010 Second WRI Global Congress on Intelligent Systems | 2010 | pp. 83-86 | 4 pages
Authors
Haifeng Liu; Zhan Su; Zeqing Yao; Xueren Zhang
Original format: PDF
Chinese Library Classification: Artificial intelligence theory
Keywords
feature extraction; feature reduction; scatter difference; text classification

Similar literature

1. Weighted maximum scatter difference based feature extraction and its application to face recognition [J]. Xiaodong Li, Shumin Fei, Tao Zhang. Machine Vision and Applications, 2011, No. 3.
2. Text Extraction and Recognition from the Normal Images using MSER Feature Extraction and Text Segmentation Methods [J]. Nitin Sharma, Nidhi. Indian Journal of Science and Technology, 2017, No. 17.
3. Keyword Extraction From Chinese Text Based On Multidimensional Weighted Features [J]. Yang Jian. Journal of Digital Information Management, 2016, No. 3.
4. A Method of Text Feature Extraction Based on Weighted Scatter Difference [C]. Haifeng Liu, Zhan Su, Zeqing Yao, Xueren Zhang. WRI Global Congress on Intelligent Systems, 2010.
5. New covariance-based feature extraction methods for classification and prediction of high-dimensional data [D]. Sofolahan, Mopelola A. 2013.
6. Classification of Biomedical Texts for Cardiovascular Diseases with Deep Neural Network Using a Weighted Feature Representation Method [O]. Nizar Ahmed, Fatih Dilmaç, Adil Alpkocak. 2020.
7. A Rule-based Methodology and Feature-based Methodology for Effect Relation Extraction in Chinese Unstructured Text [O]. Wang Jingcheng. 2015.
