A Hybrid Feature Dimensionality Reduction Strategy for Text Classification

Wang Dong (王东)

Abstract

Feature dimensionality reduction has long been an important research topic in text classification, and an effective way to achieve it is to design efficient feature selection methods. Existing feature selection methods often mistakenly remove features with strong category-discriminating ability while retaining features with weak discriminating ability. To address this, the paper proposes a feature reduction strategy that first defines and quantifies the features, then builds a retained set of single-source features and forcibly removes the features common to all classes, and finally adjusts the weights of the multi-source features, so as to reduce the feature set and improve classification performance. Comparative experiments on the Reuters-21578 and NewsGroups corpora indicate that the strategy is effective and feasible.
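The three steps named in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's definitions or formulas, so everything here is an assumption: a "single-source" feature is taken to be a term that occurs in exactly one class, a "common" feature a term that occurs in every class, and a "multi-source" feature anything in between. The function names `partition_features` and `adjust_weight`, and the linear down-weighting rule, are hypothetical and are not the author's method.

```python
from collections import defaultdict


def partition_features(docs):
    """Split terms by the number of classes they occur in (assumed definitions).

    docs: iterable of (label, tokens) pairs.
    Returns (single_source, multi_source, common, n_classes), where
    multi_source maps each term to the number of classes containing it.
    """
    classes_of = defaultdict(set)  # term -> set of class labels containing it
    labels = set()
    for label, tokens in docs:
        labels.add(label)
        for term in set(tokens):
            classes_of[term].add(label)

    single_source, multi_source, common = set(), {}, set()
    for term, seen in classes_of.items():
        if len(seen) == 1:
            single_source.add(term)         # retained unconditionally
        elif len(seen) == len(labels):
            common.add(term)                # removed: occurs in every class
        else:
            multi_source[term] = len(seen)  # kept, but its weight is adjusted
    return single_source, multi_source, common, len(labels)


def adjust_weight(base_weight, n_term_classes, n_classes):
    """Hypothetical placeholder rule, NOT taken from the paper.

    Scales a weight down linearly as the term spreads over more classes.
    Intended only for multi-source terms, which implies n_classes >= 3.
    """
    return base_weight * (n_classes - n_term_classes) / (n_classes - 1)


# Toy corpus with three classes (illustrative data only).
docs = [
    ("grain", ["wheat", "price", "the"]),
    ("crude", ["oil", "price", "the"]),
    ("trade", ["tariff", "the"]),
]
single, multi, common, n_classes = partition_features(docs)
adjusted = {t: adjust_weight(1.0, k, n_classes) for t, k in multi.items()}
```

On this toy corpus, "the" falls into the common set and would be dropped, "wheat", "oil" and "tariff" are single-source and retained, and "price" is multi-source and has its weight reduced. Any real weighting scheme would need to follow the definitions given in the full paper.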

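Bibliographic details

Source: 《贵州师范学院学报》 (Journal of Guizhou Normal College) | 2012, No. 6 | pp. 6-10 | 5 pages
Author: Wang Dong (王东)
Affiliation: School of Mathematics and Computer Science, Guizhou Normal College, Guiyang, Guizhou 550018
Original format: PDF
Language of full text: Chinese (chi)
CLC classification: automated reasoning, machine learning
Keywords: text classification; single-source feature; multi-source feature; feature dimensionality reduction; feature selection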
