Semantic Based Features Selection and Weighting Method for Text Classification

机译：基于语义的特征选择和文本分类的加权方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Feature selection and weighting is of vital concern in text classification process which improves the efficiency and accuracy of text classifier. Vector Space Model is used to represent the documents using "Bag of Word" BOW model with term weighting phenomena. Documents representation through this model has some limitations that are, ignoring term dependencies, structure and ordering of the terms in documents. To overcome this problem, Semantics Base Feature Vector using Part of Speech (POS), is proposed, which is used to extract the concept of terms using WordNet, co-occurring and associated terms. The proposed method is applied on small documents dataset which shows that this method outperforms then term frequency/inverse document frequency (TF-IDF) with BOW feature selection method for text classification.

机译：特征选择和加权在文本分类过程中是至关重要的，这提高了文本分类器的效率和准确性。矢量空间模型用于使用具有术语加权现象的“一词”弓形模型来代表文件。通过此模型的文档表示具有一些限制，即忽略文档中术语的阶段依赖性，结构和排序。为了克服这个问题，提出了使用部分语音（POS）的语义基本特征向量，用于使用Wordnet，共同发生和关联的术语提取术语的概念。所提出的方法应用于小型文件数据集，该方法表明该方法始终呈现术语频率/逆文档频率（TF-IDF），具有用于文本分类的弓形特征选择方法。

著录项

来源
《International Symposium on Information Technology》|2010年||共6页
会议地点
作者
Aurangzeb khan; Baharum Baharudin; Khairullah khan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 G202-53;
关键词
Feature selection; Text classification; POS; Feature vector;

机译：功能选择;文本分类;pos;特征向量;
入库时间 2022-08-21 04:19:58

相似文献

外文文献
中文文献
专利

1. A Novel Feature Selection Method Based on Probability Latent Semantic Analysis for Chinese Text Classification [J] . ZHONG Jiang, SUN Qigan, LI Xue, 电子学报：英文版 . 2011,第002期

机译：基于概率潜在语义分析的中文文本分类新特征选择方法
2. Kernel Sparse Feature Selection Based on Semantics in Text Classification [J] . Zhantao Deng, Guyu Hu, Zhisong Pan, Information Technology Journal . 2012,第3期

机译：基于语义的文本分类中的核稀疏特征选择
3. Kernel Sparse Feature Selection Based on Semantics in Text Classification [J] . Zhantao Deng, Guyu Hu, Zhisong Pan, Information Technology Journal . 2012,第3期

机译：基于语义的文本分类中的核稀疏特征选择
4. Semantic Based Features Selection and Weighting Method for Text Classification [C] . Aurangzeb khan, Baharum Baharudin, Khairullah khan International Symposium on Information Technology . 2010

机译：基于语义的特征选择和文本分类的加权方法
5. Statistical model-based methods for observation selection in wireless sensor networks and for feature selection in classification. [D] . Qi, Qi. 2012

机译：基于统计模型的方法用于无线传感器网络中的观察选择和分类中的特征选择。
6. Sentimental text mining based on an additional features method for text classification [O] . Ching-Hsue Cheng, Hsien-Hsiu Chen -1

机译：基于附加特征方法的情感文本挖掘
7. A New Feature Selection Method for Text Classification Based on Independent Feature Space Search [O] . Yong Liu, Shenggen Ju, Junfeng Wang, 2020

机译：基于独立特征空间搜索的文本分类的新功能选择方法

Semantic Based Features Selection and Weighting Method for Text Classification

摘要

著录项

相似文献

相关主题

期刊订阅