Hierarchical Approach to Select Feature Vectors for Classification of Text Documents

Abstract

The digital revolution that started over fifteen years ago is contributing to exponential growth in text documents, which appear in many forms such as web pages, emails, resumes, scientific reports, and digital archives. Developing techniques for automatic text document classification is therefore of great importance as a service to information consumers. Earlier text document classification techniques used 'keyword-based' features and related statistics to achieve good results. More recently, some of these techniques have been extended to include 'phrase-based' and 'concept-based' features to achieve better results. The majority of these techniques use a very large number of features extracted from the training set of documents. We present a hierarchical method for selecting a smaller number of quality features to improve classification efficiency.

Bibliographic record

Source
IEEE International Conference on Computer Systems and Applications | 2006 | 4 pages
Authors
Nagesh Kapalavayi; S. N. Jayaram Murthy; Gongzhu Hu
Original format: PDF
Chinese Library Classification: TP3-53

Similar literature

1. A Novel Approach for Ontology-Based Feature Vector Generation for Web Text Document Classification [J]. Mohamed K. Elhadad, Khaled M. Badran, Gouda I. Salama. International Journal of Software Innovation. 2018, Issue 1.
2. A Hierarchical Neural-Network-Based Document Representation Approach for Text Classification [J]. Zheng Jianming, Guo Yupu, Feng Chong. Mathematical Problems in Engineering. 2018, Issue pta4.
3. A Hierarchical Neural-Network-Based Document Representation Approach for Text Classification [J]. Jianming Zheng, Yupu Guo, Chong Feng. Mathematical Problems in Engineering: Theory, Methods and Applications. 2018, Issue 4.
4. Hierarchical Approach to Select Feature Vectors for Classification of Text Documents [C]. Nagesh Kapalavayi, S. N. Jayaram Murthy, Gongzhu Hu. IEEE International Conference on Computer Systems and Applications. 2006.
5. Support vector machine/regression feature selection with an application towards classification [D]. Halstead, John Brantley. 2005.
6. Computing symmetrical strength of N-grams: a two pass filtering approach in automatic classification of text documents [O]. Deepak Agnihotri, Kesari Verma, Priyanka Tripathi.
7. An Enhanced Hybrid Feature Selection Technique Using Term Frequency-Inverse Document Frequency and Support Vector Machine-Recursive Feature Elimination for Sentiment Classification [O]. Nur Syafiqah Mohd Nafis, Suryanti Awang. 2021.
