An Incremental Subspace Learning Algorithm to Categorize Large Scale Text Data

Abstract

The dramatic growth in the number and size of on-line information sources has fueled increasing research interest in the incremental subspace learning problem. In this paper, we propose an incremental supervised subspace learning algorithm called the Incremental Inter-class Scatter (IIS) algorithm. Unlike traditional batch learners, IIS learns from a stream of training data rather than a fixed set. IIS overcomes the inherent problems of some other incremental approaches, such as the incremental versions of Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA). Experimental results on synthetic datasets show that IIS performs as well as LDA and is more robust against noise. In addition, experiments on the Reuters Corpus Volume 1 (RCV1) dataset show that IIS outperforms the state-of-the-art Incremental Principal Component Analysis (IPCA) algorithm, a related algorithm, and Information Gain in efficiency and effectiveness, respectively.

Bibliographic details

Source
Asia-Pacific Web Conference; March 29 to April 1, 2005; Shanghai (CN) | 2005 | pp. 52-63 | 12 pages
Conference location: Shanghai (CN)
Authors
Jun Yan; Qiansheng Cheng; Qiang Yang; Benyu Zhang
Author affiliation
LMAM, Department of Information Science, School of Mathematical Sciences, Peking University, Beijing, P.R. China 100871
Original format: PDF
Language: English
Chinese Library Classification: Computer networks

Similar literature

1. Semi-supervised text categorization: Exploiting unlabeled data using ensemble learning algorithms [J]. Mohammad Reza Keyvanpour, Maryam Bahojb Imani. Intelligent Data Analysis. 2013, no. 3
2. A Hybrid Learning Algorithm in Automated Text Categorization of Legacy Data [J]. Dali Wang, Ying Bai, David Hamblin. International Journal of Artificial Intelligence & Applications (IJAIA). 2019, no. 5
3. An incremental learning approach for the text categorization using hybrid optimization [J]. Mamta Kayest, Sanjay Kumar Jain. International Journal of Intelligent Computing and Cybernetics. 2019, no. 3
4. An Incremental Subspace Learning Algorithm to Categorize Large Scale Text Data [C]. Jun Yan, Qiansheng Cheng, Qiang Yang. Asia Pacific Conference on Web Technology. 2005
5. The CDM learning algorithm: An approach to learning for text categorization [D]. Goldberg, Jeffrey Lee. 1996
6. Performance of a Deep Neural Network Algorithm Based on a Small Medical Image Dataset: Incremental Impact of 3D-to-2D Reformation Combined with Novel Data Augmentation Photometric Conversion or Transfer Learning [O]. Vikash Gupta, Mutlu Demirer, Matthew Bigelow. 2020
7. An incremental subspace learning algorithm to categorize large scale text data [O]. Jun Yan, Qiansheng Cheng, Qiang Yang. 2005
