A high-performing comprehensive learning algorithm for text classification without pre-labeled training set

Lizhen Liu; Qianhui Liang

首页> 外文期刊>Knowledge and information systems >A high-performing comprehensive learning algorithm for text classification without pre-labeled training set

【24h】

A high-performing comprehensive learning algorithm for text classification without pre-labeled training set

机译：无需预先标记训练集的高性能文本分类综合学习算法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we investigate a comprehensive learning algorithm for text classification without pre-labeled training set based on incremental learning. In order to overcome the high cost in getting labeled training examples, this approach reforms fuzzy partition clustering to obtain a small quantity of labeled training data. Then the incremental learning of Bayesian classifier is applied. The model of the proposed classifier is composed of a Naive-Bayes-based incremental learning algorithm and a modified fuzzy partition clustering method. For improved efficiency, a feature reduction is designed based on the Quadratic Entropy in Mutual Information. We perform experiments to demonstrate the performance of the approach, and the results show that our approach is feasible and effective.

机译：在本文中，我们研究了一种基于增量学习的，无需预先标记的训练集的文本分类综合学习算法。为了克服获取带标签的训练样本的高成本，该方法对模糊分区聚类进行了改革，以获得少量的带标签的训练数据。然后应用贝叶斯分类器的增量学习。提出的分类器模型由基于朴素贝叶斯的增量学习算法和改进的模糊分区聚类方法组成。为了提高效率，基于互信息中的二次熵设计了特征缩减。我们进行实验以证明该方法的性能，结果表明我们的方法是可行和有效的。

著录项

来源
《Knowledge and information systems》 |2011年第3期|共12页
作者
Lizhen Liu; Qianhui Liang;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化系统理论;
关键词
Text classification; Clustering; Dimension reduction; Fuzzy clustering; Incremental learning;

机译：文本分类;聚类;降维;模糊聚类;增量学习;

相似文献

外文文献
中文文献
专利

1. A high-performing comprehensive learning algorithm for text classification without pre-labeled training set [J] . Lizhen Liu, Qianhui Liang Knowledge and information systems . 2011,第3期

机译：无需预先标记训练集的高性能文本分类综合学习算法
2. Positive Unlabeled Learning Algorithm for One Class Classification of Social Text Stream with only very few Positive Training Samples [J] . Abhinandan Vishwakarma Computer Engineering and Intelligent Systems . 2015,第3期

机译：仅有很少的积极训练样本的社会文本流的一类分类的正面无标签学习算法
3. Positive Unlabeled Learning Algorithm for One Class Classification of Social Text Stream with only very few Positive Training Samples [J] . Abhinandan Vishwakarma Journal of Economics and Sustainable Development . 2015,第3期

机译：仅有很少的积极训练样本的社会文本流的一类分类的正面无标签学习算法
4. Using Cluster-Based Sampling to Select Initial Training Set for Active Learning in Text Classification [C] . Jaeho Kang, Kwang Ryel Ryu, Hyuk-Chul Kwon Advances in Knowledge Discovery and Data Mining . 2004

机译：使用基于聚类的抽样选择初始训练集以进行文本分类中的主动学习
5. A Comparative Analysis of Selected Set of Natural Language Processing (NLP) and Machine Learning (ML) Algorithms for Clinical Coding using Clinical Classification Standards [D] . Kaur, Rajvir 2018

机译：使用临床分类标准对用于临床编码的自然语言处理（NLP）和机器学习（ML）算法的选择集进行比较分析
6. Teleconsultations between Patients and Healthcare Professionals in Primary Care in Catalonia: The Evaluation of Text Classification Algorithms Using Supervised Machine Learning [O] . Francesc López Seguí, Ricardo Ander Egg Aguilar, Gabriel de Maeztu, 2020

机译：加泰罗尼亚基层医疗机构的患者与医疗专业人员之间的远程咨询：使用监督机器学习的文本分类算法的评估
7. High-Performing Feature Selection for Text Classification [O] . Monica Rogati, Yiming Yang 2002

机译：文本分类的高性能特征选择

A high-performing comprehensive learning algorithm for text classification without pre-labeled training set

摘要

著录项

相似文献

相关主题

期刊订阅