首页> 外文会议>ACM Annual Symposium on Applied Computing >An Optimized Approach for KNN Text Categorization using P-trees

【24h】

An Optimized Approach for KNN Text Categorization using P-trees

机译：使用P树的KNN文本分类的优化方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The importance of text mining stems from the availability of huge volumes of text databases holding a wealth of valuable information that needs to be mined. Text categorization is the process of assigning categories or labels to documents based entirely on their contents. Formally, it can be viewed as a mapping from the document space into a set of predefined class labels (aka subjects or categories); F: D→{C1, C2...Cn} where F is the mapping function, D is the document space and {C1, C2...Cn} is the set of class labels. Given an unlabeled document d, we need to find its class label, Ci, using the mapping function F where F(d) = Ci. In this paper, an optimized k-Nearest Neighbors (KNN) classifier that uses intervalization and the P-tree technology to achieve a high degree of accuracy, space utilization and time efficiency is proposed: As new samples arrive, the classifier finds the k nearest neighbors to the new sample from the training space without a single database scan.

机译：文本挖掘的重要性源于巨大的文本数据库的可用性，持有需要开采的有价值的有价值的信息。文本分类是将类别或标签分配给完全基于其内容的文档的过程。正式地，它可以被视为从文档空间的映射到一组预定义的类标签（AKA科目或类别）; f：d→{c1，c2 ... cn}其中f是映射函数，d是文档空间，{c1，c2 ... cn}是类标签集。鉴于未标记的文档D，我们需要使用其中f（d）= ci的映射函数f找到其类标签ci。在本文中，提出了一种优化的K-CORMATE邻居（KNN）分类器，其使用间隔化和P树技术实现高度精度，空间利用率和时间效率：随着新的样本到达，分类器找到最近的k 没有单一数据库扫描的训练空间，邻居到新的样本。

著录项

来源
《ACM Annual Symposium on Applied Computing 》|2004年||共5页
会议地点
作者
Imad Rahal; William Perrizo; Association for Computing Machinery(ACM); University of Cyprus;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术 ;
关键词
text categorization; P-trees; intervalization; k-Nearest Neighbors; KNN;

机译：文本分类;p树;区间化;k最近邻居;knn;

相似文献

外文文献
中文文献
专利

1. FSKNN: Multi-label text categorization based on fuzzy similarity and k nearest neighbors [J] . Jung-Yi Jiang, Shian-Chi Tsai, Shie-Jue Lee Expert Systems with Application . 2012 ,第3期

机译：FSKNN：基于模糊相似度和k个最近邻居的多标签文本分类
2. Multidass Boosting with Adaptive Group-Based kNN and Its Application in Text Categorization [J] . Lei La, Qiao Guo, Dequan Yang, Mathematical Problems in Engineering . 2012 ,第pta7期

机译：基于自适应组的kNN的多输入信号增强及其在文本分类中的应用
3. Using kNN model for automatic text categorization [J] . Guo GD, Wang H, Bell D, Soft computing: A fusion of foundations, methodologies and applications . 2006 ,第5期

机译：使用kNN模型进行自动文本分类
4. An Optimized Approach for KNN Text Categorization using P-trees [C] . Imad Rahal, William Perrizo Association for Computing Machinery(ACM) Annual Symposium on Applied Computing(SAC 2004) vol.1; 20040314-17; Nicosia(CY) . 2004

机译：使用P树的KNN文本分类的一种优化方法
5. Optimization of Word Embeddings in Text Categorization [D] . Lauren, Paula Amanda. 2018

机译：文本分类中词嵌入的优化
6. An Intelligent Parkinsons Disease Diagnostic System Based on a Chaotic Bacterial Foraging Optimization Enhanced Fuzzy KNN Approach [O] . Zhennao Cai, Jianhua Gu, Caiyun Wen, 2018

机译：基于混沌细菌觅食优化增强模糊KNN的智能帕金森病诊断系统
7. An kNN Model-based Approach and Its Application in Text Categorization [O] . Gongde Guo, Hui wang, David Bell, 2004

机译：基于kNN模型的方法及其在文本分类中的应用

An Optimized Approach for KNN Text Categorization using P-trees

摘要

著录项

相似文献

相关主题

期刊订阅