VSMs with K-Nearest Neighbour to Categorise Arabic Text Data

机译：VSM与K-Collect邻居分类阿拉伯文本数据

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Text categorisation is a popular problem that has been studied extensively in the last four decades. This paper investigates different variations of vector space models (VSMs) and term weighting approaches using KNN algorithm. The base of our comparison in the experiments we conduct is the F1 evaluation measure. The Experimental results against different Arabic text categorisation data sets provide evidence that Dice and Jaccard Coefficient outperform the Cosine Coefficient approach with regards to F1 results, and the Dice-based TF.IDF achieves the highest average scores.

机译：文本分类是一个流行的问题，这在过去的四十年中已经过广泛研究过。本文研究了使用KNN算法的矢量空间模型（VSM）和术语加权方法的不同变化。我们在实验中的比较基础是F1评估措施。针对不同阿拉伯文文本分类数据集的实验结果提供了骰子和Jaccard系数优于余弦系数方法的证据，关于F1结果，基于骰子的TF.IDF实现了最高的平均分子。

著录项

来源
《World Congress on Engineering and Computer Science》|2008年||共4页
会议地点
作者
Fadi Thabtah; Wael Musa Hadi; Gaith Al-shammare;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 T-53;
关键词
Data sets; Data mining; Text categorization; Term weighting; VSM;

机译：数据集;数据挖掘;文本分类;术语加权;VSM;

相似文献

外文文献
中文文献
专利

1. Arabic Text Classification Using K-Nearest Neighbour Algorithm [J] . Alhutaish Roiss, Omar Nazlia The international arab journal of information technology . 2015,第2期

机译：使用最近邻算法的阿拉伯文本分类
2. Arabic Text Categorization Using Improved k-Nearest neighbour Algorithm [J] . KHALED Wail Hamood, AL-SARRAYRIH Haytham Saleem, KNIPPING Lars Journal of Applied Computer Science & Mathematics . 2014,第3期

机译：使用改进的k最近邻算法的阿拉伯文本分类
3. An intelligent scheme for categorising fault events in compensated power network using K-nearest neighbour technique [J] . Sunil Kumar Singh, D.N. Vishwakarma, R.K. Saket International journal of Power and energy conversion . 2020,第4期

机译：使用K-Collect邻邻技术对补偿电网故障事件进行分类的智能方案
4. VSMs with K-Nearest Neighbour to Categorise Arabic Text Data [C] . Fadi Thabtah, Wael Musa Hadi, Gaith Al-shammare World Congress on Engineering and Computer Science . 2008

机译：VSM与K-Collect邻居分类阿拉伯文本数据
5. Categorisation of Arabic Twitter Text [D] . Altamimi, Mohammed Hamed R. 2020

机译：阿拉伯语推特文本的分类
6. Love Thy Neighbour: Automatic Animal Behavioural Classification of Acceleration Data Using the K-Nearest Neighbour Algorithm [O] . Owen R. Bidder, Hamish A. Campbell, Agustina Gómez-Laich, 2010

机译：爱你的邻居：使用K最近邻居算法对加速度数据进行自动动物行为分类
7. Arabic Text Categorization Using Improved k-Nearest neighbour Algorithm [O] . Wail Hamood KHALED, Haytham Saleem AL-SARRAYRIH, Lars KNIPPING 2014

机译：使用改进的k-最近邻算法的阿拉伯语文本分类

VSMs with K-Nearest Neighbour to Categorise Arabic Text Data

摘要

著录项

相似文献

相关主题

期刊订阅