首页> 外文会议>International Conference on Intelligent Computing and Control Systems >High Dimensional Text Document Clustering and Classification using Machine Learning Methods
【24h】

High Dimensional Text Document Clustering and Classification using Machine Learning Methods

机译:使用机器学习方法的高维文本文档群集和分类

获取原文

摘要

Although High Dimensional documents are used for classification, document dimensions are a major concern and a worrying sign. The effects of Dimensional reduction can be both the positive and negative aspects. If a dimensional reduction is not in a correct format, classification using lower-dimensional documents will not produce the desired results. Our research studies centered on dimensional reduction using the fundamental similarity property, but they haven't yet discussed text classification. This approach discusses the classification and clustering task by providing various algorithms. We used clustering and classification methods on various datasets in this paper. It also suggested improving the efficiency of the K-means clustering and the Naive Bayes classification technique. Popular parameters like precision, recall, and F-score are used to evaluate the recommended method's performance. The results of the experiments would demonstrate that the proposed model outperforms current algorithms.
机译:虽然高维文档用于分类,但文件尺寸是一个主要关注和令人担忧的标志。尺寸减小的效果可以是正面和负面方面。如果尺寸减少不是正确的格式,则使用较低维文档的分类将不会产生所需的结果。我们的研究研究以基本的相似性为中心的维度减少,但他们还没有讨论过文本分类。该方法通过提供各种算法来讨论分类和聚类任务。我们在本文中使用了各种数据集的聚类和分类方法。它还建议提高K-Means聚类和朴素贝叶斯分类技术的效率。 Precision,Recall和F分数的流行参数用于评估推荐的方法的性能。实验结果将证明所提出的模型优于当前算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号