首页> 外文会议>International Symposium on Distributed Computing and Artificial Intelligence >Improving Persian Text Classification and Clustering Using Persian Thesaurus
【24h】

Improving Persian Text Classification and Clustering Using Persian Thesaurus

机译:使用波斯语词库改善波斯文本分类和聚类

获取原文

摘要

This paper proposes an innovative approach to improve the classification performance of Persian texts. The proposed method uses a thesaurus as a helpful knowledge to obtain more representative word-frequencies in the corpus. Two types of word relationships are considered in our used thesaurus. This is the first attempt to use a Persian thesaurus in the field of Persian information retrieval. Experimental results indicate the performance of text classification improves significantly in the case of employing Persian thesaurus rather the case of ignoring Persian thesaurus.
机译:本文提出了一种提高波斯文本分类性能的创新方法。该方法使用叙述作为有用的知识,以获得语料库中的更多代表性的词汇。在我们使用的词库中考虑了两种类型的词关系。这是第一次尝试在波斯语信息检索领域使用波斯语词库。实验结果表明,在采用波斯词库的情况下,文本分类的表现显着提高了忽视波斯词库的情况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号