Text Categorization Based on Fuzzy Soft Set Theory

Abstract

In this paper, we propose a new method for text categorization based on fuzzy soft set theory, called the fuzzy soft set classifier (FSSC). We use a fuzzy soft set representation derived from the bag-of-words representation, and define each term as a distinct word in the set of words of the document collection. The FSSC categorizes each document using the fuzzy c-means formula for classification, and uses fuzzy soft set similarity to measure the distance between two documents. We perform experiments on the standard Reuters-21578 dataset with three kinds of weighting (boolean, term frequency, and term frequency-inverse document frequency) to compare the performance of FSSC with four other classifiers: kNN, Bayesian, Rocchio, and SVM. We use precision, recall, F-measure, return-size, and running time for performance evaluation. The results show that there is no absolute winner. The FSSC has lower precision, recall, and F-measure than SVM and kNN, but runs faster than both. Compared with the Bayesian and Rocchio classifiers, the FSSC runs more slowly but achieves higher precision and F-measure.
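
The abstract does not give the FSSC formulas, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' implementation. It assumes that membership degrees are term frequencies scaled by the document's largest term count, that each class prototype is the component-wise mean of its training vectors (a stand-in for the fuzzy c-means center formula), and that similarity is 1 - sum|a_i - b_i| / sum(a_i + b_i), one measure that has been used for fuzzy soft sets. All function names and the toy data are hypothetical.

```python
from collections import Counter

def tf_memberships(tokens, vocabulary):
    """Map a tokenized document to membership degrees in [0, 1].

    Assumption: term frequency divided by the largest term count in the document.
    """
    counts = Counter(t for t in tokens if t in vocabulary)
    peak = max(counts.values(), default=0)
    return [counts[term] / peak if peak else 0.0 for term in vocabulary]

def class_prototype(vectors):
    """Component-wise mean of one class's membership vectors (assumed prototype)."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]

def similarity(a, b):
    """Assumed measure: 1 - sum|a_i - b_i| / sum(a_i + b_i)."""
    total = sum(x + y for x, y in zip(a, b))
    if total == 0:
        return 0.0
    return 1.0 - sum(abs(x - y) for x, y in zip(a, b)) / total

def classify(tokens, prototypes, vocabulary):
    """Assign the label whose prototype is most similar to the document."""
    vec = tf_memberships(tokens, vocabulary)
    return max(prototypes, key=lambda label: similarity(vec, prototypes[label]))

# Toy data, invented for illustration only (not taken from Reuters-21578).
vocabulary = ["wheat", "grain", "oil", "crude"]
training = {
    "grain": [["wheat", "grain", "wheat"], ["grain", "wheat"]],
    "crude": [["oil", "crude", "oil"], ["crude", "oil"]],
}
prototypes = {
    label: class_prototype([tf_memberships(doc, vocabulary) for doc in docs])
    for label, docs in training.items()
}
predicted = classify(["crude", "oil", "prices"], prototypes, vocabulary)
print(predicted)
```

Because each class is reduced to a single prototype, classifying a document costs one similarity computation per class rather than one per training document, which is consistent with the reported speed advantage over kNN.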

Bibliographic details

Source: International Conference on Computational Science and Its Applications, 2012, pp. 340-352 (13 pages)
Authors: Bana Handaga; Mustafa Mat Deris
Original format: PDF
Keywords: Fuzzy soft set theory; bag-of-words; Text Classification

Similar literature

1. Developing soft sensors using hybrid soft computing methodology: a neurofuzzy system based on rough set theory and genetic algorithms [J]. Jian Xu Luo, Hui He Shao. Soft Computing. 2006, No. 1
2. Developing soft sensors using hybrid soft computing methodology: a neurofuzzy system based on rough set theory and genetic algorithms [J]. Luo JX, Shao HH. Soft computing: A fusion of foundations, methodologies and applications. 2006, No. 1
3. The Parameter Reduction of Fuzzy Soft Sets Based on Soft Fuzzy Rough Sets [J]. Zhiming Zhang. Advances in fuzzy systems. 2013, No. 5
4. Text Categorization Based on Fuzzy Soft Set Theory [C]. Bana Handaga, Mustafa Mat Deris. International Conference on Computational Science and Its Applications. 2012
5. Content-based image retrieval based on fuzzy sets theory and learning automaton [D]. Sule-koiki, Adedokun W. 2005
6. A Method for Fuzzy Soft Sets in Decision Making Based on Grey Relational Analysis and D-S Theory of Evidence: Application to Medical Diagnosis [O]. Ningxin Xie, Guoqiu Wen, Zhaowen Li. 2014
7. Valuation Fuzzy Soft Sets: A Flexible Fuzzy Soft Set Based Decision Making Procedure for the Valuation of Assets [O]. José Carlos R. Alcantud, Salvador Cruz Rambaud, María J. Muñoz Torrecillas. 2017
