Algorithms of Nonlinear Document Clustering Based on Fuzzy Multiset Model

Kiyotaka Mizutani; Ryo Inokuchi; Sadaaki Miyamoto

Abstract

A fuzzy multiset is applicable as a model of information retrieval because its mathematical structure expresses both the number of occurrences of an element and its degrees of membership simultaneously. Fuzzy multisets can therefore also serve as a suitable model for document clustering. This paper aims to develop clustering algorithms for documents based on a fuzzy multiset model. The standard proximity measure, the cosine correlation, is generalized to the multiset model, and two nonlinear clustering techniques are applied to existing clustering methods. The first introduces a variable that controls cluster volume sizes; the second is the kernel trick used in support vector machines. Clustering by competitive learning is also studied. When the kernel trick is used, the classification configuration of the data in the high-dimensional feature space is visualized by self-organizing maps. Two numerical examples, one using artificial data and one using real document data, are shown, and the effects of the proposed methods are discussed.
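
The abstract does not spell out the generalized cosine correlation, so the following is only a minimal sketch under stated assumptions, not the authors' definition. It assumes a document is a mapping from each term to a list of membership grades (one grade per occurrence), that grades are sorted in decreasing order before comparison, and that the inner product pairs the sorted grades position by position, treating missing positions as zero. The names `normalize`, `inner`, and `cosine` are hypothetical.

```python
import math


def normalize(doc):
    """Sort each term's membership grades in decreasing order (assumed standard form)."""
    return {term: sorted(grades, reverse=True) for term, grades in doc.items() if grades}


def inner(a, b):
    """Inner product of two normalized fuzzy multisets.

    Grades are paired position by position; zip() stops at the shorter
    sequence, which is equivalent to padding the shorter one with zeros.
    """
    total = 0.0
    for term in a.keys() & b.keys():
        total += sum(x * y for x, y in zip(a[term], b[term]))
    return total


def cosine(a, b):
    """Cosine correlation between two fuzzy multisets (illustrative assumption)."""
    a, b = normalize(a), normalize(b)
    norm_a = math.sqrt(inner(a, a))
    norm_b = math.sqrt(inner(b, b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty document is treated as similar to nothing
    return inner(a, b) / (norm_a * norm_b)


# Two toy documents: "fuzzy" occurs twice in doc1 with grades 0.9 and 0.5.
doc1 = {"fuzzy": [0.5, 0.9], "cluster": [1.0]}
doc2 = {"fuzzy": [0.5], "kernel": [0.8]}
similarity = cosine(doc1, doc2)
```

With crisp grades (all equal to 1) and at most one occurrence per term, this reduces to the ordinary cosine measure on binary term vectors, which is the sense in which the sketch generalizes the standard measure.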


Bibliographic details

Source: International Journal of Intelligent Systems, 2008, No. 2, pp. 176-198
Authors: Kiyotaka Mizutani; Ryo Inokuchi; Sadaaki Miyamoto
Original format: PDF
Language: English

Similar literature

1. Reformulated query-based document retrieval using optimised kernel fuzzy clustering algorithm [J]. Gowthul Alam M. M., Baulkani S. International Journal of Business Intelligence and Data Mining. 2017, No. 3
2. A partitioning based algorithm to fuzzy co-cluster documents and words [J]. William-Chandra Tjhi, Lihui Chen. Pattern Recognition Letters. 2006, No. 3
3. Improved Type2-NPCM Fuzzy Clustering Algorithm Based on Adaptive Particle Swarm Optimization for Takagi-Sugeno Fuzzy Modeling Identification [J]. Lassad Houcine, Mohamed Bouzbida, Abdelkader Chaari. International Journal of Fuzzy Systems. 2020, No. 6
4. Fuzzy Multiset Model and Methods of Nonlinear Document Clustering for Information Retrieval [C]. Sadaaki Miyamoto, Kiyotaka Mizutani. International Conference on Modeling Decisions for Artificial Intelligence (MDAI 2004); 2004-08-02 to 2004-08-04; Barcelona; ES. 2004
5. A hybrid algorithm and its applications to fuzzy logic modeling of nonlinear systems [D]. Wang, Zhongjun. 2007
6. A Local Neighborhood Robust Fuzzy Clustering Image Segmentation Algorithm Based on an Adaptive Feature Selection Gaussian Mixture Model [O]. Hang Ren, Taotao Hu. 2020
7. Algorithms of nonlinear document clustering based on fuzzy multiset model [O]. Mizutani Kiyotaka, Inokuchi Ryo, Miyamoto Sadaaki. 2008
