Document clustering using locality preserving indexing and support vector machines

Chengfu Yang; Zhang Yi

首页> 外文期刊>Soft Computing >Document clustering using locality preserving indexing and support vector machines

【24h】

Document clustering using locality preserving indexing and support vector machines

机译：使用局部性保留索引和支持向量机的文档聚类

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

A method of document clustering based on locality preserving indexing (LPI) and support vector machines (SVM) is presented. The document space is generally of high dimensionality, and clustering in such a high-dimensional space is often infeasible due to the curse of dimensionality. In this paper, by using LPI, the documents are projected into a lower-dimension semantic space in which the documents related to the same semantic are close to each other. Then, by using SVM, the vectors in semantic space are mapped by means of a Gaussian kernel to a high-dimensional feature space in which the minimal enclosing sphere is searched. The sphere, when mapped back to semantics space, can separate into several independent components by the support vectors, each enclosing a separate cluster of documents. By combining the LPI and SVM, not only higher clustering accuracies in a more unsupervised effective way, but also better generalization properties can be obtained. Extensive demonstrations are performed on the Reuters-21578 and TDT2 data sets.

机译：提出了一种基于局部保存索引（LPI）和支持向量机（SVM）的文档聚类方法。文档空间通常是高维的，并且由于维数的诅咒，在这样的高维空间中聚集通常是不可行的。在本文中，通过使用LPI，文档被投影到一个较低维的语义空间中，其中与相同语义相关的文档彼此接近。然后，通过使用SVM，借助高斯核将语义空间中的向量映射到高维特征空间，在高维特征空间中搜索最小的封闭球体。当球体映射回语义空间时，可以通过支持向量分成几个独立的组件，每个组件都包含一个单独的文档簇。通过将LPI和SVM结合在一起，不仅可以以更不受监督的有效方式获得更高的聚类精度，而且可以获得更好的泛化特性。在Reuters-21578和TDT2数据集上进行了广泛的演示。

著录项

来源
《Soft Computing》 |2008年第7期|677-683|共7页
作者
Chengfu Yang; Zhang Yi;
展开▼
作者单位

Computational Intelligence Laboratory School of Computer Science and Engineering University of Electronic Science and Technology of China Chengdu 610054 People’s Republic of China;

Computational Intelligence Laboratory School of Computer Science and Engineering University of Electronic Science and Technology of China Chengdu 610054 People’s Republic of China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Document clustering; Locality preserving indexing; Support vector machines; Gaussian kernel; Support vectors clustering;

机译：文档聚类;局部性索引;支持向量机;高斯核;支持向量聚类;

相似文献

外文文献
中文文献
专利

1. Document clustering using locality preserving indexing and support vector machines [J] . Yang CF, Yi Z Soft computing: A fusion of foundations, methodologies and applications . 2008,第7期

机译：使用局部性保留索引和支持向量机的文档聚类
2. Document clustering using locality preserving indexing [J] . Cai D., He X., Han J. IEEE Transactions on Knowledge and Data Engineering . 2005,第12期

机译：使用位置保留索引的文档聚类
3. Rotating Machine Fault Diagnosis Based on Locality Preserving Projection and Back Propagation Neural Network-Support Vector Machine Model [J] . Dong Shaojiang, Xu Xiangyang, Liu Juan, Measurement and Control: Journal of the Institute of Measurement and Control . 2015,第7期

机译：基于保局部投影和BP神经网络的旋转机械故障诊断-支持向量机模型
4. Degradation state recognition of ultrasonic motor based upon locality preserving projection and support vector machine optimized by fruit fly optimization algorithm [C] . Baiyan Chen, Hongru Li, Guoqing An 2017 2nd International Conference on Power and Renewable Energy . 2017

机译：基于果蝇优化算法的局部保留投影和支持向量机的超声电机退化状态识别
5. Clustering system and clustering support vector machine for local protein structure prediction. [D] . Zhong, Wei. 2006

机译：用于局部蛋白质结构预测的聚类系统和聚类支持向量机。
6. A Novel Support Vector Machine with Globality-Locality Preserving [O] . Cheng-Long Ma, Yu-Bo Yuan -1

机译：一种具有全局性和局部性的新型支持向量机
7. Document clustering using locality preserving indexing [O] . Deng Cai, Xiaofei He, Jiawei Han, 2005

机译：使用局部保留索引进行文档聚类

Document clustering using locality preserving indexing and support vector machines

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅