Document Clustering Using Incremental and Pairwise Approaches

Abstract

This paper presents the experiments and results of an approach to clustering the large Wikipedia dataset in the INEX 2007 Document Mining Challenge. The approach combines an incremental clustering method with a pairwise clustering method. It makes the clustering task feasible on a large dataset by first using the incremental method to reduce the dataset to a smaller set of clusters whose number is not fixed in advance. This reduced dataset is then clustered into the required number of clusters using the pairwise method. In this way, the large document collection is clustered successfully, and the resulting clustering solution is accurate.
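The two-stage idea in the abstract can be illustrated with a small sketch. The abstract does not give the authors' algorithms, so everything below is an assumption made for illustration: the incremental stage is a generic single-pass "leader" scheme with a cosine-similarity `threshold`, the pairwise stage is a generic merge of the most similar pair of clusters until `k` remain, and the function names, the bag-of-words representation, and the toy documents are all hypothetical.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(w * b.get(t, 0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def incremental_pass(docs, threshold):
    """Single pass: join the most similar existing cluster or open a new one.

    The number of clusters is not fixed in advance; it follows from
    `threshold` (an assumed parameter, not taken from the paper).
    Each cluster is a pair (summed term counts, member indices).
    """
    clusters = []
    for i, doc in enumerate(docs):
        vec = Counter(doc.split())
        best, best_sim = None, threshold
        for c in clusters:
            sim = cosine(vec, c[0])
            if sim >= best_sim:
                best, best_sim = c, sim
        if best is None:
            clusters.append((vec, [i]))
        else:
            best[0].update(vec)  # add term counts to the cluster summary
            best[1].append(i)
    return clusters


def pairwise_pass(clusters, k):
    """Merge the most similar pair of clusters until only k remain."""
    clusters = [(Counter(v), list(m)) for v, m in clusters]  # work on copies
    while len(clusters) > k:
        _, i, j = max(
            (cosine(clusters[i][0], clusters[j][0]), i, j)
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        )
        vec_j, members_j = clusters.pop(j)  # j > i, so index i stays valid
        clusters[i][0].update(vec_j)
        clusters[i][1].extend(members_j)
    return [sorted(m) for _, m in clusters]


# Toy documents (hypothetical, not from the INEX 2007 collection).
docs = [
    "xml retrieval xml index",
    "xml index retrieval",
    "xml schema structure tree",
    "schema structure tree tags",
    "football match goal",
    "football goal score",
]
stage_one = incremental_pass(docs, threshold=0.5)
stage_two = pairwise_pass(stage_one, k=2)
print(len(stage_one), stage_two)
```

The expensive all-pairs comparison runs only over the clusters produced by the first stage, not over every document, which is the reason for chaining the two stages in a sketch like this.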

Bibliographic details

Source: Focused Access to XML Documents | 2007 | pp. 222-233 | 12 pages
Conference location: Dagstuhl Castle (DE)
Authors: Tien Tran; Richi Nayak; Peter Bruza
Original format: PDF
Language: English
Chinese Library Classification: Information processing
Keywords: clustering; structure; content; XML; INEX 2007
Indexed: 2022-08-26 14:06:49

