A Graph-based Clustering for Web Content Mining


Abstract

This paper introduces a clustering method in which the data to be clustered are represented by graphs rather than by vectors or other models. Specifically, we extend the classical k-means clustering algorithm to work with graphs that represent web documents. We model web document content as graphs because graphs retain information that simpler models often discard. Experiments comparing clustering with the traditional vector representation against our graph-based representations showed improvements in both clustering quality and execution time over the vector-based model.
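The abstract does not say how graphs are built, how they are compared, or how a cluster center is computed, so the following is only a minimal sketch of k-means over graphs under stated assumptions. The assumptions are: each unique term is a node and each pair of adjacent terms is a directed edge; distance is one minus the size of the common subgraph divided by the size of the larger graph; and the cluster center is the median graph (the member with the smallest total distance to the others). The names `doc_to_graph`, `graph_distance`, `median_graph`, and `graph_kmeans` are hypothetical and are not taken from the paper.

```python
import random


def doc_to_graph(text):
    """Build a (nodes, edges) pair: unique terms and adjacent-term edges (assumed model)."""
    words = text.lower().split()
    nodes = frozenset(words)
    edges = frozenset(zip(words, words[1:]))
    return nodes, edges


def graph_size(graph):
    """Size of a graph, counted here as nodes plus edges (an assumption)."""
    nodes, edges = graph
    return len(nodes) + len(edges)


def graph_distance(g1, g2):
    """Assumed distance: 1 - |common subgraph| / max(|g1|, |g2|).

    Node labels are unique, so the common subgraph is simply the
    shared nodes and the shared edges.
    """
    largest = max(graph_size(g1), graph_size(g2))
    if largest == 0:
        return 0.0
    common = len(g1[0] & g2[0]) + len(g1[1] & g2[1])
    return 1.0 - common / largest


def median_graph(graphs):
    """Return the member graph with the smallest total distance to all members."""
    return min(graphs, key=lambda g: sum(graph_distance(g, h) for h in graphs))


def graph_kmeans(graphs, k, max_iter=20, seed=0):
    """k-means loop with graph distance and median-graph centers."""
    rng = random.Random(seed)
    centers = rng.sample(graphs, k)
    labels = None
    for _ in range(max_iter):
        # Assignment step: nearest center under the graph distance.
        new_labels = [
            min(range(k), key=lambda c: graph_distance(g, centers[c]))
            for g in graphs
        ]
        if new_labels == labels:
            break  # assignments are stable, so the loop has converged
        labels = new_labels
        # Update step: replace each center with the median graph of its cluster.
        for c in range(k):
            members = [g for g, lab in zip(graphs, labels) if lab == c]
            if members:
                centers[c] = median_graph(members)
    return labels, centers


docs = [
    "web mining clusters web pages",
    "web mining groups web pages",
    "graph matching compares graph structure",
    "graph matching measures graph structure",
]
graphs = [doc_to_graph(d) for d in docs]
labels, centers = graph_kmeans(graphs, k=2)
print(labels)
```

Because the edges record which terms follow which, two documents with the same vocabulary but different word order get a nonzero distance here, which is the kind of information a bag-of-words vector would discard.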


Bibliographic details

Source
2010 Third Pacific-Asia Conference on Web Mining and Web-based Application | 2010 | pp. 150-158 | 9 pages
Conference location: Guilin (CN)
Authors
Ming Tingtang; Li Jun
Author affiliations

Network Information Center, Henan University, Kaifeng, China

Network Information Center, Henan University, Kaifeng, China

Original format: PDF
Text language: English
Chinese Library Classification: Computer networks
Keywords
Web Mining; Web Content Mining; Graph Similarity; Graph Matching

Date indexed: 2022-08-26 14:12:47

Similar literature

1. ODMM: An Ontology Based Deep Mining Method to Cluster the Content from Web Servers [J]. S. Ganesh Kumar, Dr. K. Vivekanandan. Journal of Theoretical and Applied Information Technology. 2015, no. 2
2. An Effective Fuzzy Clustering Algorithm for Web Document Classification: A Case Study in Cultural Content Mining [J]. George E. Tsekouras, Damianos Gavalas. International Journal of Software Engineering and Knowledge Engineering. 2013, no. 6
3. Improving Efficiency of Textual Static Web Content Mining Using Clustering Techniques [J]. R. Manikandan. Journal of Theoretical and Applied Information Technology. 2011, no. 2
4. A Graph-based Clustering for Web Content Mining [C]. Ming Tingtang, Li Jun. Pacific-Asia Conference on Web Mining and Web-based Application. 2010
5. Integrating Automatic Web Page Clustering into Web Log Association Mining [D]. Guo, Jiayun. 2005
6. Gracob: A Novel Graph-based Constant-column Biclustering Method for Mining Growth Phenotype Data [O]. Majed Alzahrani, Hiroyuki Kuwahara, Wei Wang
7. Integrating Web Content Clustering into Web Log Association Rule Mining [O]. Jiayun Guo, Qigang Gao. 2008
