A Significance-Based Graph Model for Clustering Web Documents


Abstract

Traditional document clustering techniques rely on single-term analysis, such as the widely used Vector Space Model. However, recent approaches have emerged that are based on Graph Models and provide a more detailed description of document properties. In this work we present a novel Significance-based Graph Model for Web documents that introduces a sophisticated graph weighting method, based on significance evaluation of graph elements. We also define an associated similarity measure based on the maximum common subgraph between the graphs of the corresponding web documents. Experimental results on artificial and real document collections using well-known clustering algorithms indicate the effectiveness of the proposed approach.
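The abstract does not give the model's formulas, so the following is only a minimal sketch of how a similarity based on the maximum common subgraph could look. It rests on assumptions that are not taken from the paper: each node is a unique term, a directed edge links each pair of adjacent terms, graph size is counted as nodes plus edges, and the significance-based weighting is left out entirely. The names `build_graph` and `mcs_similarity` are hypothetical.

```python
def build_graph(text):
    """Build a toy document graph: unique terms as nodes, adjacent-term pairs as directed edges."""
    terms = text.lower().split()
    nodes = set(terms)
    # Skip self-loops produced by a term immediately repeating itself.
    edges = {(a, b) for a, b in zip(terms, terms[1:]) if a != b}
    return nodes, edges


def mcs_similarity(g1, g2):
    """Return |mcs(g1, g2)| / max(|g1|, |g2|), where |g| = number of nodes + number of edges.

    Assumption: node labels are unique within a graph, so the maximum
    common subgraph reduces to the shared nodes and the shared edges.
    """
    (nodes1, edges1), (nodes2, edges2) = g1, g2
    common = len(nodes1 & nodes2) + len(edges1 & edges2)
    largest = max(len(nodes1) + len(edges1), len(nodes2) + len(edges2))
    return common / largest if largest else 0.0


doc_a = build_graph("graph model for web documents")
doc_b = build_graph("graph model for text documents")
# 4 shared nodes + 2 shared edges = 6, out of 9 elements in each graph.
print(mcs_similarity(doc_a, doc_b))
```

With unique node labels the general maximum common subgraph problem, which is NP-hard, collapses to set intersections, which is why this kind of graph representation stays practical for clustering. A weighted variant in the spirit of the abstract would replace the element counts with sums of significance scores.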


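Bibliographic record

Source: Hellenic Conference on Artificial Intelligence (AI) (SETN 2006); 18-20 May 2006; Heraklion (GR) | 2006 | pp. 516-519 | 4 pages
Conference location: Heraklion (GR)
Authors: Argyris Kalogeratos; Aristidis Likas
Author affiliation: Department of Computer Science, University of Ioannina, GR 45110, Ioannina, Greece
Conference organizer:
Original format: PDF
Text language: English
Chinese Library Classification: Artificial intelligence theory
Keywords: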

