Journal: International Journal of Information Retrieval Research

Multi-View Meets Average Linkage: Exploring the Role of Metadata in Document Clustering

Abstract

Inspired by the success of the recently developed algorithm MVSC-I_R, the authors embed the idea of the Multi-Viewpoint Based Similarity Measure for Clustering (MVSC) into a hierarchical clustering method, namely average linkage clustering, to overcome the problem of initialization with random seeds. The result is a new algorithm, referred to as MVSC-HAC. The improved performance of this new algorithm encouraged the authors to further explore the impact of metadata on document clustering. In this paper, after reviewing two existing algorithms, the authors describe their new algorithm and present experimental results on data sets of various sizes at two different levels: one using the entire context of the documents and one using the documents' existing meta tags. The results show that MVSC-HAC excels at both levels. The authors analyze the results and discuss the role of metadata in document clustering, drawing on further observations.
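
The abstract gives no formulas, so the following is only a rough sketch of the general idea it names: a multi-viewpoint similarity plugged into average linkage clustering. It is not the authors' MVSC-HAC. The function names `mvs_matrix` and `average_linkage` are hypothetical, and the choice of viewpoints (every document other than the pair being compared) is an assumption made for illustration. Because the agglomerative procedure is deterministic, it needs no random seeds.

```python
import numpy as np

def mvs_matrix(X):
    """Pairwise multi-viewpoint similarity (illustrative assumption).

    For documents d_i and d_j, average (d_i - d_h) . (d_j - d_h)
    over all viewpoints d_h other than d_i and d_j. Requires >= 3 rows.
    """
    n = len(X)
    S = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            views = [h for h in range(n) if h not in (i, j)]
            diff_i = X[i] - X[views]   # d_i seen from each viewpoint
            diff_j = X[j] - X[views]   # d_j seen from each viewpoint
            S[i, j] = S[j, i] = np.mean(np.sum(diff_i * diff_j, axis=1))
    return S

def average_linkage(S, k):
    """Merge the two clusters with the highest mean pairwise
    similarity until only k clusters remain."""
    clusters = [[i] for i in range(len(S))]
    while len(clusters) > k:
        best, pair = -np.inf, None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                avg = S[np.ix_(clusters[a], clusters[b])].mean()
                if avg > best:
                    best, pair = avg, (a, b)
        a, b = pair
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return clusters

# Toy term vectors: documents 0 and 1 share one topic, 2 and 3 another.
docs = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
docs = docs / np.linalg.norm(docs, axis=1, keepdims=True)  # unit length
print(average_linkage(mvs_matrix(docs), k=2))
```

The same two functions could be run once on full-text vectors and once on vectors built only from meta tags, which mirrors the two levels of comparison the abstract describes.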