首页> 外文会议>Comparative evaluation of focused retrieval >An Iterative Clustering Method for the XML-Mining Task of the INEX 2010;
【24h】

An Iterative Clustering Method for the XML-Mining Task of the INEX 2010;

机译:一种用于INEX 2010 XML挖掘任务的迭代聚类方法;

获取原文
获取原文并翻译 | 示例

摘要

In this paper we propose two iterative clustering methods for grouping Wikipedia documents of a given huge collection into clusters. The recursive method clusters iteratively subsets of the complete collection. In each iteration, we select representative items for each group, which are then used for the next stage of clustering.The presented approaches are scalable algorithms which may be used with huge collections that in other way (for instance, using the classic clustering methods) would be computationally expensive of being clustered. The obtained results outperformed the random baseline presented in the INEX 2010 clustering task of the XML-Mining track.
机译:在本文中,我们提出了两种迭代聚类方法,用于将给定巨大集合的Wikipedia文档分组为聚类。递归方法将整个集合的子集迭代地聚类。在每次迭代中,我们为每个组选择代表项,然后将其用于聚类的下一阶段。本文提出的方法是可伸缩算法,可与其他方法(例如,使用经典聚类方法)的巨大集合一起使用群集将在计算上昂贵。获得的结果优于XML-Mining轨道的INEX 2010群集任务中提供的随机基线。

著录项

  • 来源
  • 会议地点 Vught(NL);Vught(NL)
  • 作者单位

    Benemerita Universidad Autonoma de Puebla, Mexico,Centro Nacional de Investigation y Desarrollo Tecnologico, Mexico;

    Instituto Tecnologico de Cerro Azul, Mexico,Centro Nacional de Investigation y Desarrollo Tecnologico, Mexico;

    Instituto Tecnologico de Tuxtla Gutierrez, Mexico,Centro Nacional de Investigation y Desarrollo Tecnologico, Mexico;

    Benemerita Universidad Autonoma de Puebla, Mexico;

    Benemerita Universidad Autonoma de Puebla, Mexico;

    Centro Nacional de Investigation y Desarrollo Tecnologico, Mexico;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 信息处理(信息加工);
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号