首页> 外文期刊>高技术通讯(英文版) >Hierarchical clustering based on single-pass for breaking topic detection and tracking
【24h】

Hierarchical clustering based on single-pass for breaking topic detection and tracking

机译:基于单遍的层次聚类用于突破主题检测和跟踪

获取原文
获取原文并翻译 | 示例
       

摘要

Single-pass is commonly used in topic detection and tracking ( TDT) due to its simplicity, high efficiency and low cost.When dealing with large-scale data, time cost will increase sharply and clustering performance will be affected greatly.Aiming at this problem, hierarchical clustering algo-rithm based on single-pass is proposed, which is inspired by hierarchical and concurrent ideas to di-vide clustering process into three stages.News reports are classified into different categories firstly. Then there are twice single-pass clustering processes in the same category, and one agglomerative clustering among different categories.In addition, for semantic similarity in news reports, topic model is improved based on named entities.Experimental results show that the proposed method can effectively accelerate the process as well as improve the performance.
机译:单次通过由于其简单,高效和低成本而被广泛用于主题检测和跟踪(TDT)中,当处理大规模数据时,时间成本将急剧增加,并且聚类性能将受到极大影响。在此基础上,提出了基于单遍的聚类算法,将聚类过程分为三个阶段。本文首先将新闻分为不同的类别。然后在同一类别中有两次单次通过聚类过程,在不同类别中有一个聚集性聚类。此外,针对新闻报道中的语义相似性,基于命名实体对主题模型进行了改进,实验结果表明,该方法可以有效地解决问题。加快过程并提高性能。

著录项

  • 来源
    《高技术通讯(英文版)》 |2018年第4期|369-377|共9页
  • 作者单位

    School of Computers, Guangdong University of Technology, Guangzhou 510006, P.R.China;

    School of Software Engineering, South China University of Technology, Guangzhou 510006, P.R.China;

    School of Software Engineering, South China University of Technology, Guangzhou 510006, P.R.China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-19 04:27:22
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号