首页> 外文会议>IEEE International Conference on Multimedia and Expo >Web topic detection using a ranked clustering-like pattern across similarity cascades
【24h】

Web topic detection using a ranked clustering-like pattern across similarity cascades

机译:在相似级联中使用排序的类聚模式进行Web主题检测

获取原文

摘要

In multi-media and social media communities, web topic detection poses two main difficulties that conventional approaches can barely handle: 1) there are large inter-topic variations among web topics; 2) supervised information is rare to identify the real topics. In this paper, we address these problems from the similarity diffusion perspective among objects on web, and present a clustering-like pattern across similarity cascades (SCs). SCs are a series of subgraphs generated by truncating a weighted graph with a set of thresholds, and then maximal cliques are used to describe the topic candidates. Poisson deconvolution is adopted to efficiently identify the real topics from these topic candidates. Experiments demonstrate that our approach outperforms the state-of-the-arts on two datasets. In addition, we report accuracy v.s. false positives per topic (FPPT) curves for performance evaluation. To our knowledge, this is the first complete evaluation of web topic detection at the topic-wise level, and it establishes a new benchmark for this problem.
机译:在多媒体和社交媒体社区中,Web主题检测带来了传统方法几乎无法解决的两个主要困难:1)Web主题之间存在较大的主题间差异; 2)监督信息很少能识别出真实的话题。在本文中,我们从网络对象之间的相似度扩散角度解决了这些问题,并提出了跨相似度级联(SC)的类聚模式。 SC是通过将具有一组阈值的加权图截断而生成的一系列子图,然后使用最大集团来描述主题候选者。采用泊松反卷积可从这些候选主题中有效识别出真实主题。实验表明,我们的方法优于两个数据集上的最新技术。此外,我们报告的准确性与每个主题的误报(FPPT)曲线用于绩效评估。据我们所知,这是对主题级别的Web主题检测的首次完整评估,它为该问题建立了新的基准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号