首页> 外文会议>ACM international conference on Multimedia >Topic segmentation of news speech using word similarity
【24h】

Topic segmentation of news speech using word similarity

机译:基于词相似度的新闻演讲主题分割

获取原文

摘要

Conventional topic segmentation utilizes cosine measure as the similarity between consecutive passages. However, the cosine measure has a problem that it can not reflect the similarity unless exactly the same words are included in the passages. To solve this problem, in this paper, we propose a method to acquire the word similarity between different words from the input data directly and automatically by managing to collect the same topic sections. Further more, we propose a method to compute the passage similarity based on the word similarity. Finally we propose a method of topic segmentation based on the passage similarity in an unsupervised mode.

机译:

常规主题细分利用 cosine 度量作为连续段落之间的相似性。但是, cosine 度量标准存在一个问题,即除非段落中包含完全相同的单词,否则它无法反映相似性。为了解决这个问题,本文提出了一种方法,该方法通过管理收集相同的主题部分,从输入数据中直接自动自动获取不同单词之间的单词相似度。此外,我们提出了一种基于单词相似度来计算段落相似度的方法。最后,提出了一种基于段落相似度的无监督模式主题分割方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号