Published in: International Conference on Speech and Computer

An Approach to Automatic Summarization of Television Programs

Abstract

In this paper we present an approach to document summarization based on unsupervised techniques. We study how well these techniques suit documents that contain many topics of different durations, in our case the transcriptions of Spanish TV programs. The paper compares a classical Latent Semantic Analysis approach with a new proposal based on Latent Dirichlet Allocation. We also study the application of the summarization process to the individual segments obtained from a prior topic segmentation step. Topic segmentation is performed by considering distances between paragraphs, which are represented by continuous vectors obtained from the words they contain. Experiments were performed on several TV programs covering political and miscellaneous news.
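
The distance-based topic segmentation mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the continuous paragraph vectors are built, which distance is used, or how boundaries are chosen. The sketch assumes plain word-count vectors as a stand-in for the continuous vectors, cosine distance between adjacent paragraphs, and a fixed threshold. The names `paragraph_vector`, `cosine_distance`, `segment`, and the value `threshold=0.5` are hypothetical.

```python
import math
from collections import Counter


def paragraph_vector(text):
    """Word-count vector; an assumed stand-in for the paper's continuous vectors."""
    return Counter(text.lower().split())


def cosine_distance(a, b):
    """1 - cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0  # treat an empty paragraph as maximally distant
    return 1.0 - dot / (norm_a * norm_b)


def segment(paragraphs, threshold=0.5):
    """Start a new segment whenever adjacent paragraphs are farther apart
    than `threshold` (an assumed, hand-picked value)."""
    if not paragraphs:
        return []
    vectors = [paragraph_vector(p) for p in paragraphs]
    segments = [[paragraphs[0]]]
    for prev_vec, cur_vec, text in zip(vectors, vectors[1:], paragraphs[1:]):
        if cosine_distance(prev_vec, cur_vec) > threshold:
            segments.append([text])      # topic boundary: open a new segment
        else:
            segments[-1].append(text)    # same topic: extend the current one
    return segments


# Toy transcript with two topics (invented for illustration only).
paragraphs = [
    "the parliament voted on the budget law",
    "the budget law passed in the parliament",
    "the football team won the league final",
    "the league final drew a record football crowd",
]
segments = segment(paragraphs)
```

Each resulting segment could then be summarized on its own, which is the setting the abstract describes. In practice the threshold would need tuning, and stop words such as "the" would usually be removed before computing distances.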
