...
【24h】

Latent topic model for audio retrieval

机译:音频检索的潜在主题模型

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Latent topic model such as Latent Dirichlet Allocation (LDA) has been designed for text processing and has also demonstrated success in the task of audio related processing. The main idea behind LDA assumes that the words of each document arise from a mixture of topics, each of which is a multinomial distribution over the vocabulary. When applying the original LDA to process continuous data, the wordlike unit need be first generated by vector quantization (VQ). This data discretization usually results in information loss. To overcome this shortage, this paper introduces a new topic model named Gaussian-LDA for audio retrieval. In the proposed model, we consider continuous emission probability, Gaussian instead of multinomial distribution. This new topic model skips the vector quantization and directly models each topic as a Gaussian distribution over audio features. It avoids discretization by this way and integrates the procedure of clustering. The experiments of audio retrieval demonstrate that Gaussian-LDA achieves better performance than other compared methods.
机译:潜在主题模型(例如潜在Dirichlet分配(LDA))已设计用于文本处理,并且在音频相关处理的任务中也得到了证明。 LDA的主要思想是假设每个文档的单词都来自主题的混合体,每个主题都是词汇表上的多项式分布。将原始LDA应用于连续数据处理时,首先需要通过矢量量化(VQ)生成单词单元。这种数据离散化通常会导致信息丢失。为了克服这种不足,本文介绍了一种新的主​​题模型,即高斯-LDA,用于音频检索。在提出的模型中,我们考虑了连续发射概率,即高斯分布而不是多项式分布。这个新的主题模型跳过了矢量量化,并直接将每个主题建模为音频特征上的高斯分布。这样就避免了离散化,并集成了聚类的过程。音频检索实验表明,高斯-LDA比其他比较方法具有更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号