Latent topic model for audio retrieval

Pengfei Hu; Wenju Liu; Wei Jiang; Zhanlei Yang

首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Latent topic model for audio retrieval

【24h】

Latent topic model for audio retrieval

机译：音频检索的潜在主题模型

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Latent topic model such as Latent Dirichlet Allocation (LDA) has been designed for text processing and has also demonstrated success in the task of audio related processing. The main idea behind LDA assumes that the words of each document arise from a mixture of topics, each of which is a multinomial distribution over the vocabulary. When applying the original LDA to process continuous data, the wordlike unit need be first generated by vector quantization (VQ). This data discretization usually results in information loss. To overcome this shortage, this paper introduces a new topic model named Gaussian-LDA for audio retrieval. In the proposed model, we consider continuous emission probability, Gaussian instead of multinomial distribution. This new topic model skips the vector quantization and directly models each topic as a Gaussian distribution over audio features. It avoids discretization by this way and integrates the procedure of clustering. The experiments of audio retrieval demonstrate that Gaussian-LDA achieves better performance than other compared methods.

机译：潜在主题模型（例如潜在Dirichlet分配（LDA））已设计用于文本处理，并且在音频相关处理的任务中也得到了证明。 LDA的主要思想是假设每个文档的单词都来自主题的混合体，每个主题都是词汇表上的多项式分布。将原始LDA应用于连续数据处理时，首先需要通过矢量量化（VQ）生成单词单元。这种数据离散化通常会导致信息丢失。为了克服这种不足，本文介绍了一种新的主题模型，即高斯-LDA，用于音频检索。在提出的模型中，我们考虑了连续发射概率，即高斯分布而不是多项式分布。这个新的主题模型跳过了矢量量化，并直接将每个主题建模为音频特征上的高斯分布。这样就避免了离散化，并集成了聚类的过程。音频检索实验表明，高斯-LDA比其他比较方法具有更好的性能。

著录项

来源
《Pattern Recognition: The Journal of the Pattern Recognition Society》 |2014年第3期|共6页
作者
Pengfei Hu; Wenju Liu; Wei Jiang; Zhanlei Yang;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Topic model; LDA; Gaussian distribution; Audio retrieval;

机译：主题模型;LDA;高斯分布;音频检索;

相似文献

外文文献
中文文献
专利

1. Latent topic model for audio retrieval [J] . Pengfei Hu, Wenju Liu, Wei Jiang, Pattern Recognition: The Journal of the Pattern Recognition Society . 2014,第3期

机译：音频检索的潜在主题模型
2. Cross-language information retrieval models based on latent topic models trained with document-aligned comparable corpora [J] . Ivan Vulic, Wim De Smet, Marie-Francine Moens Information retrieval . 2013,第3期

机译：基于潜在主题模型的跨语言信息检索模型，该主题模型经过与文档对齐的可比语料库训练
3. Latent acoustic topic models for unstructured audio classification [J] . Panayiotis Georgiou, Samuel Kim, Shrikanth Narayanan APSIPA Transactions on Signal and Information Processing . 2012,第2012期

机译：非结构化音频分类的潜在声学主题模型
4. Latent Topic Modeling for Audio Corpus Summarization [C] . Timothy J. Hazen Annual conference of the International Speech Communication Association;INTERSPEECH 2011 . 2011

机译：音频语料库摘要的潜在主题建模
5. The Ensemble MeSH-Term Query Expansion Models Using Multiple LDA Topic Models and ANN Classifiers in Health Information Retrieval [D] . You, Sukjin. 2020

机译：使用多个LDA主题模型和健康信息检索的ANN分类器的集合网格术语查询型号
6. Clinical Case-based Retrieval Using Latent Topic Analysis [O] . Corey W. Arnold, Suzie M. El-Saden, Alex A.T. Bui, 2010

机译：使用潜在主题分析的基于临床病例的检索
7. Cross-language information retrieval models based on latent topic models trained with document-aligned comparable corpora [O] . Vulic Ivan, De Smet Wim, Moens Marie-Francine 2013

机译：基于潜在主题模型的跨语言信息检索模型，该主题模型经过与文档对齐的可比语料库训练

Latent topic model for audio retrieval

摘要

著录项

相似文献

相关主题

期刊订阅