N-GRAM EXTENSION FOR BAG-OF-AUDIO-WORDS

Abstract

Bag-of-audio-words is one of the most frequently used methods for incorporating an audio component into multimedia event detection and related tasks. A main criticism of the method, however, is that it ignores context. Each "word" is considered in isolation, ignoring its neighbors. We address this issue by representing the document by its audio word N-grams. Unlike words from natural language, audio words are generated by clustering algorithms where the number of clusters is specified by the researcher. We therefore also explore how the performance of the N-gram representation varies with codebook size. With this enhanced representation, we find the average probability of miss noticeably decreases when evaluated on TRECVID 2011 and 2012 datasets, indicating clear improvements on the multimedia event detection task.
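The representation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract says only that audio words come from a clustering algorithm with a researcher-chosen number of clusters, so the codebook here is assumed to be given, and nearest-centroid assignment with squared Euclidean distance is an assumption. The function names `quantize` and `ngram_histogram` and the toy data are hypothetical.

```python
from collections import Counter


def quantize(frames, codebook):
    """Map each feature frame to the index of its nearest codeword.

    Assumption: squared Euclidean distance; the codebook is taken as given
    (in practice it would come from a clustering step such as k-means).
    """
    def dist2(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return [
        min(range(len(codebook)), key=lambda k: dist2(frame, codebook[k]))
        for frame in frames
    ]


def ngram_histogram(words, n):
    """Count n-grams of consecutive audio words, normalized to sum to 1."""
    grams = [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]
    counts = Counter(grams)
    total = sum(counts.values())
    return {gram: c / total for gram, c in counts.items()} if total else {}


# Toy example: a 3-word codebook and five 2-dimensional feature frames.
codebook = [(0.0, 0.0), (1.0, 1.0), (4.0, 4.0)]
frames = [(0.1, 0.0), (0.9, 1.2), (0.2, -0.1), (1.1, 0.8), (3.9, 4.2)]

words = quantize(frames, codebook)      # sequence of audio word indices
unigrams = ngram_histogram(words, 1)    # plain bag-of-audio-words
bigrams = ngram_histogram(words, 2)     # keeps each word's right neighbor
```

With a codebook of K words there are up to K^n distinct n-grams, which is one reason the choice of codebook size interacts with the N-gram representation.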


Bibliographic record

Source: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2013, 5 pages
Authors: Stephanie Pancoast; Murat Akbacak
Original format: PDF
Chinese Library Classification: TN912-53

