Audio Content Analysis for Understanding Structures of Scene in Video


Abstract

In this paper, we propose a system that categorizes audio into seven classes. As classification features, we use the mean and variance of the root mean square (RMS) energy, zero-crossing rate (ZCR), fundamental frequency, and frequency peak, each extracted from every 25 ms frame. In addition to audio content classification, we perform speaker identification on voice sequences that are extracted automatically by our proposed method. The proposed scheme reaches an accuracy of 93.8% in categorizing audio signals and 80% in speaker identification.
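To make the feature description concrete, the following is a minimal sketch of clip-level statistics for two of the four features named in the abstract (RMS and ZCR), computed over 25 ms frames. It is not the authors' implementation: the non-overlapping framing, the ZCR normalization, and the function names `split_frames`, `rms`, `zero_crossing_rate`, and `clip_features` are assumptions made for illustration, and the fundamental frequency and frequency peak features are omitted because the abstract does not say how they are estimated.

```python
import math
from statistics import fmean, pvariance


def split_frames(samples, sample_rate, frame_ms=25.0):
    """Cut a sequence of samples into non-overlapping frames of frame_ms milliseconds.

    Non-overlapping framing is an assumption; the abstract only gives the frame length.
    Trailing samples that do not fill a whole frame are dropped.
    """
    frame_len = int(round(sample_rate * frame_ms / 1000.0))
    n_frames = len(samples) // frame_len
    return [samples[i * frame_len:(i + 1) * frame_len] for i in range(n_frames)]


def rms(frame):
    """Root mean square amplitude of one frame."""
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ (assumed normalization)."""
    crossings = sum((a < 0) != (b < 0) for a, b in zip(frame, frame[1:]))
    return crossings / (len(frame) - 1)


def clip_features(samples, sample_rate, frame_ms=25.0):
    """Mean and variance of per-frame RMS and ZCR for one audio clip.

    Assumes the clip contains at least one full frame.
    """
    frames = split_frames(samples, sample_rate, frame_ms)
    rms_values = [rms(f) for f in frames]
    zcr_values = [zero_crossing_rate(f) for f in frames]
    return {
        "rms_mean": fmean(rms_values),
        "rms_var": pvariance(rms_values),
        "zcr_mean": fmean(zcr_values),
        "zcr_var": pvariance(zcr_values),
    }


if __name__ == "__main__":
    # Hypothetical input: one second of a 440 Hz tone sampled at 16 kHz.
    sr = 16000
    tone = [math.sin(2 * math.pi * 440 * n / sr) for n in range(sr)]
    print(clip_features(tone, sr))
```

A steady tone like the one in the usage example gives near-zero variance for both features, whereas speech or mixed audio would typically show larger frame-to-frame variation, which is presumably what makes the variance useful as a class cue.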


Bibliographic details

Source: International Conference on Intelligent Computing (ICIC 2006), August 16-19, 2006, Kunming (CN), pp. 1213-1218 (6 pages)
Conference location: Kunming (CN)
Authors: Chan-Mi Kang; Joong-Hwan Baek
Author affiliation: Multimedia Retrieval Lab., School of Electronics and Communication Engineering, Hankuk Aviation University
Original format: PDF
Language: English
Chinese Library Classification: Artificial intelligence theory

Similar literature


1. Scene-Aware Audio for 360° Videos [J]. Li Dingzeyu, Langlois Timothy R., Zheng Changxi. ACM Transactions on Graphics, 2018, Issue 4CD.
2. A highlight scene detection and video summarization system using audio feature for a personal video recorder [J]. Otsuka I., Nakane K., Divakaran A., et al. IEEE Transactions on Consumer Electronics, 2005, Issue 1.
3. A human-like description of scene events for a proper UAV-based video content analysis [J]. Cavaliere Danilo, Loia Vincenzo, Saggese Alessia, et al. Knowledge-Based Systems, 2019, Issue AUGa15.
4. Audio Content Analysis for Understanding Structures of Scene in Video [C]. Chan-Mi Kang, Joong-Hwan Baek. International Conference on Intelligent Computing, 2006.
5. Audio-visual scene analysis with application in sports video [D]. Xiong, Ziyou. 2004.
6. Comparison of audio vs. audio + video for the rating of shared decision making in oncology using the observer OPTION5 instrument: an exploratory analysis [O]. Michael R. Gionfriddo, Megan E. Branda, Cara Fernandez, et al. 2018.
7. Audio-coupled video content understanding of unconstrained video sequences [O]. Lopes Jose E.F.C. 2011.
