Conference: International Conference on Intelligent Computing (ICIC 2006); 16-19 August 2006; Kunming, China

Audio Content Analysis for Understanding Structures of Scene in Video


Abstract

In this paper, we propose a system that categorizes audio into seven classes. As classification features, we use the mean and variance of the root mean square (RMS), zero-crossing rate (ZCR), fundamental frequency, and frequency peak, each extracted from every 25 ms frame. In addition to audio content classification, we perform speaker identification on voice sequences that are extracted automatically by the proposed method. The proposed scheme reaches an accuracy of 93.8% in categorizing audio signals and 80% in speaker identification.
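
The frame-level features named in the abstract can be illustrated with a minimal sketch. It covers only RMS and ZCR, and assumes non-overlapping 25 ms frames and a common ZCR definition (the fraction of adjacent sample pairs whose signs differ); the abstract specifies neither choice. Fundamental-frequency and frequency-peak extraction and the classifier itself are omitted, and all function names here are hypothetical rather than taken from the paper.

```python
import math
from statistics import fmean, pvariance


def split_frames(signal, sample_rate, frame_ms=25):
    """Split a signal into non-overlapping frames (assumption: no overlap)."""
    n = int(sample_rate * frame_ms / 1000)  # samples per frame
    return [signal[i:i + n] for i in range(0, len(signal) - n + 1, n)]


def rms(frame):
    """Root mean square amplitude of one frame."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def zcr(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def clip_features(signal, sample_rate, frame_ms=25):
    """Mean and variance of per-frame RMS and ZCR over a whole clip."""
    frames = split_frames(signal, sample_rate, frame_ms)
    rms_values = [rms(f) for f in frames]
    zcr_values = [zcr(f) for f in frames]
    return {
        "rms_mean": fmean(rms_values),
        "rms_var": pvariance(rms_values),
        "zcr_mean": fmean(zcr_values),
        "zcr_var": pvariance(zcr_values),
    }


# Illustrative input: one second of a 440 Hz sine sampled at 8 kHz.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 440 * t / sample_rate) for t in range(sample_rate)]
features = clip_features(tone, sample_rate)
```

For a steady tone like this one, the per-frame RMS barely changes, so its variance is close to zero; signals such as speech or music would be expected to show larger variance, which is presumably what makes these statistics useful as classification features.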
