首页> 外文会议>Artificial intelligence: Theories, models and applications >A Multi-class Method for Detecting Audio Events in News Broadcasts
【24h】

A Multi-class Method for Detecting Audio Events in News Broadcasts

机译:一种检测新闻广播中音频事件的多类方法

获取原文
获取原文并翻译 | 示例

摘要

We propose a method for audio event detection in video streams from news. Apart from detecting speech, which is obviously the major class in such content, the proposed method detects five non-speech audio classes. The major difficulty of the particular task lies in the fact that most of the non-speech audio events are actually background sounds, with speech as the primary sound. We have adopted a set of 21 statistics computed on a mid-term basis over 7 audio features. A variation of the One Vs All classification architecture has been adopted and each binary classification problem is modeled using a separate probabilistic Support Vector Machine. Experiments have shown that the proposed method can achieve high precision rates for most of the audio events of interest.
机译:我们提出了一种用于新闻视频流中音频事件检测的方法。除了检测语音(显然是此类内容中的主要类别)外,该方法还检测了五个非语音音频类别。特定任务的主要困难在于以下事实:大多数非语音音频事件实际上都是背景声音,而语音是主要声音。我们采用了21种统计数据,这些统计数据是在中期基础上针对7种音频功能进行计算的。 One Vs All分类架构已采用,并且每个二进制分类问题都使用单独的概率支持向量机建模。实验表明,对于大多数感兴趣的音频事件,该方法可以达到较高的准确率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号