首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2010 >Automatic Excitement-Level Detection for Sports Highlights Generation
【24h】

Automatic Excitement-Level Detection for Sports Highlights Generation

机译:运动精彩片段自动激发水平检测

获取原文

摘要

The problem of automatic excitement detection in baseball videos is considered and applied for highlight generation. This paper focuses on detecting exciting events in video using complementary information from the audio and video domains. First, a new measure for non-stationarity which is extremely effective in separating background from speech is proposed. This new feature is employed in an unsupervised GMM-based segmentation algorithm that identifies the sports commentators speech within the crowd background. Thereafter, the "level-of-excitement" is measured using features such as pitch, F1-F3 center frequencies, and spectral center of gravity extracted from the commentators speech. Our experiments using actual baseball videos show that these features are well correlated with human assessment of excitability. Furthermore, slow-motion replay and baseball pitching-scenes from the video are also detected to estimate scene end-points. Finally, audio/video information is fused to rank-order scenes by "excitability" in order to generate highlights of user-defined time-lengths. The techniques described in this paper are generic and applicable to a variety of topic and video/acoustic domains.
机译:考虑了棒球视频中自动兴奋检测的问题,并将其应用于亮点生成。本文着重于利用来自音频和视频领域的补充信息来检测视频中令人兴奋的事件。首先,提出了一种非平稳性的新措施,该方法在分离背景和语音方面非常有效。这项新功能用于基于无监督GMM的细分算法中,该算法可识别人群背景中的体育评论员语音。之后,使用诸如音调,F1-F3中心频率和从评论者语音中提取的频谱重心之类的特征来测量“兴奋度”。我们使用实际棒球视频进行的实验表明,这些功能与人类对兴奋性的评估密切相关。此外,还可以检测视频中的慢动作重播和棒球投球场景,以估计场景终点。最后,通过“兴奋性”将音频/视频信息融合到排序场景中,以生成用户定义的时间长度的突出显示。本文介绍的技术是通用的,适用于各种主题和视频/声音领域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号