首页> 外文会议> >Robust soccer highlight generation with a novel dominant-speech feature extractor
【24h】

Robust soccer highlight generation with a novel dominant-speech feature extractor

机译:新型的主导语音特征提取器可生成强大的足球精彩片段

获取原文

摘要

We describe soccer highlight generation from only the audio stream in the video. A novel audio feature is used to detect parts of the commentary corresponding to dominant and excited speech. It is computed by a twice-iterated composite Fourier transform (CFT) on short-time windows, wherein the magnitude spectrum of the first transform is input to a second transform. Dominant speech portions are found to be robustly characterized by increased density in the peak profile. We verify the robustness of CFT via large scale empirical testing and explain its working based on a pulse train postulate of dominant speech signals. Our audio-only approach results in a compute-efficient system deployable on current generation set-top-boxes and digital video recording devices.
机译:我们仅根据视频中的音频流来描述足球精彩片段的生成。一种新颖的音频功能可用于检测与主要语音和激动性语音相对应的评论部分。它是通过在短时间窗口上进行两次迭代的复合傅里叶变换(CFT)来计算的,其中,将第一变换的幅度谱输入到第二变换。发现主要语音部分的特征在于峰值轮廓的密度增加。我们通过大规模的经验测试来验证CFT的鲁棒性,并根据占主导地位的语音信号的脉冲序列假设来解释CFT的工作原理。我们的纯音频方法可实现可在当前一代的机顶盒和数字视频记录设备上部署的高效计算系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号