Conference: ACM International Conference on Multimedia

An Audio Stream Classification and Optimal Segmentation for Multimedia Applications

Abstract

In this paper we investigate on-line, zero-crossing-based segmentation of audio streams and their classification into speech and non-speech segments. The non-speech segments we consider are applause, auditorium noise, and silence. We demonstrate that features extracted from zero-crossings are stable and valid for discriminating speech from other signals, and that they do not require a large amount of training data. We describe an optimal segmentation of audio signals of unlimited length, based on the results of frame classification. We demonstrate that optimal segmentation performs better than the traditional sliding-window technique.
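
As a rough illustration of the kind of feature the abstract refers to, the sketch below computes a per-frame zero-crossing rate. It is not taken from the paper: the function names, the frame length, the hop size, and the convention of treating zero-valued samples as non-negative are all assumptions made for this example, and the paper's actual feature set may differ.

```python
import math


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ.

    Assumption for this sketch: a sample equal to zero counts as non-negative.
    """
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def frame_zcr(signal, frame_len, hop):
    """Zero-crossing rate of each full frame of `signal` (hypothetical helper)."""
    return [
        zero_crossing_rate(signal[start:start + frame_len])
        for start in range(0, len(signal) - frame_len + 1, hop)
    ]


# Synthetic input: a 100 Hz tone followed by digital silence, sampled at 8 kHz.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 100 * n / sample_rate) for n in range(800)]
silence = [0.0] * 800
rates = frame_zcr(tone + silence, frame_len=400, hop=400)
```

With these settings `rates` holds one value per 50 ms frame: the tone frames give a small positive rate and the silent frames give zero. A frame classifier of the sort the abstract describes would take such values, possibly alongside other statistics, as input.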