首页> 外文会议>ACM international conference on Multimedia >An audio stream classification and optimal segmentation for multimedia applications
【24h】

An audio stream classification and optimal segmentation for multimedia applications

机译:多媒体应用的音频流分类和最佳分段

获取原文

摘要

In this paper we investigate on-line zero-crossing based audio stream segmentation and classification into speech and other segments. We consider such segments as applause, noise of the auditorium, and silence. We demonstrate that the features extracted from zero-crossing are stable and valid to be used for speech and other signal discrimination and classification and don't require large amount of data for the training. We describe the optimal segmentation of unlimited audio signals using results of the frames classification. We demonstrate that using optimal segmentation is better than using traditional sliding window technique.
机译:在本文中,我们研究了基于在线过零的音频流分割并将其分类为语音和其他片段。我们认为这些部分包括掌声,礼堂噪音和沉默。我们证明了从过零中提取的特征是稳定且有效的,可用于语音和其他信号的辨别和分类,并且不需要大量的数据来进行训练。我们使用帧分类的结果描述了无限音频信号的最佳分割。我们证明使用最佳分割比使用传统的滑动窗口技术更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号