首页> 美国卫生研究院文献>SpringerPlus >A hierarchical approach for speech-instrumental-song classification
【2h】

A hierarchical approach for speech-instrumental-song classification

机译:语音-乐器-歌曲分类的分层方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Audio classification acts as the fundamental step for lots of applications like content based audio retrieval and audio indexing. In this work, we have presented a novel scheme for classifying audio signal into three categories namely, speech, music without voice (instrumental) and music with voice (song). A hierarchical approach has been adopted to classify the signals. At the first stage, signals are categorized as speech and music using audio texture derived from simple features like ZCR and STE. Proposed audio texture captures contextual information and summarizes the frame level features. At the second stage, music is further classified as instrumental/song based on Mel frequency cepstral co-efficient (MFCC). A classifier based on Random Sample and Consensus (RANSAC), capable of handling wide variety of data has been utilized. Experimental result indicates the effectiveness of the proposed scheme.Electronic supplementary materialThe online version of this article (doi:10.1186/2193-1801-2-526) contains supplementary material, which is available to authorized users.
机译:音频分类是许多应用程序的基本步骤,例如基于内容的音频检索和音频索引。在这项工作中,我们提出了一种将音频信号分为三类的新颖方案,即语音,无语音音乐(乐器)和有语音音乐(歌曲)。已经采用了分级方法来对信号进行分类。在第一阶段,使用从简单特征(如ZCR和STE)派生的音频纹理将信号分类为语音和音乐。拟议的音频纹理捕获上下文信息并总结帧级功能。在第二阶段,根据梅尔频率倒谱系数(MFCC)将音乐进一步分类为器乐/歌曲。已经使用了基于随机样本和共识(RANSAC)的分类器,该分类器能够处理各种数据。实验结果表明了该方案的有效性。电子补充材料本文的在线版本(doi:10.1186 / 2193-1801-2-526)包含补充材料,授权用户可以使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号