首页> 外文期刊>Multimedia Tools and Applications >Content-based singer classification on compressed domain audio data
【24h】

Content-based singer classification on compressed domain audio data

机译:压缩域音频数据上基于内容的歌手分类

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this paper, we proposed a singer identification approach to automatically identify the singer of an unknown MP3 audio data. Differing from previous researches for singer identification in MP3 compressed domain, we use Mel-Frequency Cepstral Coefficients (MFCC) as the feature instead of MDCT (modified discrete cosine transform) coefficients. Although MFCC is often used in music classification and speaker recognition, it cannot be directly obtained from compressed music data such as MP3 format. We introduce a modified method for calculating MFCC vector in MP3 compressed domain. For describing the distribution of MFCC vector, the Gaussian mixture model (GMM) is applied. To find the nearest singer, we use maximum likelihood classification (MLC) to allot each input MFCC vector to its nearest group. The experimental result verifies the feasibility of the proposed approach.
机译:在本文中,我们提出了一种歌手识别方法来自动识别未知MP3音频数据的歌手。与以前在MP3压缩域中进行歌手识别的研究不同,我们使用Mel频率倒谱系数(MFCC)作为特征而不是MDCT(修正离散余弦变换)系数。尽管MFCC通常用于音乐分类和说话者识别,但不能直接从压缩的音乐数据(例如MP3格式)中获得MFCC。我们介绍了一种在MP3压缩域中计算MFCC向量的改进方法。为了描述MFCC向量的分布,应用了高斯混合模型(GMM)。为了找到最接近的歌手,我们使用最大似然分类(MLC)将每个输入MFCC向量分配给最接近的组。实验结果验证了该方法的可行性。

著录项

  • 来源
    《Multimedia Tools and Applications》 |2015年第4期|1489-1509|共21页
  • 作者单位

    Department of Electrical Engineering, National Central University, No.300, Jhongda Rd., Jhongli City, Taoyuan County 32001 Taiwan, People's Republic of China;

    Department of Electrical Engineering, National Central University, No.300, Jhongda Rd., Jhongli City, Taoyuan County 32001 Taiwan, People's Republic of China;

    Department of Electrical Engineering, National Central University, No.300, Jhongda Rd., Jhongli City, Taoyuan County 32001 Taiwan, People's Republic of China;

    Department of Electrical Engineering, National Central University, No.300, Jhongda Rd., Jhongli City, Taoyuan County 32001 Taiwan, People's Republic of China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    MP3; MDCT; MFCC; GMM;

    机译:MP3;MDST;IFRC;GMM;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号