首页> 外文会议>Mexican International Conference on Artificial Intelligence >Artificial Intelligence Methods for Automatic Music Transcription using Isolated Notes in Real-Time
【24h】

Artificial Intelligence Methods for Automatic Music Transcription using Isolated Notes in Real-Time

机译:实时使用孤立音符自动转录音乐的人工智能方法

获取原文

摘要

We introduce a comparative study of several features obtained from audio signal and methods of Artificial Intelligence employed for Automatic Music Transcription in real-time, specially using monophonic notes. Mel-frequency Cepstrum Coefficients (MFCC), Linear Prediction Coefficients (LPC) and Cochlear Mechanics Cepstrum Coefficient (CMCC) were the features used which are a set of coefficients obtained from our laboratory experiments, which in this paper demonstrated to be more effective for Automatic Music Transcription (ATM) than other characteristics such as Mel Frequency Cepstral Coefficients (MFCC). At same time, Vector Quantization (VQ), Hidden Markov Models (HMM), Gaussian Mixtures Models (GMM) and Artificial Neural Networks (ANN) for pattern classification task were used. The database consisted of 840 music notes, we analyzed 5 scales and 14 samples by musical note. The results obtained showed that Vector Quatization, HMM using CMCC_L&B_RA and GMM were the best methods of Artificial Inteligent for this task, while MFCC and CMCC_L&B_RA were the best features employed.
机译:我们将对从音频信号获得的若干功能进行比较研究,并实时地对用于自动音乐转录的人工智能方法进行研究,特别是使用单音符。所使用的特征是梅尔频率倒谱系数(MFCC),线性预测系数(LPC)和耳蜗力学倒谱系数(CMCC),它们是从我们的实验室实验中获得的一组系数,在本文中证明对自动音乐转录(ATM)比其他特征(例如梅尔频率倒谱系数(MFCC))高。同时,使用矢量量化(VQ),隐马尔可夫模型(HMM),高斯混合模型(GMM)和人工神经网络(ANN)进行模式分类任务。该数据库由840个音符组成,我们通过音符分析了5个音阶和14个样本。获得的结果表明,矢量量化,使用CMCC_L&B_RA和GMM的HMM是实现此任务的最佳人工智能方法,而MFCC和CMCC_L&B_RA是采用的最佳功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号