In ancient times, there was no system to record or document music. A basic notation system for writing European music was formulated in the medieval period, around the 14th century, and slowly evolved into the standard staff notation we have today. Musical pieces from the classical and post-classical periods of European music were later documented as scores using this standard European staff notation, which most modern genres of music still use owing to its versatility. Hence, it is important to develop a method for digitally storing music sheets containing handwritten music scores. Optical music recognition (OMR) refers to the automated interpretation of scanned handwritten music scores. In this work, we propose a classifier ensemble of deep transfer learning models with a support vector machine (SVM) as the aggregator for handwritten music symbol recognition. We take three deep learning models pre-trained on ImageNet, namely ResNet50, GoogLeNet and DenseNet161, and fine-tune them on our target datasets, i.e., music symbol image datasets. The proposed ensemble technique can capture more complex associations among the base classifiers, thus improving the overall performance. We evaluate the proposed model on five publicly available standard datasets, namely Handwritten Online Music Symbols (HOMUS), Capitan_Score_Uniform, Capitan_Score_Non-uniform, Rebelo_real and Fornes, and achieve state-of-the-art results on all of them. Additionally, we evaluate our model on two publicly available non-music symbol datasets, namely CMATERdb 2.1.2, containing 120 handwritten Bangla city names, and CMATERdb 3.1.1, containing handwritten Bangla numerals, to validate its effectiveness on diversified datasets. The source code of the present work is available at https://github.com/ashis0013/Music-Symbol-Recognition.
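To make the ensemble idea concrete, the following is a minimal sketch of one plausible reading of the pipeline: three ImageNet-pretrained backbones with their classification heads replaced, whose softmax outputs are concatenated into a meta-feature vector on which an SVM is trained as the aggregator. The dataset loaders, the `NUM_CLASSES` value, the SVM kernel, and the helper names are illustrative assumptions, not the authors' exact implementation (see the repository linked above for that).

```python
# Hedged sketch: stacking ensemble of fine-tuned CNNs with an SVM aggregator.
# Assumes torchvision >= 0.13 and scikit-learn; loaders/NUM_CLASSES are placeholders.
import numpy as np
import torch
import torch.nn as nn
import torchvision.models as models
from sklearn.svm import SVC

NUM_CLASSES = 32  # e.g., HOMUS defines 32 symbol classes

def build_base_models(num_classes):
    """Load ImageNet-pretrained backbones and replace their classification heads."""
    resnet = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V1)
    resnet.fc = nn.Linear(resnet.fc.in_features, num_classes)

    googlenet = models.googlenet(weights=models.GoogLeNet_Weights.IMAGENET1K_V1)
    googlenet.fc = nn.Linear(googlenet.fc.in_features, num_classes)

    densenet = models.densenet161(weights=models.DenseNet161_Weights.IMAGENET1K_V1)
    densenet.classifier = nn.Linear(densenet.classifier.in_features, num_classes)

    return [resnet, googlenet, densenet]

@torch.no_grad()
def stacked_probabilities(base_models, loader, device="cpu"):
    """Concatenate each base model's softmax output into one meta-feature vector per image."""
    feats, labels = [], []
    for x, y in loader:
        x = x.to(device)
        per_model = [torch.softmax(m(x), dim=1).cpu() for m in base_models]
        feats.append(torch.cat(per_model, dim=1).numpy())
        labels.append(y.numpy())
    return np.concatenate(feats), np.concatenate(labels)

# Usage outline, after fine-tuning each base model on the music-symbol training set
# and switching it to eval mode with m.eval():
#
#   base_models = build_base_models(NUM_CLASSES)
#   X_train, y_train = stacked_probabilities(base_models, train_loader)
#   svm = SVC(kernel="rbf")   # the SVM acts as the ensemble aggregator
#   svm.fit(X_train, y_train)
#   X_test, y_test = stacked_probabilities(base_models, test_loader)
#   print("ensemble accuracy:", svm.score(X_test, y_test))
```

Aggregating full probability vectors rather than hard votes is what lets the SVM learn the "more complex associations" among base classifiers mentioned above; a majority vote would discard the confidence information each backbone provides.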