...
首页> 外文期刊>The international arab journal of information technology >An Improved Feature Extraction and Combination of Multiple Classifiers for Query-by-Humming
【24h】

An Improved Feature Extraction and Combination of Multiple Classifiers for Query-by-Humming

机译:改进的基于分类的查询分类器特征提取与组合

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we propose new methods for feature extraction and soft majority voting to adjust efficiency and accuracy of music retrieval. For our work, the input is humming sound which is sound wave and Musical Instrument Digital Interface (MIDI) is used as the reference song in database. A critical issue of humming sound are variation such as duration, sound, tempo, key, and noise interference from both environment and acquisition instruments. Besides all the problems of humming sound we have mentioned earlier, whether humming sound and MIDI in different domain which will make the difficulty for two domains to compare each other. However, to make these two in the same domain, we convert them into the frequency domain. Our approach starts from pre-processing by using features for note segmentation by humming sound. The process consists of four steps as follows: Firstly, the MIDI is already a sequence of pitch while the pitch in humming sound is needed to extract by Subharmonic-to-Harmonic (SHR). Subsequently, the extracted pitch can be used to calculate all above attributes and then multiple classifiers are applied to classify the multiple subsets of these features. Afterwards, the subset contain the multiple attributes, Multi-Dimensional Dynamic Time Warping (MD-DTW) is used for similarity measurement. Finally, Nearest Neighbours (NN) and soft majority voting are used to obtain the retrieval results in case of equal scores. From the experiments, to achieve 100% accuracy rate at the early top-n rank in retrieving, the appropriate feature set should consist of five classifiers.
机译:在本文中,我们提出了一种新的特征提取和软多数投票的方法,以调整音乐检索的效率和准确性。对于我们的工作,输入的是嗡嗡声,即声波,并且乐器数字接口(MIDI)被用作数据库中的参考歌曲。嗡嗡声的一个关键问题是变化,例如持续时间,声音,速度,调子以及来自环境和采集仪器的噪声干扰。除了前面提到的所有嗡嗡声问题外,是否在不同域中哼唱声音和MIDI都将使两个域之间难以相互比较。但是,为了使这两个位于同一域中,我们将它们转换为频域。我们的方法从预处理开始,通过使用通过嗡嗡声进行音符分割的功能。该过程包括以下四个步骤:首先,MIDI已经是一个音高序列,而嗡嗡声中的音高则需要通过次谐波到谐波(SHR)来提取。随后,提取的音高可用于计算所有上述属性,然后应用多个分类器对这些特征的多个子集进行分类。之后,该子集包含多个属性,多维动态时间规整(MD-DTW)用于相似性度量。最后,在分数相等的情况下,使用最近邻(NN)和软多数投票来获取检索结果。从实验中,要在检索的前n个排名中达到100%的准确率,适当的特征集应包含五个分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号