首页> 外文OA文献 >Locating Singing Voice Segments within Music Signals
【2h】

Locating Singing Voice Segments within Music Signals

机译:在音乐信号中定位演唱语音片段

摘要

A sung vocal line is the prominent feature of much popular music. It would be useful to locate the portions of a musical track during which the vocals are present reliably, both as a 'signature' of the piece and as a precursor to automatic recognition of lyrics. We approach this problem by using the acoustic classifier of a speech recognizer as a detector for speech-like sounds. Although singing (including a musical background) is a relatively poor match to an acoustic model trained on normal speech, we propose various statistics of the classifier's output in order to discriminate singing from instrumental accompaniment. A simple HMM allows us to find a best labeling sequence for this uncertain data. On a test set of forty 15 second excerpts of randomly-selected music, our classifier achieved around 80% classification accuracy at the frame level. The utility of different features, and our plans for eventual lyrics recognition, are discussed.
机译:演唱的人声是许多流行音乐的突出特征。定位乐曲中可靠地存在人声的部分,这既是乐曲的“签名”,又是歌词自动识别的前奏,将很有用。我们通过使用语音识别器的声学分类器作为类似语音的声音的检测器来解决此问题。尽管唱歌(包括音乐背景)与以正常语音训练的声学模型相对较差,但我们提出了分类器输出的各种统计信息,以区分唱歌与乐器伴奏。一个简单的HMM可以让我们为这种不确定的数据找到最佳的标记顺序。在随机选择的音乐的四十五秒摘要的测试集中,我们的分类器在帧级别上实现了约80%的分类精度。讨论了各种功能的实用性以及我们最终的歌词识别计划。

著录项

  • 作者单位
  • 年度 2001
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号