首页> 外文期刊>EURASIP journal on audio, speech, and music processing >Automatic Recognition of Lyrics in Singing
【24h】

Automatic Recognition of Lyrics in Singing

机译:唱歌中歌词的自动识别

获取原文
获取原文并翻译 | 示例
           

摘要

The paper considers the task of recognizing phonemes and words from a singing input by using a phonetic hidden Markov model recognizer. The system is targeted to both monophonic singing and singing in polyphonic music. A vocal separation algorithm is applied to separate the singing from polyphonic music. Due to the lack of annotated singing databases, the recognizer is trained using speech and linearly adapted to singing. Global adaptation to singing is found to improve singing recognition performance. Further improvement is obtained by gender-specific adaptation. We also study adaptation with multiple base classes defined by either phonetic or acoustic similarity. We test phoneme-level and word-level n-gram language models. The phoneme language models are trained on the speech database text. The large-vocabulary word-level language model is trained on a database of textual lyrics. Two applications are presented. The recognizer is used to align textual lyrics to vocals in polyphonic music, obtaining an average error of 0.94 seconds for line-level alignment. A query-by-singing retrieval application based on the recognized words is also constructed; in 57% of the cases, the first retrieved song is the correct one.
机译:本文考虑了通过使用语音隐式马尔可夫模型识别器从唱歌输入中识别音素和单词的任务。该系统的目标是单声道唱歌和复音音乐唱歌。应用声音分离算法将歌唱与和弦音乐分离。由于缺少注释的歌唱数据库,因此识别器使用语音进行训练,并且线性地适应歌唱。发现对歌唱的整体适应性提高了歌唱识别性能。通过针对性别的适应获得进一步的改善。我们还将研究通过语音或声学相似性定义的多个基本类别的适应性。我们测试音素级和词级n-gram语言模型。在语音数据库文本上训练音素语言模型。大词汇量单词级语言模型在文本歌词数据库上进行训练。介绍了两个应用程序。识别器用于将和弦歌词与复音音乐中的人声对齐,行级对齐的平均误差为0.94秒。还构造了基于识别出的单词的按词查询检索应用程序;在57%的情况下,第一首检索到的歌曲是正确的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号