Automatic Recognition of Lyrics in Singing

Annamaria Mesaros; Tuomas Virtanen


Abstract

The paper considers the task of recognizing phonemes and words from a singing input by using a phonetic hidden Markov model recognizer. The system is targeted to both monophonic singing and singing in polyphonic music. A vocal separation algorithm is applied to separate the singing from polyphonic music. Due to the lack of annotated singing databases, the recognizer is trained using speech and linearly adapted to singing. Global adaptation to singing is found to improve singing recognition performance. Further improvement is obtained by gender-specific adaptation. We also study adaptation with multiple base classes defined by either phonetic or acoustic similarity. We test phoneme-level and word-level n-gram language models. The phoneme language models are trained on the speech database text. The large-vocabulary word-level language model is trained on a database of textual lyrics. Two applications are presented. The recognizer is used to align textual lyrics to vocals in polyphonic music, obtaining an average error of 0.94 seconds for line-level alignment. A query-by-singing retrieval application based on the recognized words is also constructed; in 57% of the cases, the first retrieved song is the correct one.
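The query-by-singing idea can be illustrated with a minimal sketch. The abstract does not say how recognized words are matched against lyrics, so the word-overlap score below is an assumption chosen for simplicity, and all names (`rank_songs`, `top1_accuracy`, the toy `lyrics_db`) are hypothetical. It ranks songs by how many recognized words also occur in their lyrics and computes the fraction of queries whose top-ranked song is correct, which is the kind of figure the reported 57% refers to.

```python
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


def overlap_score(query_words, lyrics_words):
    """Count query words that also occur in the lyrics, respecting multiplicity.

    Assumption: plain bag-of-words overlap, not the scoring used in the paper.
    """
    query_counts = Counter(query_words)
    lyrics_counts = Counter(lyrics_words)
    return sum(min(n, lyrics_counts[w]) for w, n in query_counts.items())


def rank_songs(recognized_text, lyrics_db):
    """Return song titles sorted from best to worst match (ties by title)."""
    query_words = tokenize(recognized_text)
    scored = [
        (overlap_score(query_words, tokenize(lyrics)), title)
        for title, lyrics in lyrics_db.items()
    ]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [title for _, title in scored]


def top1_accuracy(queries, lyrics_db):
    """Fraction of (recognized_text, true_title) pairs ranked first correctly."""
    hits = sum(
        1 for text, true_title in queries
        if rank_songs(text, lyrics_db)[0] == true_title
    )
    return hits / len(queries)


if __name__ == "__main__":
    # Toy lyrics database and noisy recognizer output, invented for illustration.
    lyrics_db = {
        "song_a": "hello darkness my old friend",
        "song_b": "let it be let it be",
    }
    queries = [("let it bee let be", "song_b"), ("hello old friend", "song_a")]
    print(rank_songs(queries[0][0], lyrics_db))
    print(top1_accuracy(queries, lyrics_db))
```

Because matching is done on whole words, a misrecognized word such as "bee" simply contributes nothing to the score, so retrieval can still succeed when the recognizer output is only partly correct.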


Bibliographic information

Source: EURASIP Journal on Audio, Speech, and Music Processing, 2010, issue 2010, pp. 546047.1-546047.11 (11 pages)
Authors: Annamaria Mesaros; Tuomas Virtanen
Affiliation: Department of Signal Processing, Tampere University of Technology, 33720 Tampere, Finland (listed for both authors)
Original format: PDF
Language: English

