Transcribing Lyrics from Commercial Song Audio: the First Step Towards Singing Content Processing


Abstract

Spoken content processing (such as retrieval and browsing) is maturing, but singing content is still almost completely left out. Songs are human voice carrying plenty of semantic information, just as speech does, and may be considered a special type of speech with highly flexible prosody. The various problems in song audio, for example phone durations that change significantly over highly flexible pitch contours, make the recognition of lyrics from song audio much more difficult. This paper reports an initial attempt towards this goal. We collected music-removed versions of English songs directly from commercial singing content. The best results were obtained by TDNN-BLSTM with data augmentation using 3-fold speed perturbation plus some special approaches. The WER achieved (73.90%) was significantly lower than the baseline (96.21%), but still relatively high.
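For readers unfamiliar with the metric, the sketch below shows how a word error rate (WER) of the kind quoted in the abstract is conventionally computed: the word-level edit distance between a reference transcript and a recognizer hypothesis, divided by the number of reference words. It is a minimal illustration, not the authors' scoring setup; the function names and the sample lyric strings are hypothetical, and only the two WER figures (96.21 and 73.90) come from the abstract.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing in hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(baseline_wer: float, new_wer: float) -> float:
    """Fraction of the baseline error removed by the new system."""
    return (baseline_wer - new_wer) / baseline_wer


if __name__ == "__main__":
    # Hypothetical reference/hypothesis pair, for illustration only.
    print(word_error_rate("let it be", "let be"))
    # Relative improvement implied by the two figures in the abstract.
    print(round(relative_reduction(96.21, 73.90), 4))
```

Under this definition, the reported drop from 96.21% to 73.90% corresponds to removing roughly 23% of the baseline errors in relative terms, which fits the abstract's description of the result as a clear improvement that still leaves the error rate high.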


Bibliographic record

Source
IEEE International Conference on Acoustics, Speech and Signal Processing | 2018 | pp. 5739-6377 | 5 pages
Authors
Che-Ping Tsai; Yi-Lin Tuan; Lin-Shan Lee
Original format: PDF
Chinese Library Classification: TN912-53
Keywords
Lyrics; Song Audio; Acoustic Model Adaptation; Genre; Prolonged Vowels


