Singing Voice Detection for Karaoke Application

机译：卡拉OK应用中的歌唱语音检测

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We present a framework to detect the regions of singing voice in musical audio signals. This work is oriented towards the development of a robust transcriber of lyrics for karaoke applications. The technique leverages on a combination of low-level audio features and higher level musical knowledge of rhythm and tonality. Musical knowledge of the key is used to create a song-specific filterbank to attenuate the presence of the pitched musical instruments. This is followed by subband processing of the audio to detect the musical octaves in which the vocals are present. Text processing is employed to approximate the duration of the sung passages using freely available lyrics. This is used to obtain a dynamic threshold for vocalon-vocal segmentation. This pairing of audio and text processing helps create a more accurate system. Experimental evaluation on a small database of popular songs shows the validity of the proposed approach. Holistic and per-component evaluation of the system is conducted and various improvements are discussed.

机译：我们提出了一个框架来检测音乐音频信号中歌声的区域。这项工作的目的是为卡拉OK应用程序开发强大的歌词转录器。该技术结合了低级音频功能和高级的节奏和音调音乐知识。按键的音乐知识用于创建特定于歌曲的滤波器组，以减弱音高乐器的存在。随后是音频的子带处理，以检测其中存在人声的音乐八度音阶。文本处理用于使用免费提供的歌词来估算演唱段落的持续时间。这用于获得声音/非声音分割的动态阈值。音频和文本处理的这种配对有助于创建更准确的系统。对流行歌曲的小型数据库进行的实验评估表明了该方法的有效性。进行了系统的整体和每个组件的评估，并讨论了各种改进。

著录项

来源
《Visual Communications and Image Processing 2005 pt.2》|2005年|P.752-762|共11页
会议地点 Beijing(CN)
作者
Arun Shenoy; Yuansheng Wu; Ye Wang;
展开▼
作者单位

School of Computing, National University of Singapore, 3 Science Drive 2, Singapore 117543;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类图像通信、多媒体通信;
关键词
Karaoke; singing voice; vocal segmentation; tonic; key; inverse comb filtering; rhythm; lyrics;

机译：卡拉OK;歌声;人声分割;声调;音调;反梳滤波;节奏;歌词;

相似文献

外文文献
中文文献
专利

1. KaraMIR: A Project for Cover Song Identification and Singing Voice Analysis Using a Karaoke Songs Dataset [J] . Ladislav Mar?ík, Petr Marti?ek, Jaroslav Pokorny, International journal of semantic computing . 2018,第4期

机译：Karamir：使用卡拉OK歌曲数据集进行封面歌曲识别和唱歌语音分析的项目
2. Singing voice outcomes following singing voice therapy [J] . Dastolfo-Hromack Christina, Thomas Tracey L., Rosen Clark A., The Laryngoscope: A Medical Journal for Clinical and Research Contributions in Otolaryngology, Head and Neck Medicine and Surgery, Facial Plastic and Reconstructive Surgery .. . 2016,第11期

机译：唱歌语音治疗后唱歌语音结果
3. Validation of the German version of the Singing Voice Handicap Index [Validierung des Singing Voice Handicap Index in der deutschen Fassung] [J] . LorenzA., KleberB., BüttnerM., HNO . 2013,第8期

机译：德文版的歌唱残障指数的验证[德文版的歌唱残障指数的验证]
4. Kara1k: A Karaoke Dataset for Cover Song Identification and Singing Voice Analysis [C] . Yann Bayle, Ladislav Maršík, Martin Rusek, IEEE International Symposium on Multimedia . 2017

机译：Kara1k：用于翻唱歌曲识别和歌唱语音分析的卡拉OK数据集
5. Singing in Life's Twilight: Serious Karaoke as Everyday Aging Practice in Urban Japan [D] . Tong, Koon Fung Benny. 2019

机译：在生活中唱歌：严肃的卡拉OK作为日本城市日常老龄化实践
6. Neural Dynamics of Karaoke-Like Voice Imitation in Singing Performance [O] . Sascha Frühholz, Wiebke Trost, Irina Constantinescu, 2020

机译：歌唱表演中类似卡拉OK语音模仿的神经动力学
7. 752 Singing Voice Detection for Karaoke Application [O] . Arun Shenoy, Yuansheng Wu, Ye Wang 2008

机译：752卡拉OK演唱语音检测

Singing Voice Detection for Karaoke Application

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅