Conference proceedings: Visual Communications and Image Processing 2005 pt.2

Singing Voice Detection for Karaoke Application


Abstract

We present a framework to detect the regions of singing voice in musical audio signals. This work is oriented towards the development of a robust transcriber of lyrics for karaoke applications. The technique leverages a combination of low-level audio features and higher-level musical knowledge of rhythm and tonality. Musical knowledge of the key is used to create a song-specific filterbank that attenuates the pitched musical instruments. This is followed by subband processing of the audio to detect the musical octaves in which the vocals are present. Text processing is employed to approximate the duration of the sung passages using freely available lyrics. This estimate is used to obtain a dynamic threshold for vocal/non-vocal segmentation. This pairing of audio and text processing helps create a more accurate system. Experimental evaluation on a small database of popular songs shows the validity of the proposed approach. Holistic and per-component evaluation of the system is conducted, and various improvements are discussed.
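
The abstract does not say how the lyric-derived duration is turned into a threshold. One plausible reading, offered only as a minimal sketch and not as the authors' method, is to pick the threshold so that the fraction of frames labelled vocal matches the fraction of the song that the lyrics suggest is sung. The function names, the per-frame `scores`, and the `sung_fraction` input are all hypothetical.

```python
from typing import List, Sequence


def dynamic_threshold(scores: Sequence[float], sung_fraction: float) -> float:
    """Return a threshold such that about `sung_fraction` of frames score at or above it.

    `scores` are hypothetical per-frame vocal-likelihood values (higher = more vocal).
    `sung_fraction` is an assumed estimate, e.g. lyric-derived sung seconds / song seconds.
    """
    if not scores:
        raise ValueError("scores must be non-empty")
    if not 0.0 < sung_fraction <= 1.0:
        raise ValueError("sung_fraction must be in (0, 1]")
    ranked = sorted(scores, reverse=True)
    # Number of frames expected to contain singing (at least one).
    n_vocal = max(1, round(sung_fraction * len(ranked)))
    # The score of the n_vocal-th highest frame becomes the threshold.
    return ranked[n_vocal - 1]


def segment(scores: Sequence[float], sung_fraction: float) -> List[bool]:
    """Label each frame True (vocal) or False (non-vocal) using the dynamic threshold."""
    threshold = dynamic_threshold(scores, sung_fraction)
    return [s >= threshold for s in scores]
```

For instance, if text processing suggested roughly 80 sung seconds in a 200-second song, `segment(scores, 80 / 200)` would mark about 40% of the frames as vocal. Ties in the scores can push the labelled fraction slightly above the target.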
