Singing voice separation with pre-learned dictionary and reconstructed voice spectrogram


Abstract

Recent work usually treats the mixture spectrogram of a song as a superposition of a sparse spectrogram and a low-rank spectrogram, which correspond to the vocal part and the accompaniment part of the song, respectively. Based on this observation, one can separate the singing voice from the background music. However, the quality of such a separation may be limited, since the vocal part may not be described very well by low rank, and moreover additional prior information about it, such as annotations, should be considered when designing a separation algorithm. Based on these considerations, in this paper we present two categories of time-frequency-based source separation algorithms. Specifically, the first models the vocal and instrumental spectrograms as a sparse matrix and a low-rank matrix, and additionally incorporates side information about the vocal part, i.e., the voice spectrogram reconstructed from the annotation. The second further models the vocal and instrumental spectrograms as a sparse matrix and a group-sparse matrix, respectively. Evaluations on the iKala dataset show that the proposed methods are effective and efficient for both the separated singing voice and the music accompaniment.
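
The sparse plus low-rank view mentioned at the start of the abstract can be illustrated with a generic robust PCA decomposition. The sketch below is only that baseline, solved with an inexact augmented Lagrangian scheme. It is not the paper's algorithms, which additionally use a pre-learned dictionary, annotation-based side information, and group sparsity. The function names (`rpca`, `soft_threshold`, `singular_value_threshold`), the parameter defaults (`lam`, `rho`, `tol`, `max_iter`), and the synthetic "spectrogram" are all assumptions made for illustration.

```python
import numpy as np


def soft_threshold(X, tau):
    """Elementwise shrinkage: proximal operator of tau * ||X||_1."""
    return np.sign(X) * np.maximum(np.abs(X) - tau, 0.0)


def singular_value_threshold(X, tau):
    """Shrink singular values: proximal operator of tau * ||X||_*."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return (U * np.maximum(s - tau, 0.0)) @ Vt


def rpca(M, lam=None, rho=1.5, tol=1e-7, max_iter=500):
    """Split M into a low-rank L and a sparse S with M ~= L + S.

    Minimizes ||L||_* + lam * ||S||_1 subject to L + S = M.
    All default values are illustrative assumptions.
    """
    M = np.asarray(M, dtype=float)
    norm_M = np.linalg.norm(M)  # Frobenius norm
    if norm_M == 0.0:
        return np.zeros_like(M), np.zeros_like(M)
    if lam is None:
        lam = 1.0 / np.sqrt(max(M.shape))
    mu = 1.25 / np.linalg.norm(M, 2)  # initial penalty (spectral norm)
    S = np.zeros_like(M)
    Y = np.zeros_like(M)  # Lagrange multiplier
    for _ in range(max_iter):
        L = singular_value_threshold(M - S + Y / mu, 1.0 / mu)
        S = soft_threshold(M - L + Y / mu, lam / mu)
        residual = M - L - S
        Y = Y + mu * residual
        mu *= rho  # tighten the equality constraint each iteration
        if np.linalg.norm(residual) <= tol * norm_M:
            break
    return L, S


# Synthetic stand-in for a magnitude spectrogram (frequency x time):
# a rank-2 "accompaniment" plus a few large, scattered "vocal" entries.
rng = np.random.default_rng(0)
accomp = np.abs(rng.standard_normal((64, 2))) @ np.abs(rng.standard_normal((2, 80)))
vocal = np.zeros((64, 80))
mask = rng.random((64, 80)) < 0.05
vocal[mask] = 5.0 * rng.random(int(mask.sum()))
mixture = accomp + vocal

L, S = rpca(mixture)
print("relative residual:", np.linalg.norm(mixture - L - S) / np.linalg.norm(mixture))
```

In a real pipeline the input would be the magnitude of a short-time Fourier transform of the mixture, and `S` and `L` would typically be turned into time-frequency masks before inverting back to audio. Those steps are omitted here.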

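Bibliographic details

Source: Neural computing & applications | 2020, Issue 8 | 12 pages
Original format: PDF
Text language: English
Chinese Library Classification: artificial neural network computers; artificial intelligence theory
Keywords: Singing voice separation; Low rank; Group-sparse; Dictionary Learning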

Similar literature

1. Singing voice separation with pre-learned dictionary and reconstructed voice spectrogram [J]. Neural computing & applications, 2020, Issue 8.
2. Singing Voice Separation by Low-Rank and Sparse Spectrogram Decomposition with Pre-learned Dictionaries [J]. Shiwei Yu, Hongjuan Zhang, Zhiyao Duan. Journal of the Audio Engineering Society, 2017, Issue 5.
3. Low-Rank Sparse Representation with Pre-Learned Dictionaries and Side Information for Singing Voice Separation [J]. Chenghong Yang, Hongjuan Zhang. Advances in Pure Mathematics, 2018, Issue 4.
4. Separation of Singing Voice from Music Accompaniment with Unvoiced Sounds Reconstruction for Monaural Recordings [C]. Chao-Ling Hsu, Jyh-Shing Roger Jang, Te-Lu Tsai. AES Convention, 2008.
5. Classifying Adolescent Singing Voices (Secondary, Music, Education, Choral) [D]. Wolverton, Vance D. 1985.
6. Aerosol emission of adolescents voices during speaking singing and shouting [O]. Dirk Mürbe, Martin Kriegel, Julia Lange, 2021.
7. Music and Voice Separation Using Log-Spectral Amplitude Estimator Based on Kernel Spectrogram Models Backfitting [O]. 2015.
