Sparse time-frequency representations in audio processing, as studied through a symmetrized lognormal model

Abstract

Time-frequency representations are ubiquitous in speech and audio signal processing, their use being motivated by both auditory physiology and the mathematics of Fourier analysis. Nonparametric statistical models (or equivalently transform based signal processing methods) formulated in this space provide a principled way to decompose sounds into their constituent parts, as well as an effective means of exploiting the local correlation present in the time-frequency structure of naturally generated acoustic signals. Here we describe how an appropriate generative statistical model, even under very simple assumptions, provides a means of exploring sparse time-frequency representations in audio. We introduce a symmetrized lognormal model for spectral coefficients, which shows good agreement across a broad range of speech samples taken from the TIMIT database, and demonstrate preliminary speech enhancement results based on a maximum a posteriori shrinkage estimator.

Bibliographic details

Source: European Signal Processing Conference, 2007, pp. 355-359 (5 pages)
Author: Patrick J. Wolfe
Original format: PDF

