Towards Interpretive Models for 2-D Processing of Speech.

Abstract

Two-dimensional (2-D) processing of speech has recently been explored as an alternative representational framework that explicitly analyzes temporal, spectral, and joint spectrotemporal energy fluctuations, or 'modulations', present in time-frequency distributions (e.g., in the spectrogram or auditory spectrogram). This paper considers 2-D Fourier analysis of local time-frequency regions of wideband spectrograms, a representation referred to as the (wideband) Grating Compression Transform (WGCT). We develop frequency-dependent models of speech signals in the WGCT context related to speech production characteristics, building on previous work in modeling narrowband-based GCT representations. The models are evaluated through simulations and error analysis. Comparison shows the effectiveness of the model and important distinctions, including 'dual' behavior, between the wideband and narrowband models. Our results motivate a novel taxonomy of speech signal behavior for use as an interpretive framework (i.e., in relation to speech production characteristics) for 2-D processing of speech using the GCT and potentially other 2-D approaches and time-frequency distributions. We demonstrate the ability of the model to represent real speech content by using demodulation techniques for analysis/synthesis of wideband spectrograms and for co-channel speaker separation using prior pitch information.
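The core operation named in the abstract, a 2-D Fourier transform of a local time-frequency patch, can be illustrated with a minimal sketch. This is not the authors' WGCT model or implementation. It assumes a synthetic patch in which pitch pulses appear as periodic striations along the time axis (as is typical of wideband spectrograms), a periodic Hann window, and mean removal before the transform. All function names, the patch size, the 1 ms frame hop, and the 250 Hz pitch are hypothetical choices made for illustration.

```python
import cmath
import math

def periodic_hann(n):
    """Periodic Hann window of length n (an assumed window choice)."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]

def dft(x):
    """Naive 1-D DFT; adequate for the small patch sizes used here."""
    n = len(x)
    return [sum(x[m] * cmath.exp(-2j * math.pi * k * m / n) for m in range(n))
            for k in range(n)]

def dft2(patch):
    """Separable 2-D DFT: transform every row, then every column."""
    rows = [dft(r) for r in patch]
    n_rows, n_cols = len(rows), len(rows[0])
    cols = [dft([rows[i][j] for i in range(n_rows)]) for j in range(n_cols)]
    return [[cols[j][i] for j in range(n_cols)] for i in range(n_rows)]

def synthetic_wideband_patch(n_freq, n_time, f0_hz, hop_s, depth=0.8):
    """Toy patch: rows are frequency bins, columns are time frames.
    Energy fluctuates periodically in time at the pitch rate f0_hz."""
    return [[1.0 + depth * math.cos(2 * math.pi * f0_hz * t * hop_s)
             for t in range(n_time)] for _ in range(n_freq)]

def local_2d_spectrum(patch):
    """Magnitude of the 2-D DFT of the mean-removed, windowed patch."""
    n_freq, n_time = len(patch), len(patch[0])
    mean = sum(map(sum, patch)) / (n_freq * n_time)
    wf, wt = periodic_hann(n_freq), periodic_hann(n_time)
    windowed = [[(patch[f][t] - mean) * wf[f] * wt[t] for t in range(n_time)]
                for f in range(n_freq)]
    return [[abs(v) for v in row] for row in dft2(windowed)]

def dominant_modulation(mag):
    """Largest-magnitude bin, searching only non-negative temporal bins
    (the other half is redundant for a real-valued patch)."""
    n_time = len(mag[0])
    candidates = ((mag[f][t], f, t)
                  for f in range(len(mag)) for t in range(n_time // 2 + 1))
    _, f_bin, t_bin = max(candidates, key=lambda c: c[0])
    return f_bin, t_bin

# Hypothetical settings: 8 frequency bins, 16 frames, 1 ms hop, 250 Hz pitch.
N_FREQ, N_TIME, HOP_S, F0_HZ = 8, 16, 0.001, 250.0
patch = synthetic_wideband_patch(N_FREQ, N_TIME, F0_HZ, HOP_S)
mag = local_2d_spectrum(patch)
f_bin, t_bin = dominant_modulation(mag)
# Convert the temporal-modulation bin to a rate in Hz.
f0_estimate = t_bin / (N_TIME * HOP_S)
print(f_bin, t_bin, f0_estimate)
```

In this toy setting the energy of the patch concentrates at a single temporal-modulation bin whose rate equals the pitch. A patch with structure that is periodic along frequency instead (harmonic lines, as in a narrowband spectrogram) would concentrate along the other axis, which is one simple way to picture the wideband/narrowband contrast the abstract mentions.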

Bibliographic record

Authors
Wang, T. T.; Quatieri, T. F.;
Year: 2011
Pages: 1-16
Total pages: 16
Original format: PDF
Text language: English
CLC classification: Industrial technology
Keywords
Speech; Error analysis; Narrowband; Taxonomy; 2-d processing of speech; Grating compression transform; Wideband spectrogram; Spectrogram reconstruction; Co-channel speaker separation;

Similar literature

1. Towards Interpretive Models for 2-D Processing of Speech [J]. Wang T.T., Quatieri T.F. Audio, Speech, and Language Processing, IEEE Transactions on. 2012, No. 7
2. Modelling of risk factors for defence aircraft industry using interpretive structural modelling, interpretive ranking process and system dynamics [J]. Selladurai Pitchaimuthu, Jitesh J. Thakkar, P.R.C. Gopal. Measuring Business Excellence. 2019, No. 3
3. Model identification of a noncausal 2-D AR process using a causal 2-D AR model on the nonsymmetric half-plane [J]. ByoungSeon Choi. IEEE Transactions on Signal Processing. 2003, No. 5
4. Interpretive Processing - Combining Data Driven Velocity Modelling and Geological Knowledge for [C]. R. Bastia, N. Kettouche, J.K. Fruehn I EAGE Conference Exhibition. 2009
5. Analytical Modeling, Testing, and Comparison of 1-D, 2-D, and 3-D Dewatering Process [D]. Ratnasamy, Ratnayesuraj Chelvarajah. 2017
6. A Brain for Speech. Evolutionary Continuity in Primate and Human Auditory-Vocal Processing [O]. Francisco Aboitiz. 2018
7. Interpreting Mars ionospheric anomalies over crustal magnetic field regions using a 2-D ionospheric model [O]. Majd Matta, Michael Mendillo, Paul Withers. 2015
