2-D Processing of Speech with Application to Pitch and Formant Estimation

Abstract

The grating compression transform (GCT) maps harmonically related signal components to a concentrated entity in a spatial 2-D frequency plane. The GCT forms the basis of a pitch estimator that uses the radial distance to the largest peak of the GCT. The resulting pitch estimator appears robust under noise conditions and amenable to extension to two-speaker pitch estimation. The GCT also forms the basis of a formant estimator that exploits the separability of speech source and vocal tract information via changing pitch. Although the spectrogram provides a useful starting point for the GCT, alternate transforms can provide improved performance; the fan-chirp transform is one possibility.

Possible GCT directions:

  • Alternate time-frequency distributions
  • Pitch estimation: extended evaluation on a larger corpus and use of voiced/unvoiced speech; two-speaker pitch estimation
  • Formant estimation in noise
  • The GCT as a model of auditory cortical processing (Shamma, Ezzat, and Poggio)
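The pitch-estimation idea in the abstract can be illustrated with a minimal sketch. It is not the report's implementation. It assumes the GCT can be approximated by the magnitude of a 2-D Fourier transform of a localized spectrogram patch, and that harmonics appear in that patch as roughly parallel lines. The function names, the `hz_per_bin` and `min_radius` parameters, the mean removal, and the Hamming window are all illustrative assumptions and do not come from the source.

```python
import numpy as np


def synthetic_harmonic_patch(f0_hz, hz_per_bin, n_freq=128, n_time=32):
    """Hypothetical test input: harmonic lines spaced f0_hz apart, constant in time."""
    k = np.arange(n_freq)[:, None]  # frequency-bin index as a column vector
    column = 1.0 + np.cos(2.0 * np.pi * k * hz_per_bin / f0_hz)
    return np.repeat(column, n_time, axis=1)  # shape: (n_freq, n_time)


def gct_pitch_estimate(patch, hz_per_bin, min_radius=0.02):
    """Estimate pitch in Hz from a spectrogram patch (frequency bins x time frames).

    Assumption: the GCT is approximated by |2-D FFT| of the windowed patch.
    """
    n_freq, n_time = patch.shape

    # Remove the mean and taper the edges to reduce the DC term and leakage.
    window = np.outer(np.hamming(n_freq), np.hamming(n_time))
    gct = np.abs(np.fft.fftshift(np.fft.fft2((patch - patch.mean()) * window)))

    # Spatial-frequency axes: cycles per frequency bin and cycles per frame.
    wf = np.fft.fftshift(np.fft.fftfreq(n_freq))
    wt = np.fft.fftshift(np.fft.fftfreq(n_time))
    WF, WT = np.meshgrid(wf, wt, indexing="ij")
    radius = np.hypot(WF, WT)

    # Ignore the region around the origin, then locate the largest peak.
    gct[radius < min_radius] = 0.0
    i, j = np.unravel_index(np.argmax(gct), gct.shape)

    # Radial distance to the peak and its angle from the frequency axis.
    rho = radius[i, j]
    theta = np.arctan2(WT[i, j], WF[i, j])
    along_freq = rho * abs(np.cos(theta))
    if along_freq == 0.0:
        raise ValueError("peak has no component along the frequency axis")

    # Harmonic spacing along the frequency axis, converted from bins to Hz.
    return hz_per_bin / along_freq
```

For a stationary pitch the harmonic lines are horizontal, so the angle is zero and the estimate reduces to the reciprocal of the radial distance. Calling `gct_pitch_estimate(synthetic_harmonic_patch(160.0, 10.0), 10.0)` on this idealized patch recovers the 160 Hz spacing. Real speech would additionally need a narrowband spectrogram front end and voicing decisions, which are outside this sketch.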
