IEICE Electronics Express

Linear-scale perceptual feature extraction for Speech Bandwidth Extensions


Abstract

This paper presents a new method for extracting a linear-scale perceptual feature as a substitute for MFCCs in the highband (3.4 kHz and above) in speech bandwidth extension (BWE). The feature extraction method is based on mel-scale constrained nonnegative matrix factorization (NMF), which decomposes the linear-scale log spectrum into a linear combination of mel-scale latent variables. Whereas MFCC parametrization contains non-invertible procedures, the proposed feature is represented on a linear scale and is suitable for recovering the highband time-domain speech. Experimental results show that, used together with narrowband MFCCs, the proposed feature achieves better instrumental performance than the real cepstrum, without additional computation.
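
The abstract does not give the exact formulation, so the following is only a minimal sketch of one way a mel-scale constrained factorization could look, not the authors' algorithm. It assumes the constraint means fixing the NMF basis to triangular mel filters sampled on linear-frequency highband bins, assumes a Euclidean cost with multiplicative updates, and assumes the log spectrum has been offset to be nonnegative. The function names (`mel_basis`, `fit_activations`) and all parameter values are hypothetical.

```python
import numpy as np


def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_basis(n_bins, n_filters, f_lo, f_hi):
    """Triangular mel filters sampled on linearly spaced bins in [f_lo, f_hi].

    Returns a nonnegative matrix W of shape (n_bins, n_filters).
    Assumption: this fixed W plays the role of the mel-scale constraint.
    """
    freqs = np.linspace(f_lo, f_hi, n_bins)
    edges = mel_to_hz(np.linspace(hz_to_mel(f_lo), hz_to_mel(f_hi), n_filters + 2))
    W = np.zeros((n_bins, n_filters))
    for k in range(n_filters):
        lo, mid, hi = edges[k], edges[k + 1], edges[k + 2]
        rising = (freqs - lo) / (mid - lo)
        falling = (hi - freqs) / (hi - mid)
        W[:, k] = np.clip(np.minimum(rising, falling), 0.0, None)
    return W


def fit_activations(V, W, n_iter=200, eps=1e-12):
    """Estimate H >= 0 such that V is approximately W @ H, with W held fixed.

    Uses the standard multiplicative update for the Euclidean NMF cost.
    V must be nonnegative (assumed: an offset linear-scale log spectrum,
    one column per frame).
    """
    rng = np.random.default_rng(0)
    H = rng.random((W.shape[1], V.shape[1])) + eps
    WtV = W.T @ V
    WtW = W.T @ W
    for _ in range(n_iter):
        H *= WtV / (WtW @ H + eps)
    return H


if __name__ == "__main__":
    # Synthetic stand-in for highband log-spectral frames (not real speech).
    W = mel_basis(n_bins=128, n_filters=12, f_lo=3400.0, f_hi=8000.0)
    V = W @ np.random.default_rng(1).random((12, 5))
    H = fit_activations(V, W)
    rel_err = np.linalg.norm(V - W @ H) / np.linalg.norm(V)
    print("relative reconstruction error:", rel_err)
```

In this sketch the columns of `H` act as the mel-scale latent variables, while the reconstruction `W @ H` stays on the linear frequency axis, which is the property the abstract contrasts with the non-invertible MFCC pipeline.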
