【24h】

Speaker recognition based on transformed line spectral frequencies

机译:基于变换后的线谱频率的说话人识别

获取原文

摘要

Line spectral frequencies (LSF) and five types of transformed LSF are studied for robust text-independent speaker identification. Transformations are constructed by considering physical aspects of the vocal tract. These aspects are: location of formantsulls; bandwidth of formantsulls; bandwidth and location of formants; bandwidth and location of nulls; interval of adjacent formant and null locations. Identification tests using the TIMIT database verify that all features are useful for speaker recognition; the bandwidth and location of formants, especially, show the best performance. Simulation results also show that LSF and some of the transformed LSF give better performance than Mel-frequency cepstral coefficient (MFCC).
机译:研究了线性频谱频率(LSF)和五种转换后的LSF,以实现鲁棒的与文本无关的说话人识别。转换是通过考虑声道的物理方面来构造的。这些方面是:共振峰/零点的位置;共振峰/零点的带宽;共振峰的带宽和位置;空值的带宽和位置;相邻共振峰和空位置的间隔。使用TIMIT数据库进行的识别测试可验证所有功能均有助于说话人识别;共振峰的带宽和位置尤其显示出最佳性能。仿真结果还表明,LSF和某些经过变换的LSF的性能优于梅尔频率倒谱系数(MFCC)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号