首页> 外文期刊>International Journal of Biometric and Bioinformatics >Performance Comparison Of 2-D DCT On Full/Block Spectrogram And 1-D DCT on Row Mean of Spectrogram for Speaker Identification
【24h】

Performance Comparison Of 2-D DCT On Full/Block Spectrogram And 1-D DCT on Row Mean of Spectrogram for Speaker Identification

机译:完整/分组频谱图上的二维DCT和频谱图行均值上的1-D DCT用于说话人识别的性能比较

获取原文
获取外文期刊封面目录资料

摘要

The goal of this paper is to present a very simple approach to text dependent speaker identification using a combination of spectrograms and well known Discrete Cosine Transform (DCT). This approach is based on use of DCT to find similarities between spectrograms obtained from speech samples. The set of spectrograms forms the database for our experiments rather than raw speech samples. Performance of this approach is compared for different number of coefficients of DCT when DCT is applied on entire spectrogram, when DCT is applied to spectrogram divided into blocks and when DCT is applied to the Row Mean of a spectrogram. Performance comparison shows that, number of mathematical computations required for DCT on Row Mean of spectrogram method is drastically less as compared to other two methods with almost equal identification rate.
机译:本文的目的是通过结合频谱图和众所周知的离散余弦变换(DCT),提出一种非常简单的方法来识别与文本相关的说话人。此方法基于使用DCT来查找从语音样本获得的频谱图之间的相似性。这组频谱图形成了我们实验的数据库,而不是原始语音样本。当将DCT应用于整个频谱图时,将DCT应用于分成块的频谱图时以及将DCT应用于频谱图的行均值时,将针对不同数量的DCT系数比较此方法的性能。性能比较表明,与其他两种几乎具有相同识别率的方法相比,在频谱图方法的行均值上进行DCT所需的数学计算次数大大减少。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号