Performance Comparison Of 2-D DCT On Full/Block Spectrogram And 1-D DCT on Row Mean of Spectrogram for Speaker Identification

H. B. Kekre; Prachi J. Natu; Shachi J. Natu; Tanuja Kiran Sarode

首页> 外文期刊>International Journal of Biometric and Bioinformatics >Performance Comparison Of 2-D DCT On Full/Block Spectrogram And 1-D DCT on Row Mean of Spectrogram for Speaker Identification

【24h】

Performance Comparison Of 2-D DCT On Full/Block Spectrogram And 1-D DCT on Row Mean of Spectrogram for Speaker Identification

机译：完整/分组频谱图上的二维DCT和频谱图行均值上的1-D DCT用于说话人识别的性能比较

获取原文

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The goal of this paper is to present a very simple approach to text dependent speaker identification using a combination of spectrograms and well known Discrete Cosine Transform (DCT). This approach is based on use of DCT to find similarities between spectrograms obtained from speech samples. The set of spectrograms forms the database for our experiments rather than raw speech samples. Performance of this approach is compared for different number of coefficients of DCT when DCT is applied on entire spectrogram, when DCT is applied to spectrogram divided into blocks and when DCT is applied to the Row Mean of a spectrogram. Performance comparison shows that, number of mathematical computations required for DCT on Row Mean of spectrogram method is drastically less as compared to other two methods with almost equal identification rate.

机译：本文的目的是通过结合频谱图和众所周知的离散余弦变换（DCT），提出一种非常简单的方法来识别与文本相关的说话人。此方法基于使用DCT来查找从语音样本获得的频谱图之间的相似性。这组频谱图形成了我们实验的数据库，而不是原始语音样本。当将DCT应用于整个频谱图时，将DCT应用于分成块的频谱图时以及将DCT应用于频谱图的行均值时，将针对不同数量的DCT系数比较此方法的性能。性能比较表明，与其他两种几乎具有相同识别率的方法相比，在频谱图方法的行均值上进行DCT所需的数学计算次数大大减少。

著录项

来源
《International Journal of Biometric and Bioinformatics》 |2010年第3期|共页
作者
H. B. Kekre; Prachi J. Natu; Shachi J. Natu; Tanuja Kiran Sarode;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
专利

1. SPEAKER IDENTIFICATION USING 2-D DCT, WALSH AND HAAR ON FULL AND BLOCK SPECTROGRAM [J] . Dr. H. B. Kekre, Dr. Tanuja K. Sarode, Shachi J. Natu, International Journal on Computer Science and Engineering . 2010,第5期

机译：扬声器识别使用2-D DCT，WALSH和HAAR全部和块谱图
2. Block classification for an adaptive 1-D/2-D DCT video coding [J] . Tan K.H., Ghanbari M. IEE Proceedings. Part K . 1996,第3期

机译：自适应1-D / 2-D DCT视频编码的块分类
3. Speaker Identification using Row Mean of DCT and Walsh Hadamard Transform [J] . Dr. H B Kekre, Vaishali Kulkarni, Sunil Venkatraman, International Journal on Computer Science and Engineering . 2011,第3期

机译：使用DCT和Walsh Hadamard变换的行均值的说话人识别
4. Comparison between the performance of spectrogram and multi-window spectrogram in digital modulated communication signals [C] . Tan Jo Lynn, Ahmad Zuri bin Shaameri IEEE International Confernece on Telecommunications and Malaysia International Confernece on Communications . 2007

机译：数字调制通信信号中频谱图和多窗谱图性能的比较
5. Performance comparison of MCTF-type codecs and hybrid DCT-based codecs. [D] . Bhamidipati, Phanikumar K. 2004

机译：MCTF型编解码器和基于混合DCT的编解码器的性能比较。
6. Comparison of the dose on specific 3DCT images and the accumulated dose for cardiac structures in esophageal tumors radiotherapy: whether specific 3DCT images can be used for dose assessment? [O] . Ying Tong, Guanzhong Gong, Ming Su, 2019

机译：特定3DCT图像上的剂量与食管肿瘤放射治疗中心脏结构累积剂量的比较：是否可以将特定3DCT图像用于剂量评估？
7. Performance Comparison of Speaker Identification Using DCT, Walsh, Haar on Full and Row Mean of Spectrogram [O] . Dr. H. B. Kekre, Senior Professor, Dr. T. K. Sarode, 2011

机译：使用DCT，Walsh，Haar进行频谱图全时和行均值的说话人识别的性能比较

Performance Comparison Of 2-D DCT On Full/Block Spectrogram And 1-D DCT on Row Mean of Spectrogram for Speaker Identification

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅