Integration of Speaker and Speech Recognition Systems



Abstract

Speech is the primary mode of human communication and the most natural and efficient way for people to exchange information. This paper presents a detailed study of a text-dependent speaker and speech recognition system. The speaker recognition system uses vector quantization (VQ) as the modeling technique, with features of the speech signal extracted as Mel Frequency Cepstral Coefficients (MFCC). The K-means clustering algorithm is used to obtain the vector-quantized codebook. For the speech recognition system, the formant frequencies of the word sample are used to determine the unknown word. The speaker recognition system yields its highest accuracy with a Hanning window and Mel perceptual feature extraction realized with a 35-filter bank. Accuracy also improves as the number of vectors in the VQ codebook is increased from 64 to 100. For speech recognition, the highest accuracy obtained is 95%.
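The VQ modeling step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes plain k-means (Lloyd's algorithm) with squared Euclidean distance, identifies a speaker by the lowest average distortion, and uses synthetic Gaussian vectors as a hypothetical stand-in for real MFCC frames. The function names, the 12-dimensional features, and the codebook size of 16 (chosen for speed, smaller than the 64 to 100 vectors mentioned above) are all assumptions made for illustration.

```python
import numpy as np


def train_codebook(features, codebook_size, n_iter=20, seed=0):
    """Build a VQ codebook from feature vectors with plain k-means (Lloyd's algorithm)."""
    rng = np.random.default_rng(seed)
    # Initialise codewords from randomly chosen training vectors.
    idx = rng.choice(len(features), size=codebook_size, replace=False)
    codebook = features[idx].copy()
    for _ in range(n_iter):
        # Assign every vector to its nearest codeword (squared Euclidean distance).
        dists = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
        nearest = dists.argmin(axis=1)
        for k in range(codebook_size):
            members = features[nearest == k]
            if len(members) > 0:  # an empty cluster keeps its previous codeword
                codebook[k] = members.mean(axis=0)
    return codebook


def average_distortion(features, codebook):
    """Mean squared distance from each vector to its nearest codeword."""
    dists = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return dists.min(axis=1).mean()


def identify_speaker(features, codebooks):
    """Return the enrolled speaker whose codebook gives the lowest distortion."""
    return min(codebooks, key=lambda name: average_distortion(features, codebooks[name]))


def make_fake_features(center, n_frames, rng, dim=12):
    """Hypothetical stand-in for MFCC frames: a Gaussian cloud around a per-speaker center."""
    return center + rng.normal(scale=1.0, size=(n_frames, dim))


rng = np.random.default_rng(42)
centers = {"speaker_a": np.full(12, -5.0), "speaker_b": np.full(12, 5.0)}
# Enrolment: one codebook per speaker.
codebooks = {
    name: train_codebook(make_fake_features(c, 400, rng), codebook_size=16)
    for name, c in centers.items()
}
# Test: an unseen utterance from speaker_b should match speaker_b's codebook best.
test_utterance = make_fake_features(centers["speaker_b"], 100, rng)
print(identify_speaker(test_utterance, codebooks))
```

With real speech, the synthetic vectors would be replaced by MFCC frames computed from windowed audio, and the codebook size would become a tuning parameter, as the abstract's comparison of 64 and 100 vectors suggests.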


Bibliographic details

Source
IEEE International Conference on Signal Processing Systems, 2011, pp. 325-329 (5 pages)
Authors
Isha Dhawan; Neelu Jain
Original format
PDF
CLC classification
Communication theory
Keywords
Formants; Feature Extraction; K-means clustering; Mel Frequency Cepstral Coefficients; Vector Quantization


