首页> 外国专利> A Robust Speaker Recognition Algorithm Using the Wavelet Transform

A Robust Speaker Recognition Algorithm Using the Wavelet Transform

机译：基于小波变换的鲁棒说话人识别算法

页面导航

摘要
著录项
相似文献

摘要

PURPOSE: A system for identifying a speaker strong to an external noise is provided to use a wavelet transform to separate original signals into four subbands, and to construct independent codebooks for three frequency bands having excellent capacities to finally have one decision-making value, so as to prevent a noise of a subband from influencing other subbands. CONSTITUTION: A voice detector detects a voice start point and a voice end point. A voice analyzer analyzes voices of each word, and finally finds a linear prediction coefficient and a mel-frequency ceptrum coefficient. If an algorithm is a vector quantization algorithm, a trainer makes codebooks representing each voice by using a K-means clustering algorithm for specific vectors obtained from the voice analyzer. A recognizer compares inputted speaker data with the codebooks to select a codebook having the nearest vector space distance, and decides a speaker corresponding to the codebook as recognition.

机译：目的：提供一种用于识别对外部噪声影响强的说话者的系统，该系统使用小波变换将原始信号分成四个子带，并为三个具有出色容量的频带构建独立的码本，从而最终具有一个决策价值，因此以防止子带的噪声影响其他子带。组成：语音检测器检测语音起点和语音终点。语音分析器分析每个单词的语音，最后找到线性预测系数和梅尔频率感受系数。如果算法是矢量量化算法，则培训师将针对从语音分析器获得的特定矢量使用K-均值聚类算法，制作代表每个语音的码本。识别器将输入的说话者数据与代码簿进行比较，以选择具有最接近向量空间距离的代码簿，并将与该代码簿相对应的说话者确定为识别者。

著录项

公开/公告号KR100436305B1

专利类型
公开/公告日2004-06-23

原文格式PDF
申请/专利权人
展开▼

申请/专利号KR20020015517
发明设计人 전명근;
展开▼

申请日2002-03-22
分类号G10L17/00;
国家 KR
入库时间 2022-08-21 22:46:56

相似文献

专利
外文文献
中文文献