A Robust Speaker Recognition Algorithm Using the Wavelet Transform

Abstract

PURPOSE: To provide a speaker identification system that is robust to external noise. A wavelet transform separates the original signal into four subbands, and independent codebooks are built for the three frequency bands that perform well; their results are combined into a single decision value, so that noise in one subband does not influence the other subbands. CONSTITUTION: A voice detector detects the start point and end point of speech. A voice analyzer analyzes the speech of each word and computes linear prediction coefficients and mel-frequency cepstrum coefficients. When the algorithm is vector quantization, a trainer applies K-means clustering to the feature vectors obtained from the voice analyzer to build a codebook representing each voice. A recognizer compares the input speaker data with the codebooks, selects the codebook with the smallest vector-space distance, and identifies the speaker corresponding to that codebook as the recognition result.
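The trainer and recognizer steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say how the three subband results are merged into one decision value, so summing the per-band average distortions is an assumption, as are the function names (`train_codebook`, `distortion`, `recognize`), the codebook size, and the synthetic 2-D vectors that stand in for real LPC or MFCC features.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def train_codebook(vectors, k, iters=20, seed=0):
    """K-means clustering: return k centroids (the codebook) for the vectors."""
    rng = random.Random(seed)
    codebook = [list(v) for v in rng.sample(vectors, k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for v in vectors:
            nearest = min(range(k), key=lambda i: sq_dist(v, codebook[i]))
            clusters[nearest].append(v)
        for i, members in enumerate(clusters):
            if members:  # an empty cluster keeps its previous centroid
                codebook[i] = [sum(col) / len(members) for col in zip(*members)]
    return codebook

def distortion(vectors, codebook):
    """Average squared distance from each vector to its nearest codeword."""
    return sum(min(sq_dist(v, c) for c in codebook) for v in vectors) / len(vectors)

def recognize(band_vectors, speaker_models):
    """Return the speaker whose per-band codebooks give the lowest total distortion.

    band_vectors:   {band_name: [feature_vector, ...]} for the unknown speaker
    speaker_models: {speaker: {band_name: codebook}}
    Summing the band distortions is an assumed way to form one decision value.
    """
    def score(model):
        return sum(distortion(band_vectors[b], model[b]) for b in model)
    return min(speaker_models, key=lambda s: score(speaker_models[s]))

def make_vectors(center, n, rng):
    """Synthetic stand-in for per-frame features (hypothetical data)."""
    return [[c + rng.gauss(0, 0.1) for c in center] for _ in range(n)]

rng = random.Random(1)
bands = ["band1", "band2", "band3"]          # the three retained subbands
centers = {"alice": [0.0, 0.0], "bob": [3.0, 3.0]}
models = {
    speaker: {b: train_codebook(make_vectors(c, 40, rng), k=4) for b in bands}
    for speaker, c in centers.items()
}
test_input = {b: make_vectors(centers["bob"], 10, rng) for b in bands}
print(recognize(test_input, models))
```

Because each band has its own codebook and its own distortion term, a noisy band only degrades its own term in the sum. A real system would first split the signal into subbands with a wavelet transform and extract features per band, which this sketch omits.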

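Bibliographic data

  • Publication number: KR100436305B1

  • Patent type:

  • Publication date: 2004-06-23

  • Original format: PDF

  • Applicant/patentee:

  • Application number: KR20020015517

  • Inventor: 전명근

  • Filing date: 2002-03-22

  • Classification: G10L17/00

  • Country: KR

  • Database entry time: 2022-08-21 22:46:56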
