...
首页> 外文期刊>Procedia Computer Science >Robust Speaker Identification Incorporating High Frequency Features
【24h】

Robust Speaker Identification Incorporating High Frequency Features

机译:结合高频功能的强大扬声器识别功能

获取原文
           

摘要

Speaker identification system identifies the person by his/her speech sample. Speaker Identification (SI) system should posses a robust feature extraction unit and a good classifier. Mel frequency cepstral coefficient (MFCC) is very old feature extraction scheme, which has been regarded as standard set of feature vectors for speaker identification. The mel filter bank used in MFCC method, captures the speaker information more effectively in lower frequencies than higher frequencies. Hence high frequency region characteristics are lost. This problem is solved in the proposed method. The speech signal comprises both voiced and unvoiced segments. The voiced segment includes high energy, low frequency components and unvoiced segment includes low energy, high frequency components. In proposed method, the speech sample is divided into voiced and unvoiced segments. The voiced speech segment is filtered using mel filter bank to generate MFCC from lower frequencies of speech signal and unvoiced speech segment is filtered using inverted mel filter bank to generate IMFCC from higher frequencies of speech signal.
机译:说话者识别系统通过他/她的语音样本识别该人。说话人识别(SI)系统应具有强大的特征提取单元和良好的分类器。梅尔频率倒谱系数(MFCC)是一种非常古老的特征提取方案,已被视为说话人识别的标准特征向量集。 MFCC方法中使用的梅尔滤波器组,在低频下比高频下更有效地捕获说话者信息。因此,高频区域特性丢失。在提出的方法中解决了这个问题。语音信号包括有声段和无声段。浊音段包括高能量,低频分量,清音段包括低能量,高频分量。在所提出的方法中,语音样本被分为有声段和无声段。使用mel滤波器组对发声的语音段进行滤波,以从较低频率的语音信号生成MFCC;使用反向mel滤波器组对发声的语音段进行滤波,以从较高的语音信号频率生成IMFCC。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号