Robust Speaker Identification Incorporating High Frequency Features

Latha

Abstract

A speaker identification (SI) system identifies a person from a speech sample. Such a system should possess a robust feature extraction unit and a good classifier. Mel frequency cepstral coefficients (MFCC) are a long-established feature extraction scheme and are regarded as the standard feature set for speaker identification. The mel filter bank used in the MFCC method captures speaker information more effectively at lower frequencies than at higher frequencies, so the characteristics of the high-frequency region are lost. The proposed method addresses this problem. A speech signal comprises both voiced and unvoiced segments: voiced segments contain high-energy, low-frequency components, and unvoiced segments contain low-energy, high-frequency components. In the proposed method, the speech sample is divided into voiced and unvoiced segments. The voiced segments are filtered with a mel filter bank to generate MFCC from the lower frequencies of the speech signal, and the unvoiced segments are filtered with an inverted mel filter bank to generate IMFCC from the higher frequencies.
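
The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: the abstract gives no formulas, so the energy and zero-crossing voicing rule, its thresholds, the filter count, the FFT size, and the sample rate are all assumptions, and every function name here is hypothetical. The inverted bank is built by mirroring the mel bank along both axes, which is one common way to construct an inverted mel filter bank.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=float) / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)

def mel_filter_bank(n_filters=20, n_fft=512, sample_rate=16000):
    """Triangular filters spaced evenly in mel (dense at low frequencies)."""
    n_bins = n_fft // 2 + 1
    # n_filters + 2 edge points, equally spaced in mel, mapped back to Hz.
    edges_hz = mel_to_hz(np.linspace(0.0, hz_to_mel(sample_rate / 2), n_filters + 2))
    bin_hz = np.linspace(0.0, sample_rate / 2, n_bins)
    bank = np.zeros((n_filters, n_bins))
    for i in range(n_filters):
        lo, mid, hi = edges_hz[i], edges_hz[i + 1], edges_hz[i + 2]
        rising = (bin_hz - lo) / (mid - lo)
        falling = (hi - bin_hz) / (hi - mid)
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank

def inverted_mel_filter_bank(n_filters=20, n_fft=512, sample_rate=16000):
    """Mirror the mel bank in filter order and frequency (dense at high frequencies)."""
    return mel_filter_bank(n_filters, n_fft, sample_rate)[::-1, ::-1].copy()

def is_voiced(frame, energy_thresh=0.01, zcr_thresh=0.25):
    """Assumed voicing rule: high energy and low zero-crossing rate."""
    frame = np.asarray(frame, dtype=float)
    energy = np.mean(frame ** 2)
    zcr = np.mean(np.abs(np.diff(np.signbit(frame).astype(int))))
    return bool(energy > energy_thresh and zcr < zcr_thresh)

def cepstra(frame, bank, n_ceps=13):
    """Log filter-bank energies followed by an unnormalized DCT-II."""
    frame = np.asarray(frame, dtype=float)
    n_fft = 2 * (bank.shape[1] - 1)
    power = np.abs(np.fft.rfft(frame * np.hamming(len(frame)), n_fft)) ** 2
    log_energy = np.log(bank @ power + 1e-10)
    n = bank.shape[0]
    k = np.arange(n_ceps)[:, None]
    basis = np.cos(np.pi * k * (np.arange(n) + 0.5) / n)
    return basis @ log_energy

def extract_features(frame, sample_rate=16000, n_filters=20, n_fft=512):
    """Return ("MFCC", coeffs) for voiced frames and ("IMFCC", coeffs) otherwise."""
    if is_voiced(frame):
        return "MFCC", cepstra(frame, mel_filter_bank(n_filters, n_fft, sample_rate))
    return "IMFCC", cepstra(frame, inverted_mel_filter_bank(n_filters, n_fft, sample_rate))

# Usage: a 25 ms low-frequency tone stands in for a voiced frame,
# and low-level noise stands in for an unvoiced frame.
t = np.arange(400) / 16000.0
tone = np.sin(2 * np.pi * 100 * t)
noise = 0.05 * np.random.default_rng(0).standard_normal(400)
tone_label, tone_feats = extract_features(tone)
noise_label, noise_feats = extract_features(noise)
```

Routing each frame to a single filter bank keeps the sketch close to the description above. A real system would also need framing, pre-emphasis, and a classifier, none of which the abstract specifies.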


Bibliographic details

Source: Procedia Computer Science, 2016, Issue 1, 8 pages
Author: Latha
Original format: PDF
Chinese Library Classification: Computing technology, computer technology

