Speaker recognition based on transformed line spectral frequencies

机译：基于变换后的线谱频率的说话人识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Line spectral frequencies (LSF) and five types of transformed LSF are studied for robust text-independent speaker identification. Transformations are constructed by considering physical aspects of the vocal tract. These aspects are: location of formantsulls; bandwidth of formantsulls; bandwidth and location of formants; bandwidth and location of nulls; interval of adjacent formant and null locations. Identification tests using the TIMIT database verify that all features are useful for speaker recognition; the bandwidth and location of formants, especially, show the best performance. Simulation results also show that LSF and some of the transformed LSF give better performance than Mel-frequency cepstral coefficient (MFCC).

机译：研究了线性频谱频率（LSF）和五种转换后的LSF，以实现鲁棒的与文本无关的说话人识别。转换是通过考虑声道的物理方面来构造的。这些方面是：共振峰/零点的位置;共振峰/零点的带宽;共振峰的带宽和位置;空值的带宽和位置;相邻共振峰和空位置的间隔。使用TIMIT数据库进行的识别测试可验证所有功能均有助于说话人识别;共振峰的带宽和位置尤其显示出最佳性能。仿真结果还表明，LSF和某些经过变换的LSF的性能优于梅尔频率倒谱系数（MFCC）。

著录项

来源
《Intelligent Signal Processing and Communication Systems, 2004. ISPACS 2004. Proceedings of 2004 International Symposium on》|2004年|p.177-180|共4页
会议地点
作者
Bong Jin Lee; Kim S.; Hong-Goo Kang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词
speaker recognition; spectral analysis; Gaussian processes; covariance matrices; speaker recognition; transformed line spectral frequencies; text-independent speaker identification; vocal tract physical aspects; formant bandwidth; formant location; null bandwidth; null location; TIMIT database; Mel-frequency cepstral coefficient; Gaussian mixture model; diagonal covariance matrices;

机译：说话人识别;频谱分析;高斯过程;协方差矩阵;说话人识别;变换后的线谱频率;与文本无关的说话人识别;声道物理方面;共振峰带宽;共振峰位置;零带宽;零点位置; TIMIT数据库;梅尔频率倒谱系数高斯混合模型对角协方差矩阵;

相似文献

外文文献
中文文献
专利

1. Fractional Fourier transform based features for speaker recognition using support vector machine [J] . Pawan K. Ajmera, Raghunath S. Holambe Computers and Electrical Engineering . 2013,第2期

机译：基于分数傅里叶变换的说话人识别特征，使用支持向量机
2. The Wavelet and Fourier Transforms in Feature Extraction for Text-Dependent, Filterbank-Based Speaker Recognition [J] . Claude Turner, Anthony Joseph, Murat Aksu, Procedia Computer Science . 2011,第1期

机译：特征提取中的小波和傅立叶变换，用于基于文本的，基于滤波器组的说话人识别
3. Spectral shifting of speaker-specific information for narrow band telephonic speaker recognition [J] . Thiruvaran T., Sethu V., Ambikairajah E., Electronics Letters . 2015,第25期

机译：特定于说话者的信息的频谱移位，可用于窄带电话说话者识别
4. Speaker recognition based on transformed line spectral frequencies [C] . Bong Jin Lee, Kim S., Hong-Goo Kang International Symposium on Intelligent Signal Processing and Communication Systems . 2004

机译：基于转换线谱频率的扬声器识别
5. English phoneme and word recognition by nonnative English speakers as a function of spectral resolution and English experience. [D] . Padilla, Monica. 2003

机译：非母语英语使用者的英语音素和单词识别与频谱分辨率和英语体验的关系。
6. Wavelet transform-based photoacoustic time-frequency spectral analysis for bone assessment [O] . Weiya Xie, Ting Feng, Mengjiao Zhang, 2021

机译：基于小波变换的骨骼评估的光声时频谱分析
7. Automatic speaker recognition dynamic feature identification and classification using distributed discrete cosine transform based mel frequency cepstral coefficients and fuzzy vector quantization [O] . Hossan M 2011

机译：基于分布式离散余弦变换的梅尔频率倒谱系数和模糊矢量量化自动说话人识别动态特征识别与分类
8. Spectral 'Fingerprinting' of Phytoplankton Populations by Two-Dimensional Fluorescence and Fourier-Transform-Based Pattern Recognition [R] . Oldham, P. B., Zillioux, E. J., Warner, I. M. 1985

机译：基于二维荧光和傅里叶变换的模式识别浮游植物种群的光谱“指纹图谱”

Speaker recognition based on transformed line spectral frequencies

摘要

著录项

相似文献

相关主题

期刊订阅