Speaker Identification Using Pseudo Pitch Synchronized Phase Information in Voiced Sound

机译：在语音中使用伪音高同步相位信息识别说话人

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In conventional speaker identification methods based on mel-frequency cepstral coefficients (MFCCs), phase information is ignored. Our recent studies have shown that phase information contains speaker dependent characteristics. We propose a new extraction method to extract pitch synchronous phase information from the voiced section only. Speaker identifi- cation experiments were performed using the NTT clean database and JNAS database. Using the new phase extraction method, we obtained a relative reduction in the speaker error rate of approximately 27% and 46%, respectively, for the two databases. We also obtained a relative error reduction of approximately 52% and 42%, respectively, when combining phase information with the MFCC-based method.

机译：在基于梅尔频率倒谱系数（MFCC）的常规说话人识别方法中，相位信息被忽略。我们最近的研究表明，相位信息包含说话者相关的特征。我们提出了一种仅从浊音部分中提取音高同步相位信息的新提取方法。使用NTT clean数据库和JNAS数据库进行了说话人识别实验。使用新的相位提取方法，对于这两个数据库，我们分别将说话人错误率分别降低了约27％和46％。当将相位信息与基于MFCC的方法结合时，我们也分别获得了约52％和42％的相对误差减少。

著录项

来源
《Asia-Pacific Signal and Information Processing Association Annual Summit and Conference》|2011年|1-6|共6页
会议地点
作者
Kohta Shimada; Kazumasa Yamamoto; Seiichi Nakagawa;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算机的应用;信号处理;
关键词

相似文献

外文文献
中文文献
专利

1. Modelling a Voice Activated Speaker Identification System using MFCC-Pitch-Formant Vector [J] . Avik Sengupta, Rabindranath Ghosh Journal of The Institution of Engineers (India): Series B . 2012,第1期

机译：使用MFCC音高形成矢量对语音激活的说话人识别系统建模
2. Modelling a Voice Activated Speaker Identification System using MFCC-Pitch-Formant Vector [J] . Avik Sengupta, Rabindranath Ghosh Journal of The Institution of Engineers (India), Series B . 2012,第1期

机译：使用MFCC音高形成矢量对声控说话人识别系统建模
3. Modelling a Voice Activated Speaker Identification System using MFCC-Pitch-Formant Vector [J] . Avik Sengupta, Rabindranath Ghosh Journal of The Institution of Engineers (India), Series B . 2012,第1期

机译：使用MFCC音高形成矢量对声控说话人识别系统建模
4. Speaker Identification Using Pseudo Pitch Synchronized Phase Information in Voiced Sound [C] . Kohta Shimada, Kazumasa Yamamoto, Seiichi Nakagawa Asia-Pacific Signal and Information Processing Association Annual Summit and Conference . 2011

机译：扬声器识别在声音中使用伪音调同步相位信息
5. Phonetic Motivation for Diachronic Sound Change in Bantu Languages as Evidenced by Voiceless Prenasalized Stop Perception by Native Somali Chizigula Speakers [D] . Boone, Haley 2018

机译：母语索马里奇济古拉语者的无声鼻塞停止感知证明了班图语中历时声音变化的语音动机
6. The Sound of Voice: Voice-Based Categorization of Speakers’ Sexual Orientation within and across Languages [O] . Simone Sulpizio, Fabio Fasoli, Anne Maass, -1

机译：语音之声：基于语音的说话者性别内和跨语言性取向分类
7. Voice source characterization using pitch synchronous discrete cosine transform for speaker identification [O] . Ramakrishnan AG, Abhiram B, Prasanna Mahadeva SR 2015

机译：使用基音同步离散余弦变换进行说话人识别的语音源表征

Speaker Identification Using Pseudo Pitch Synchronized Phase Information in Voiced Sound

摘要

著录项

相似文献

相关主题

期刊订阅