Query by Example of Speaker Audio Signals using Power Spectrum and MFCCs

Pafan Doungpaisan; Anirach Mingkhwan


Abstract

Search engine is the popular term for an information retrieval (IR) system. Search engines are typically based on full-text indexing. Moving from text data to multimedia data types makes the retrieval process more complex, for example when retrieving images or sounds from large databases. This paper introduces the use of language-independent and text-independent speech as the input query to a large sound database, using a speaker identification algorithm. The method consists of two main processing steps: first, vocal and non-vocal segments are separated; then the vocal segments are used for speaker identification, so that audio can be queried by speaker voice. For speaker identification and audio query, we estimate the similarity between the example signal and the samples in the queried database by calculating the Euclidean distance between their acoustic features, namely the Mel frequency cepstral coefficients (MFCCs) and the energy spectrum. Simulations show good performance at a sustainable computational cost, with an average accuracy rate above 90%.
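
The distance-based matching described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the frame length, hop size, filterbank size, number of coefficients, the choice to average features over time, and the choice to concatenate MFCCs with the log power spectrum are all assumptions, and every function name (`frame_signal`, `features`, `rank_by_distance`, `synth`) is hypothetical. The vocal/non-vocal separation step is omitted, and synthetic harmonic tones stand in for speaker recordings.

```python
import numpy as np

def frame_signal(x, frame_len=400, hop=160):
    """Split a 1-D signal into overlapping Hamming-windowed frames."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx] * np.hamming(frame_len)

def power_spectrum(frames, n_fft=512):
    """Per-frame power spectrum from the real FFT."""
    return np.abs(np.fft.rfft(frames, n_fft)) ** 2 / n_fft

def mel_filterbank(n_filters=26, n_fft=512, sr=16000):
    """Triangular filters spaced evenly on the mel scale."""
    hz_to_mel = lambda f: 2595.0 * np.log10(1.0 + f / 700.0)
    mel_to_hz = lambda m: 700.0 * (10.0 ** (m / 2595.0) - 1.0)
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(n_filters):
        lo, mid, hi = bins[i], bins[i + 1], bins[i + 2]
        fb[i, lo:mid] = (np.arange(lo, mid) - lo) / max(mid - lo, 1)
        fb[i, mid:hi] = (hi - np.arange(mid, hi)) / max(hi - mid, 1)
    return fb

def mfcc(power, fb, n_coeffs=13):
    """Log mel energies followed by an unnormalized DCT-II."""
    log_mel = np.log(power @ fb.T + 1e-10)
    n = fb.shape[0]
    k = np.arange(n_coeffs)[:, None]
    m = np.arange(n)[None, :]
    basis = np.cos(np.pi * k * (2 * m + 1) / (2 * n))
    return log_mel @ basis.T

def features(x, sr=16000):
    """Assumed feature vector: time-averaged MFCCs + time-averaged log power spectrum."""
    p = power_spectrum(frame_signal(x))
    fb = mel_filterbank(sr=sr)
    return np.concatenate([mfcc(p, fb).mean(axis=0),
                           np.log(p + 1e-10).mean(axis=0)])

def rank_by_distance(query, database):
    """Return (name, Euclidean distance) pairs, nearest first."""
    q = features(query)
    dists = {name: float(np.linalg.norm(q - features(sig)))
             for name, sig in database.items()}
    return sorted(dists.items(), key=lambda kv: kv[1])

def synth(f0, seed, sr=16000, dur=1.0):
    """Synthetic stand-in for a voice: five harmonics of f0 plus light noise."""
    rng = np.random.default_rng(seed)
    t = np.arange(int(sr * dur)) / sr
    tone = sum(np.sin(2 * np.pi * f0 * h * t) / h for h in range(1, 6))
    return tone + 0.01 * rng.standard_normal(t.size)

# Two hypothetical "speakers" in the database and a new sample from the first.
database = {"speaker_a": synth(120, seed=0), "speaker_b": synth(210, seed=1)}
query = synth(120, seed=2)
ranking = rank_by_distance(query, database)
print(ranking)
```

In this sketch the first entry of `ranking` is the database sample closest to the query. Real recordings would likely need feature normalization so that the longer power-spectrum part does not dominate the distance.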


Bibliographic details

Source
International Journal of Electrical and Computer Engineering | 2017, Issue 6 | 16 pages
Authors
Pafan Doungpaisan; Anirach Mingkhwan
Original format: PDF
CLC classification: Computer applications
