首页> 外文期刊>International Journal of Electrical and Computer Engineering >Query by Example of Speaker Audio Signals using Power Spectrum and MFCCs
【24h】

Query by Example of Speaker Audio Signals using Power Spectrum and MFCCs

机译:使用功率谱和MFCC通过扬声器音频信号示例查询

获取原文
           

摘要

Search engine is the popular term for an information retrieval (IR) system. Typically, search engine can be based on full-text indexing. Changing the presentation from the text data to multimedia data types make an information retrieval process more complex such as a retrieval of image or sounds in large databases. This paper introduces the use of language and text independent speech as input queries in a large sound database by using Speaker identification algorithm. The method consists of 2 main processing first steps, we separate vocal and non-vocal identification after that vocal be used to speaker identification for audio query by speaker voice. For the speaker identification and audio query by process, we estimate the similarity of the example signal and the samples in the queried database by calculating the Euclidian distance between the Mel frequency cepstral coefficients (MFCC) and Energy spectrum of acoustic features. The simulations show that the good performance with a sustainable computational cost and obtained the average accuracy rate more than 90%.
机译:搜索引擎是信息检索(IR)系统的流行术语。通常,搜索引擎可以基于全文索引。将表示形式从文本数据更改为多媒体数据类型会使信息检索过程变得更加复杂,例如在大型数据库中检索图像或声音。本文介绍了通过说话人识别算法在大型声音数据库中将语言和文本无关的语音用作输入查询的方法。该方法包括两个主要的处理第一步,我们将语音识别和非语音识别分开,然后将语音用于说话人识别,以进行说话人语音查询。对于说话人识别和按过程进行音频查询,我们通过计算梅尔频率倒谱系数(MFCC)与声学特征能谱之间的欧几里得距离,来估计示例信号与所查询数据库中样本的相似性。仿真结果表明,该算法具有良好的性能和可承受的计算成本,并且平均准确率超过90%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号