首页> 外文期刊>International Journal on Smart Sensing and Intelligent Systems >PERCEPTUAL HASHING ALGORITHM FOR SPEECH CONTENT IDENTIFICATION BASED ON SPECTRUM ENTROPY IN COMPRESSED DOMAIN
【24h】

PERCEPTUAL HASHING ALGORITHM FOR SPEECH CONTENT IDENTIFICATION BASED ON SPECTRUM ENTROPY IN COMPRESSED DOMAIN

机译:压缩域中基于谱熵的语音内容识别的哈希算法

获取原文
           

摘要

This paper proposes a new perceptual hashing algorithm for speech content identification with compressed domain based on MDCT (Modified Discrete Cosine Transform) Spectrum Entropy. It aims primarily to solve problems of large computational complexity and poor real-time performance that appear when applying traditional identification methods to the compressed speeches. The process begins by extracting the MDCT coefficients, which are the intermediately decoded results of compressed speeches in MP3 format. In order to reduce the computational complexity, these coefficients are divided into sub-bands and the energy of MDCT spectrum is then calculated. Sub-bands of MDCT spectrum energy are then mapped to a similar mass function in information entropy theory. The function will be used as a perceptual feature and set to extract binary hash values. Experimental results show that the proposed algorithm keeps greater robustness to content-preserving operations while also maintaining efficiency. As a result of the partial decoding process, the real-time performance can meet the requirements of applications in real-time communication terminals.
机译:提出了一种基于改进的离散余弦变换频谱熵的压缩域语音内容识别感知哈希算法。它的主要目的是解决将传统识别方法应用于压缩语音时出现的计算量大和实时性能差的问题。该过程开始于提取MDCT系数,该系数是MP3格式的压缩语音的中间解码结果。为了降低计算复杂度,将这些系数划分为子带,然后计算MDCT频谱的能量。然后,将MDCT频谱能量的子带映射到信息熵理论中的相似质量函数。该函数将用作感知功能,并设置为提取二进制哈希值。实验结果表明,该算法在保持内容效率的同时,对内容保存操作具有更高的鲁棒性。作为部分解码过程的结果,实时性能可以满足实时通信终端中应用的要求。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号