首页> 外文期刊>IEEE transactions on audio, speech and language processing >Content-Dependent Watermarking Scheme in Compressed Speech With Identifying Manner and Location of Attacks
【24h】

Content-Dependent Watermarking Scheme in Compressed Speech With Identifying Manner and Location of Attacks

机译:识别方式和攻击位置的压缩语音中与内容有关的水印方案

获取原文
获取原文并翻译 | 示例

摘要

As speech compression technologies have advanced, digital recording devices have become increasingly popular. However, data formats used in popular speech codecs are known a priori, such that compressed data can be modified easily via insertion, deletion, and replacement. This work proposes a content-dependent watermarking scheme suitable for codebook-excited linear prediction (CELP)-based speech codec that ensures the integrity of compressed speech data. Speech data are initially partitioned into many groups, each of which includes multiple speech frames. The watermark embedded in each frame is then generated according to the line spectrum frequency (LSF) feature in the current frame, the pitch extracted from the succeeding frame, the watermark embedded in the preceding frame, and the group index which is determined by the location of the current frame. Finally, some of the least significant bits (LSBs) of the indices indicating the excitation pulse positions or excitation vectors are substituted for the watermark. Conventional watermarking schemes can only detect whether compressed speech data are intact. They cannot determine where compressed speech data are altered by insertion, deletion, or replacement, whereas the proposed scheme can. Experiments established that the proposed scheme used in the G.723.1 6.3 kb/s speech codecs embeds 12 bits in each compressed speech frame with 189 bits, and only decreases the perceptual evaluation of speech quality (PESQ) by 0.11. Additionally, its accuracy in detecting the locations of attacked frames is very high, with only two normal frames mistaken as attacked frames. Therefore, the proposed watermarking scheme effectively ensures the integrity of compressed speech data.
机译:随着语音压缩技术的发展,数字记录设备变得越来越流行。但是,在流行的语音编解码器中使用的数据格式是先验的,因此可以通过插入,删除和替换轻松地修改压缩数据。这项工作提出了一种基于内容的水印方案,适用于基于码本激励的线性预测(CELP)的语音编解码器,可确保压缩语音数据的完整性。语音数据最初被分为许多组,每个组包括多个语音帧。然后根据当前帧中的线频谱频率(LSF)特征,从后续帧中提取的音高,嵌入在前一帧中的水印以及由位置确定的组索引来生成嵌入到每个帧中的水印。当前帧的最后,将指示激励脉冲位置或激励矢量的索引中的某些最低有效位(LSB)替换为水印。传统的水印方案只能检测压缩的语音数据是否完整。他们无法确定通过插入,删除或替换在何处更改了压缩语音数据,而所提出的方案却可以。实验确定,在G.723.1 6.3 kb / s语音编解码器中使用的拟议方案在每个压缩语音帧中以189位嵌入12位,并且仅使语音质量的感知评估(PESQ)降低0.11。此外,它在检测被攻击帧位置方面的准确性非常高,只有两个正常帧被误认为是被攻击帧。因此,提出的水印方案有效地保证了压缩语音数据的完整性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号