首页> 外文会议>Conference on Security, Steganography, and Watermarking of Multimedia Contents >Application of conditional entropy measures to steganalysis
【24h】

Application of conditional entropy measures to steganalysis

机译:条件熵措施在塞巴析中的应用

获取原文

摘要

Many commercial steganographic programs use least significant bit (LSB) embedding techniques to hide data in 24-bit color images. We present the results from a new steganalysis algorithm that uses a variety of entropy and conditional entropy features of various image bitplanes to detect the presence of LSB hiding. Our technique uses a Support Vector Machine (SVM) for bivariate classification. We use the SVMLight implementation due to Joachims (available at http://svmlight.joachims.org/). A novel Genetic Algorithm (GA) approach was used to optimize the feature set used by the classifier. Results include correct identification rates as high as >98% and false positive rates as low as <2%. We have applied using the staganography programs stegHide and Hide4PGP. The hiding algorithms are capable of both sequential and distributed LSB embedding. The image library consisted of 40,000 digital images of varying size and content, which form a diverse test set. Training sets consisted of as many as 34,000 images, half "clean" and the other half a disjoint set containing embedded data. The hidden data consisted of files with various sizes and various information densities, ranging from very low average entropy (e.g., standard word processing or spreadsheet files) to very high entropy (compressed data). The testing phase used a similarly prepared set, disjoint from the training data. Our work includes comparisons with current state-of-the-art techniques, and a detailed study of how results depend on training set size and feature sets used.
机译:许多商业隐写计划使用最低有效位(LSB)嵌入技术来隐藏24位彩色图像中的数据。我们介绍了一种新的隐分算法的结果,该算法使用各种图像位平面的各种熵和条件熵特征来检测LSB隐藏的存在。我们的技术使用支持向量机(SVM)进行双变量分类。我们使用Goachims的SVMlight实现(在http://svmlight.joachims.org/提供)。一种新的遗传算法(GA)方法用于优化分类器使用的特征集。结果包括高达> 98%的正确识别率,假阳性率低至<2%。我们使用Steghide和Hide4PGP使用旧吊记程序应用。隐藏算法能够顺序和分布式LSB嵌入。图像库由40,000个不同尺寸和内容的数字图像组成,其形成多样化的测试集。培训集由多达34,000个图像组成,一半“清洁”,另一半的禁用集包含嵌入式数据。隐藏数据由具有各种大小和各种信息密度的文件组成,从非常低的平均熵(例如,标准字处理或电子表格文件)到非常高的熵(压缩数据)。测试阶段使用类似准备的集合,与训练数据不相交。我们的工作包括利用当前最先进的技术的比较,并详细研究结果如何依赖于训练集大小和使用功能集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号