...
首页> 外文期刊>IEICE transactions on information and systems >API-Based Software Birthmarking Method Using Fuzzy Hashing
【24h】

API-Based Software Birthmarking Method Using Fuzzy Hashing

机译:基于模糊散列的基于API的软件出生标记方法

获取原文
   

获取外文期刊封面封底 >>

       

摘要

The software birthmarking technique has conventionally been studied in fields such as software piracy, code theft, and copyright infringement. The most recent API-based software birthmarking method (Han et al., 2014) extracts API call sequences in entire code sections of a program. Additionally, it is generated as a birthmark using a cryptographic hash function (MD5). It was reported that different application types can be categorized in a program through pre-filtering based on DLL/API numbersames. However, similarity cannot be measured owing to the cryptographic hash function, occurrence of false negatives, and it is difficult to functionally categorize applications using only DLL/API numbersames. In this paper, we propose an API-based software birthmarking method using fuzzy hashing. For the native code of a program, our software birthmarking technique extracts API call sequences in the segmented procedures and then generates them using a fuzzy hash function. Unlike the conventional cryptographic hash function, the fuzzy hash is used for the similarity measurement of data. Our method using a fuzzy hash function achieved a high reduction ratio (about 41% on average) more than an original birthmark that is generated with only the API call sequences. In our experiments, when threshold ε is 0.35, the results show that our method is an effective birthmarking system to measure similarities of the software. Moreover, our correlation analysis with top 50 API call frequencies proves that it is difficult to functionally categorize applications using only DLL/API numbersames. Compared to prior work, our method significantly improves the properties of resilience and credibility.
机译:传统上已经在诸如软件盗版,代码盗窃和版权侵权的领域中研究了软件胎记标记技术。最新的基于API的软件出生标记方法(Han等,2014)从程序的整个代码部分提取API调用序列。此外,它使用加密哈希函数(MD5)作为胎记生成。据报道,可以通过基于DLL / API数字/名称的预过滤在程序中对不同的应用程序类型进行分类。但是,由于密码哈希函数,错误否定的发生,无法测量相似性,并且仅使用DLL / API数字/名称很难对应用程序进行功能分类。在本文中,我们提出了一种使用模糊哈希的基于API的软件出生标记方法。对于程序的本机代码,我们的软件胎记技术会在分段过程中提取API调用序列,然后使用模糊哈希函数生成它们。与常规的密码散列函数不同,模糊散列用于数据的相似性测量。与仅使用API​​调用序列生成的原始胎记相比,我们使用模糊哈希函数的方法实现了较高的缩减率(平均约41%)。在我们的实验中,当阈值ε为0.35时,结果表明我们的方法是衡量软件相似性的有效胎记标记系统。此外,我们对前50个API调用频率的相关性分析证明,仅使用DLL / API数字/名称很难对应用程序进行功能分类。与以前的工作相比,我们的方法显着提高了弹性和可信度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号