Learning analysis strategies for octagon and context sensitivity from labeled data generated by static analyses

Kihong Heo; Oh Hakjoo; Yang Hongseok

首页> 外文期刊>Formal Methods in System Design >Learning analysis strategies for octagon and context sensitivity from labeled data generated by static analyses

【24h】

Learning analysis strategies for octagon and context sensitivity from labeled data generated by static analyses

机译：从静态分析生成的标记数据中学习八边形和上下文敏感性的分析策略

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We present a method for automatically learning an effective strategy for clustering variables for the Octagon analysis from a given codebase. This learned strategy works as a preprocessor of Octagon. Given a program to be analyzed, the strategy is first applied to the program and clusters variables in it. We then run a partial variant of the Octagon analysis that tracks relationships among variables within the same cluster, but not across different clusters. The notable aspect of our learning method is that although the method is based on supervised learning, it does not require manually-labeled data. The method does not ask human to indicate which pairs of program variables in the given codebase should be tracked. Instead it uses the impact pre-analysis for Octagon from our previous work and automatically labels variable pairs in the codebase as positive or negative. We implemented our method on top of a static buffer-overflow detector for C programs and tested it against open source benchmarks. Our experiments show that the partial Octagon analysis with the learned strategy scales up to 100KLOC and is 33x faster than the one with the impact pre-analysis (which itself is significantly faster than the original Octagon analysis), while increasing false alarms by only 2%. The general idea behind our methodis applicable to other types of static analyses as well. We demonstrate that our method is also effective to learn a strategy for context-sensitivity of interval analysis.

机译：我们提出了一种方法，用于自动学习从给定代码库为八角形分析聚类变量的有效策略。这种学到的策略可以作为Octagon的预处理程序。给定要分析的程序，该策略首先应用于该程序并将其聚类。然后，我们运行八边形分析的部分变体，该变体跟踪同一集群内而不是不同集群之间变量之间的关系。我们的学习方法的一个显着方面是，尽管该方法基于监督学习，但它不需要手动标记的数据。该方法不会要求人类指出应跟踪给定代码库中的哪些程序变量对。相反，它使用先前工作中对Octagon的影响预分析，并自动将代码库中的变量对标记为正或负。我们在用于C程序的静态缓冲区溢出检测器之上实现了我们的方法，并针对开源基准对其进行了测试。我们的实验表明，采用所学策略的部分八边形分析可扩展到100KLOC，比进行影响预分析的部分快33倍（其本身比原始八角形分析快得多），同时将虚假警报仅增加2％。我们方法背后的一般思想也适用于其他类型的静态分析。我们证明了我们的方法对于学习区间分析的上下文敏感性策略也是有效的。

著录项

来源
《Formal Methods in System Design》 |2018年第2期|189-220|共32页
作者
Kihong Heo; Oh Hakjoo; Yang Hongseok;
展开▼
作者单位

Univ Penn, Philadelphia, PA 19104 USA;

Korea Univ, Seoul, South Korea;

Korea Adv Inst Sci & Technol, Daejeon, South Korea;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Static analysis; Machine learning; Relational analysis; Context-sensitivity;

机译：静态分析;机器学习;关系分析;语境敏感性;

相似文献

外文文献
中文文献
专利

1. Selective conjunction of context-sensitivity and octagon domain toward scalable and precise global static analysis [J] . Heo Kihong, Oh Hakjoo, Yi Kwangkeun Software . 2017,第11期

机译：上下文敏感度和八边形域对可伸缩且精确的全局静态分析的选择性结合
2. Multiple-Enzyme-Digestion Strategy Improves Accuracy and Sensitivity of Label- and Standard-Free Absolute Quantification to a Level That Is Achievable by Analysis with Stable Isotope-Labeled Standard Spiking [J] . Wisniewski Jacek R., Wegler Christine, Artursson Per Journal of proteome research . 2019,第1期

机译：多酶消化策略可提高标签和无标准的绝对定量的准确性和敏感性，以通过稳定同位素标记标准尖刺可实现的水平
3. Analysing the Importance-Competence Gap of Distance Educators With the Increased Utilisation of Online Learning Strategies in a Developing World Context [J] . Adéle Bezuidenhout International Review of Research in Open and Distributed Learning . 2018,第3期

机译：在发展中世界背景下，随着在线学习策略使用率的提高，分析远程教育者的重要能力差距
4. Learning a Variable-Clustering Strategy for Octagon from Labeled Data Generated by a Static Analysis [C] . Kihong Heo, Hakjoo Oh, Hongseok Yang International symposium on static analysis . 2016

机译：从静态分析生成的标记数据中学习八边形的可变聚类策略
5. IDENTIFICATION OF STRUCTURAL ELEMENT STIFFNESSES FROM INCOMPLETE STATIC TEST DATA (FINITE, PARAMETER, AEROSPACE, SYSTEM, SENSITIVITY ANALYSIS). [D] . SANAYEI, MASOUD. 1986

机译：从不完整的静态测试数据（有限，参数，航空，系统，灵敏度分析）中识别结构元素的刚度。
6. To what degree does the missing-data technique influence the estimated growth in learning strategies over time? A tutorial example of sensitivity analysis for longitudinal data [O] . Liesje Coertjens, Vincent Donche, Sven De Maeyer, 2011

机译：缺失数据技术在多大程度上会影响学习策略在一段时间内的估计增长？纵向数据敏感性分析的教程示例
7. To what degree does the missing-data technique influence the estimated growth in learning strategies over time? A tutorial example of sensitivity analysis for longitudinal data. [O] . Liesje Coertjens, Vincent Donche, Sven De Maeyer, 2017

机译：失踪数据技术在多大程度上会影响学习策略的估计增长？纵向数据灵敏度分析的教程示例。

Learning analysis strategies for octagon and context sensitivity from labeled data generated by static analyses

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅