Perception and Practices of Differential Testing

机译：差异测试的认识和实践

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Tens of thousands engineers are contributing to Google's codebase that spans billions of lines of code. To ensure high code quality, tremendous amount of effort has been made with new testing techniques and frameworks. However, with increasingly complex data structures and software systems, traditional test case based testing strategies cannot scale well to achieve the desired level of test adequacy. Differential (Diff) is one of the new testing techniques adapted to fill this gap. It uses the same input to run two versions of a software system, namely base and test, where base is the verified/tested version of the system while test is the modified version. The output of two runs are then thoroughly compared to find abnormalities that may lead to possible bugs. Over the past few years, differential testing has been quickly adopted by hundreds of teams across all major product areas at Google. Meanwhile, many new differential testing frameworks were developed to simplify the creation, maintenance, and analysis of diff tests. Curious by this emerging popularity, we conducted the first empirical study on differential testing in practice at large scale. In this study, we investigated common practices and usage of diff tests. We further explore the features of diff tests that users value the most and the pain points of using diff tests. Through this user study, we discovered that differential testing does not replace fine-grained testing techniques such as unit tests. Instead it supplements existing testing suites. It helps users verify the impact on unmodified and unfamiliar components in the absence of a test oracle. In terms of limitations, diff tests often take long time to run and appear to generate noisy and flaky outcomes. Finally, we highlight problems (including smart data differencing, sampling, and traceability) to guide future research in differential testing.

机译：成千上万的工程师正在为跨越数十亿行代码的Google代码库做出贡献。为了确保高质量的代码，新的测试技术和框架已经付出了巨大的努力。但是，随着数据结构和软件系统的日益复杂，基于传统测试案例的测试策略无法很好地扩展以达到所需的测试充分性水平。差分（Diff）是适合填补这一空白的新测试技术之一。它使用相同的输入来运行软件系统的两个版本，即基本版本和测试版本，其中基本版本是系统的经过验证/测试的版本，而测试版本是修改版本。然后将两次运行的输出进行彻底比较，以发现可能导致错误的异常。在过去的几年中，差异测试已被Google所有主要产品领域的数百个团队迅速采用。同时，开发了许多新的差异测试框架来简化差异测试的创建，维护和分析。对这种新兴的流行感到好奇，我们在实践中进行了首次关于差异测试的实证研究。在这项研究中，我们调查了差异测试的常见做法和用法。我们将进一步探讨用户最重视的差异测试的功能以及使用差异测试的痛点。通过此用户研究，我们发现差异测试不能替代细粒度的测试技术，例如单元测试。相反，它补充了现有的测试套件。它可以帮助用户在没有测试Oracle的情况下验证对未修改和不熟悉的组件的影响。在局限性方面，差异测试通常需要花费很长时间才能运行，并且看起来会产生嘈杂和不稳定的结果。最后，我们重点介绍了问题（包括智能数据区分，采样和可追溯性），以指导将来进行差异测试的研究。

著录项

来源
《2019 IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice》|2019年|71-80|共10页
会议地点 Montreal(CA)
作者
Muhammad Gulzar; Yongkang Zhu; Xiaofeng Han;
展开▼
作者单位

University of California, Los Angeles;

Google, Inc;

Google, Inc;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
data structures; program testing;

机译：数据结构;程序测试;;
入库时间 2022-08-26 14:32:50

相似文献

外文文献
中文文献
专利

1. Barriers to Minimizing Respiratory Viral Testing in Bronchiolitis: Physician Perceptions on Testing Practices. [J] . Maria Z Huang, Kyung E Rhee, Lauren Gist, Hospital pediatrics. . 2019,第2期

机译：使支气管炎中呼吸道病毒检测最小化的障碍：医师对测试实践的看法。
2. Testing the impartiality of surveys to measure differential risk perception [J] . Thomas P. J. Measurement . 2015,第Null期

机译：测试调查的公正性以衡量差异风险感知
3. Testing a key assumption in animal communication: between-individual variation in female visual systems alters perception of male signals Testing a key assumption in animal communication: between-individual variation in female visual systems alters perception of male signals Testing a key assumption in animal communication: between-individual variation in female visual systems alters perception of male signals [J] . Amanda L. Ensminger, Jeffrey R. Lucas, Matthew D. Shawkey, Biology Open . 2017,第12期

机译：测试动物交流中的一个关键假设：雌性视觉系统之间的个体差异会改变男性对信号的感知测试动物交流中的一个关键假设：雌性视觉系统之间的个体差异会改变男性的信号感知。女性视觉系统之间的个体差异改变了男性信号的感知
4. Perception and Practices of Differential Testing [C] . Muhammad Gulzar, Yongkang Zhu, Xiaofeng Han IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice . 2019

机译：差异测试的感知和实践
5. Examining elementary teachers' perceptions of the impact of high-stakes testing on classroom teaching practices: A mixed methods study. [D] . Borden-Hudson, LaTonya. 2010

机译：检验基本教师对高风险测试对课堂教学实践的影响的看法：混合方法研究。
6. A genetic risk assessment for prostate cancer influences patients’ risk perception and use of repeat PSA testing: a cross-sectional study in Danish general practice [O] . Jacob Fredsøe, Pia Kirkegaard, Adrian Edwards, 2020

机译：前列腺癌的遗传风险评估会影响患者的风险感知和重复PSA测试的使用：丹麦一般实践中的一项横断面研究
7. Children’s Perceptions of Tests: A Content Analysis Gokce Bulgan 10.12973/eu-jer.7.2.159 Pages: 159-167 The Relationships between Quality of Work Life, School Alienation, Burnout, Affective Commitment and Organizational Citizenship: A Study on Teachers Huseyin Akar 10.12973/eu-jer.7.2.169 Pages: 169-180 Determination of Teacher Candidates’ Views Concerning V Diagrams Used in General Biology Laboratories Kadriye Kayacan 10.12973/eu-jer.7.2.181 Pages: 181-187 Use of Instructional Technologies by Teachers in the Educational Process: Metaphor Analysis Study Hakan Sarac 10.12973/eu-jer.7.2.189 Pages: 189-202 The Analysis of Kutadgu Bilig in Terms of Values Education Aysegul Tural 10.12973/eu-jer.7.2.203 Pages: 203-209 Investigating the Resilience Levels of Parents with Children with Multiple Disabilities Based on Different Variables Sinem Kadi, Muzeyyen Eldeniz Cetin 10.12973/eu-jer.7.2.211 Pages: 211-223 Examination of Postgraduate Theses on History Textbooks in Turkey in Terms of Some Variables Eray Alaca 10.12973/eu-jer.7.2.225 Pages: 225-232 Influences of Technology Integrated Professional Development Course on Mathematics Teachers Umit Kul 10.12973/eu-jer.7.2.233 Pages: 233-243 The Book of My Dreams Hatice Degirmenci Gundogmus 10.12973/eu-jer.7.2.245 Pages: 245-249 Student Definitions of Intercultural Competence (IC)- Are They Context-Specific? Nadine Binder, Ozen Odag, Anne Leiser, Lisa Ludders, Karina Karolina Kedzior 10.12973/eu-jer.7.2.251 Pages: 251-265 Investigation of the Visuals Associated with the National identity in Turkish Republic Revolution History and Kemalism Textbooks Mehmet Elban 10.12973/eu-jer.7.2.267 Pages: 267-279 The Interplay of Emotional Instability and Socio-Environmental Aspects of Schools during Adolescence Alexander Lätsch 10.12973/eu-jer.7.2.281 Pages: 281-293 Problems of Gifted and Talented Students Regarding Cursive Handwriting: Parent Opinions Hatice Kadioglu Ates 10.12973/eu-jer.7.2.295 Pages: 295-301 A Study of Curriculum Literacy and Information Literacy Levels of Teacher Candidates in Department of Social Sciences Education Serhat Sural, Nurhak Cem Dedebali 10.12973/eu-jer.7.2.303 Pages: 303-317 Why Should Bilingualized Dictionary of Turkish Be Used in Teaching Turkish as a Foreign Language? Sami Baskin 10.12973/eu-jer.7.2.319 Pages: 319-327 Alternative Observation Tools for the Scope of Contemporary Education Supervision: An Action Research Saadet Kuru Cetin 10.12973/eu-jer.7.2.329 Pages: 329-340 The Reflection of Neoliberal Economic Policies on Education: Privatization of Education in Turkey Arslan Bayram 10.12973/eu-jer.7.2.341 Pages: 341-347 Exploring Prospective Teachers’ Reflections in the Context of Conducting Clinical Interviews Rukiye Didem Taylan 10.12973/eu-jer.7.2.349 Pages: 349-358 Consistency between Constructivist Profiles and Instructional Practices of Prospective Physics Teachers Ozlem Ates, Gul Unal Coban, Serap Kaya Sengoren 10.12973/eu-jer.7.2.359 Pages: 359-372 Fraction Multiplication and Division Word Problems Posed by Different Years of Pre-Service Elementary Mathematics Teachers Tuba Aydogdu Iskenderoglu 10.12973/eu-jer.7.2.373 Pages: 373-385 The Effect of 7E Learning Model on Conceptual Understandings of Prospective Science Teachers on "de Broglie Matter Waves" Subject [O] . 2018

机译：测试的儿童的看法：内容分析戈斯布尔根10.12973 / EU-jer.7.2.159页：研究教师侯赛因·阿卡尔10.12973：159-167工作生活，学校异化，倦怠感，感情承诺和组织公民的质量之间的关系/eu-jer.7.2.169页：169-180测定教师候选人查看关于V图中使用的普通生物学实验室Kadriye Kayacan 10.12973 / EU-jer.7.2.181页数：181-187使用教学技术的由教师在教育过程：隐喻分析研究哈坎Sarac 10.12973 / EU-jer.7.2.189页：福乐智慧的189-202的分析条款价值观教育Aysegul图拉尔10.12973 / EU-jer.7.2.203页数：203-209调查家长的复原力水平与儿童多重残疾基于不同的变量Sinem卡迪Muzeyyen Eldeniz切廷10.12973 / EU-jer.7.2.211页：关于历史教科书在土耳其研究生论文的211-223考试中的一些条款的瓦尔iables ERAY阿拉卡10.12973 / EU-jer.7.2.225页：225-232数学教师乌米特库尔10.12973 / EU-jer.7.2.233页数影响技术的综合专业发展课程：233-243我梦想中的书Hatice Degirmenci Gundogmus 10.12973 / EU-jer.7.2.245页：跨文化能力（IC）的245-249学生的定义 - 他们上下文的具体情况？纳丁活页夹，澳升Odag，安妮·莱泽，丽莎Ludders，卡琳娜卡罗利纳Kedzior 10.12973 / EU-jer.7.2.251页：与土耳其共和国革命历史和基马尔主义教科书穆罕默德Elban 10.12973国家身份的视觉效果相关的251-265调查/欧盟jer.7.2.267页：青春期亚历山大·拉奇10.12973 / EU-jer.7.2.281页面中学校的情绪不稳定和社会环境方面的267-279的互动性：优秀的281-293问题天赋的学生对于行草手写：家长意见Hatice Kadioglu阿泰10.12973 / EU-jer.7.2.295页：教师候选人的课程素养和信息素养水平在社会科学系教育塞尔哈特腓肠，295-301研究Nurhak杰姆Dedebali 10.12973 / EU-耶.7.2.303页数：土耳其的303-317为什么要Bilingualized字典是在教学中使用土耳其语作为一门外语？萨米巴斯10.12973 / EU-jer.7.2.319页：319-327替代观测工具对当代教育督导的范围：一个行动研究Saadet库鲁切廷10.12973 / EU-jer.7.2.329页数：329-340的反思教育新自由主义经济政策：在土耳其阿尔斯兰拜拉姆10.12973 / EU-jer.7.2.341页面教育的私有化：341-347中进行临床访谈Rukiye DIDEM泰兰10.12973 / EU-jer.7.2背景下探索未来教师的思考。 349页：建构概况和前景物理教师Ozlem阿泰的教学实践，古尔ÜNAL科班之间349-358一致性SERAP卡亚Sengoren 10.12973 / EU-jer.7.2.359页数：359-372馏分乘法和除法文字题所构成的不同几年前服务初等数学教师图拔Aydogdu Iskenderoglu 10.12973 / EU-jer.7.2.373页数：373-385前瞻科学教师的概念理解7E学习模式的影响“德布罗意物质波”主题

Perception and Practices of Differential Testing

摘要

著录项

相似文献

相关主题

期刊订阅