Clone Detection on Large Scala Codebases

机译：大型Scala代码库上的克隆检测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Code clones are identical or similar code segments. The wide existence of code clones can increase the cost of maintenance and jeopardise the quality of software. The research community has developed many techniques to detect code clones, however, there is little evidence of how these techniques may perform in industrial use cases. In this paper, we aim to uncover the differences when such techniques are applied in industrial use cases. We conducted large scale experimental research on the performance of two state-of-the-art code clone detection techniques, SourcererCC and AutoenCODE, on both open source projects and an industrial project written in the Scala language. Our results reveal that both algorithms perform differently on the industrial project, with the largest drop in precision being 30.7%, and the largest increase in recall being 32.4%. By manually labelling samples of the industrial project by its developers, we discovered that there are substantially less Type-3 clones in the aforementioned project than that in the open source projects.

机译：代码克隆是相同或相似的代码段。代码克隆的广泛存在会增加维护成本并危及软件质量。研究团体已经开发了许多技术来检测代码克隆，但是，几乎没有证据表明这些技术在工业用例中会如何发挥作用。在本文中，我们旨在揭示将这些技术应用于工业用例时的差异。我们在开源项目和用Scala语言编写的工业项目中，对两种最先进的代码克隆检测技术SourcererCC和AutoenCODE的性能进行了大规模的实验研究。我们的结果表明，两种算法在工业项目上的执行效果都不同，精度下降最大的是30.7％，召回率的最大增长是32.4％。通过由开发人员手动标记工业项目的样本，我们发现上述项目中的Type-3克隆要比开源项目中的要少得多。

著录项

来源
《International Workshop on Software Clones》|2020年|38-44|共7页
会议地点
作者
Wahidur Rahman; Yisen Xu; Fan Pu; Jifeng Xuan; Xiangyang Jia; Michail Basios; Leslie Kanthan; Lingbo Li; Fan Wu; Baowen Xu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
production engineering computing; public domain software; software maintenance; software quality; source code (software);

机译：生产工程计算;公共领域软件;软件维护;软件质量;源代码（软件）;

相似文献

外文文献
中文文献
专利

1. Protection on Wireless Sensor Network from Clone Attack using the SDN-Enabled Hybrid Clone Node Detection Mechanisms [J] . Computer Communications . 2020,第Feba期

机译：使用支持SDN的混合克隆节点检测机制保护无线传感器网络免受克隆攻击
2. Re: Blaha J et al.: The monoclonal anti-CD157 antibody clone SY11B5, used for high sensitivity detection of PNH clones on WBCs, fails to detect a common polymorphic variant encoded by BST-1 [J] . Sutherland D. Robert, Musani Rumina Cytometry, Part B. Clinical cytometry: the journal of the International Society for Analytical Cytology . 2019,第1期

机译：Re：Blaha J等人：用于高灵敏度检测的单克隆抗CD157抗体克隆Sy11b5在WBC上的PNH克隆的高灵敏度检测，不能检测由BST-1编码的常见多态性变体
3. The Monoclonal Anti-CD157 Antibody Clone SY11B5, Used for High Sensitivity Detection of PNH Clones on WBCs, Fails to Detect a Common Polymorphic Variant Encoded by BST-1 [J] . Blaha Johanna, Schwarz Klaus, Fischer Claudia, Cytometry, Part B. Clinical cytometry: the journal of the International Society for Analytical Cytology . 2018,第4期

机译：用于高灵敏度检测的单克隆抗CD157抗体克隆SY11B5在WBC上的PNH克隆的高灵敏度检测，不能检测BST-1编码的常见多态性变体
4. Clone Detection on Large Scala Codebases [C] . Wahidur Rahman, Yisen Xu, Fan Pu, International Workshop on Software Clones . 2020

机译：大型Scala CodeBases的克隆检测
5. Efficient Detection of Mutated Clones Using Augmented Scope Trees [D] . Mohammed, Mubarek. 2019

机译：使用增强范围树有效检测突变克隆
6. First Detection of Human ST131-CTX-M-15-O25-B2 Clone and High-Risk Clonal Lineages of ESBL/pAmpC-Producing E. coli Isolates from Diarrheic Poultry in Tunisia [O] . Ahlem Jouini, Amira Klibi, Imen Elarbi, 2021

机译：首先检测来自突尼斯的腹泻家禽的ESBL / PAMPC的E.Coli分离物的人ST131-CTX-M-15-O25-B2克隆和高风险克隆谱系
7. Enhanced Pre-processing and Parameterization Process of Generic Code Clone Detection Model for Clones in Java Applications [O] . Nur Nadzirah Mokhtar, Al-Fahim Mubarak-Ali, Mohd Azwan 2020

机译：增强Java应用程序中克隆通用代码克隆检测模型的预处理和参数化过程
8. Cloning of the Major Protein of the Caulobacter Crescentus Periodic Surface Layer: Detection and Characterization of the Cloned Peptide by Protein Expression Assays [R] . Smit, J., Agabian, N. 1984

机译：新月形杆菌周期表面层主要蛋白质的克隆：蛋白质表达分析检测克隆肽的表达

Clone Detection on Large Scala Codebases

摘要

著录项

相似文献

相关主题

期刊订阅