Why My Code Summarization Model Does Not Work: Code Comment Improvement with Category Prediction

QIUYUAN CHEN; XIN XIA; HAN HU; DAVID LO; SHANPING LI

首页> 外文期刊>ACM transactions on software engineering and methodology >Why My Code Summarization Model Does Not Work: Code Comment Improvement with Category Prediction

【24h】

Why My Code Summarization Model Does Not Work: Code Comment Improvement with Category Prediction

机译：为什么我的代码摘要模型不起作用：代码评论改进与类别预测

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Code summarization aims at generating a code comment given a block of source code and it is normally performed by training machine learning algorithms on existing code block-comment pairs. Code comments in practice have different intentions. For example, some code comments might explain how the methods work, while others explain why some methods are written. Previous works have shown that a relationship exists between a code block and the category of a comment associated with it. In this article, we aim to investigate to which extent we can exploit this relationship to improve code summarization performance. We first classify comments into six intention categories and manually label 20,000 code-comment pairs. These categories include "what," "why," "how-to-use," "how-it-is-done," "property, "and "others. "Based on this dataset, we conduct an experiment to investigate the performance of different state-of-the-art code summarization approaches on the categories. We find that the performance of different code summarization approaches varies substantially across the categories. Moreover, the category for which a code summarization model performs the best is different for the different models. In particular, no models perform the best for "why" and "property" comments among the six categories. We design a composite approach to demonstrate that comment category prediction can boost code summarization to reach better results. The approach leverages classified code-category labeled data to train a classifier to infer categories. Then it selects the most suitable models for inferred categories and outputs the composite results. Our composite approach outperforms other approaches that do not consider comment categories and obtains a relative improvement of 8.57% and 16.34% in terms of ROUGE-L and BLEU-4 score, respectively.

机译：代码摘要旨在给定源代码块的代码评论，它通常是通过现有代码块评论对的培训机器学习算法来执行的。实践中的代码评论有不同的意图。例如，一些代码评论可能会解释方法的工作原理，而其他则解释为什么写入某些方法。以前的作品已经表明，代码块和与其关联的评论的类别之间存在关系。在本文中，我们的目标是调查我们如何利用这种关系来提高代码摘要性能。我们首先将评论分为六个意图类别，并手动标记20,000个代码评论对。这些类别包括“什么”，“为什么”，“如何使用，”“hof-in-to-do，”“属性”和“其他”。根据此数据集进行了一个实验来调查不同最先进的代码摘要的性能对类别的方法。我们发现不同代码摘要方法的性能大幅度变化。此外，对于不同模型，代码摘要模型执行最佳的类别是不同的。特别是，没有模型在六个类别中的“为什么”和“属性”评论中最佳。我们设计一种复合方法来证明评论类别预测可以提高代码摘要以达到更好的结果。该方法利用分类的代码类标记的数据来训练分类器到推断类别。然后它为推断类别选择最合适的模型，并输出复合结果。我们的综合方法优于不考虑评论类别的其他方法，并分别在Rouge-L和Bleu-4分别获得8.57％和16.34％的相对提高。

著录项

来源
《ACM transactions on software engineering and methodology》 |2021年第2期|25.1-25.29|共29页
作者
QIUYUAN CHEN; XIN XIA; HAN HU; DAVID LO; SHANPING LI;
展开▼
作者单位

College of Computer Science and Technology Zhejiang University;

Faculty of Information Technology Monash University Victoria Australia;

Faculty of Information Technology Monash University Victoria Australia;

School of Information Systems Singapore Management University Singapore;

College of Computer Science and Technology Zhejiang University;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Code summarization; code comment; comment classification;

机译：代码摘要;代码评论;评论分类;

相似文献

外文文献
中文文献
专利

1. Video Summarization With Attention-Based Encoder–Decoder Networks [J] . Ji Zhong, Xiong Kailin, Pang Yanwei, IEEE Transactions on Circuits and Systems for Video Technology . 2020,第6期

机译：基于关注的编码器解码器网络的视频概述
2. Unified QSAR approach to antimicrobials. Part 3: First multi-tasking QSAR model for Input-Coded prediction, structural back-projection, and complex networks clustering of antiprotozoal compounds. [J] . Prado-Prado FJ, Gonzalez-Diaz H, de-la-Vega OM, Bioorganic and medicinal chemistry . 2008,第11期

机译：统一的QSAR抗菌方法。第3部分：第一个多任务QSAR模型，用于输入编码的预测，结构反投影和反原生动物化合物的复杂网络聚类。
3. SummCoder: An unsupervised framework for extractive text summarization based on deep auto-encoders [J] . Joshi Akanksha, Fidalgo E., Alegre E., Expert Systems with Application . 2019,第SEPa期

机译：SummCoder：基于深度自动编码器的用于抽取文本摘要的无监督框架
4. Improvement of the quality of medical databases: data-mining-based prediction of diagnostic codes from previous patient codes [C] . Mehdi DJENNAOUF, Gregojre FICHEUR, Regis BEUSCART, Medical Informatics in Europe Conference. . 2015

机译：提高医疗数据库质量：基于数据挖掘的诊断代码预测来自先前的患者代码
5. Emergence of dual coding mechanisms in a network model of the locust antennal lobe: Transient temporal binding vs. a high dimensional rate code. [D] . Patel, Mainak. 2009

机译：蝗虫触角叶网络模型中双重编码机制的出现：瞬时时间绑定与高维速率码。
6. Validity of diagnosis codes for identifying cutaneous squamous cell carcinoma in The Health Improvement Network [O] . Z.C. Chiesa Fuxench, A.B. Troxel, J.M. Gelfand -1

机译：健康改善网络中诊断皮肤鳞状细胞癌的诊断代码的有效性
7. A Bayesian regression framework for concrete creep prediction improvement: application to Eurocode 2 model [O] . Hikmat Daou, Wassim Raphael 2021

机译：混凝土蠕变预测改进的贝叶斯回归框架：在欧洲码的应用2模型

Why My Code Summarization Model Does Not Work: Code Comment Improvement with Category Prediction

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅