首页> 外文会议>International Conference on Enterprise Information Systems >Evaluating a LSTM Neural Network and a Word2vec Model in the Classification of Self-admitted Technical Debts and Their Types in Code Comments
【24h】

Evaluating a LSTM Neural Network and a Word2vec Model in the Classification of Self-admitted Technical Debts and Their Types in Code Comments

机译:评估LSTM神经网络和单词2VEC模型在自录技术债务的分类中,代码评论中的类型

获取原文

摘要

Context: Software development teams constantly opt for faster, lower quality solutions to solve current problems without planning for the future. This situation will have a negative long-term impact and is called technical debt. Similar to a financial debt, technical debts require interest payments and must be managed and detected so that the team can evaluate the best way to deal with them. One way to detect technical debts is through classification of source code comments. Developers often insert comments warning of the need to improve their own code in the future. This is known as Self-Admitted Technical Debt (SATD). Objective: Combine Word2vec for word embedding with a Long short-term memory (LSTM) neural network model to identify SATDs from comments in source code and compare with other studies and LSTM without word embedding. Method: We plan and execute an experimental process with model's effectiveness data validation. Results: In general, the classification improves when all SATD types were grouped in a single label. In relation to other studies, the LSTM model with Word2vec achieved better recall and f-measure. The LSTM model without word embedding achieves greater recall, but perform worse in precision and f-measure. Conclusion: We found evidence that LSTM models combined with word embedding are promising for the development a more effective SATD classifier.
机译:背景信息:软件开发团队不断选择更快,更低质量的解决方案,以解决当前问题而不规划未来。这种情况将产生负面的长期影响,称为技术债务。类似于财务债务,技术债务需要利息支付,必须管理和检测,以便该团队可以评估处理它们的最佳方式。检测技术债务的一种方法是通过源代码评论的分类。开发人员经常插入评论警告,需要在将来提高自己的代码。这被称为自我录取的技术债务(SATD)。目的:将Word2VEC与长短期内存(LSTM)神经网络模型相结合,以识别SATDS从源代码中的评论,并与其他研究和LSTM进行比较,而没有嵌入的话。方法:我们计划并执行模型有效性数据验证的实验过程。结果:一般来说,当所有SATD类型都在单个标签中分组时,分类可提高。关于其他研究,具有Word2VEC的LSTM模型更好地召回和F测量。没有单词嵌入的LSTM模型达到了更大的召回,但在精度和F测量中执行​​更糟。结论:我们发现证据表明,LSTM模型与Word Embeddated相结合的是开发更有效的SATD分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号