Normalizing source code vocabulary to support program comprehension and software quality

Abstract

The literature reports that the source code lexicon plays a paramount role in program comprehension, especially when software documentation is scarce, outdated, or simply not available. In source code, a significant proportion of the vocabulary consists of acronyms and/or abbreviations, or of concatenations of terms that cannot be identified using consistent mechanisms such as naming conventions. It is therefore essential to disambiguate the concepts conveyed by identifiers, both to support program comprehension and to reap the full benefit of Information Retrieval-based techniques (e.g., feature location and traceability), which require the linguistic information (i.e., source code identifiers and comments) to be consistent across all software artifacts (e.g., requirements, design, change requests, tests, and source code). To this end, we propose source code vocabulary normalization approaches that exploit contextual information to align the vocabulary found in the source code with that found in other software artifacts. Our choice of context levels was informed by prior work and by our own findings. Normalization consists of two tasks: splitting and expansion of source code identifiers. We also investigate the effect of source code vocabulary normalization approaches on software maintenance tasks. The results of our evaluation show that our context-aware techniques are more accurate, and more efficient in terms of computation time, than state-of-the-art alternatives. In addition, our findings reveal that feature location techniques can benefit from vocabulary normalization when no dynamic information is available.
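
To make the two normalization tasks named in the abstract concrete, the following is a minimal Python sketch of splitting and expansion. It is an illustration only and not the approach proposed in the paper: the function names, the small `ABBREVIATIONS` dictionary, and the subsequence-matching heuristic are all assumptions introduced here. Splitting breaks an identifier at underscores, camelCase transitions, and digit runs; expansion maps a token to a word taken from a set of context words (for example, terms from comments or requirements) when exactly one context word matches, and otherwise falls back to the dictionary.

```python
from __future__ import annotations

import re

# Hypothetical fallback dictionary; a real tool would derive this from
# project artifacts or a curated abbreviation list.
ABBREVIATIONS = {"cnt": "count", "usr": "user", "msg": "message", "ptr": "pointer"}


def split_identifier(identifier: str) -> list[str]:
    """Split an identifier at underscores, camelCase boundaries, and digit runs."""
    tokens: list[str] = []
    for part in re.split(r"[_\W]+", identifier):
        # Acronym runs (HTML), capitalized or lowercase words (Parser, get), digits.
        tokens.extend(re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|\d+", part))
    return [token.lower() for token in tokens]


def is_abbreviation_of(short: str, word: str) -> bool:
    """True if `short` is a strictly shorter, in-order subsequence of `word`
    that starts with the same letter (an assumed, deliberately simple heuristic)."""
    if not short or short[0] != word[:1] or len(short) >= len(word):
        return False
    remaining = iter(word)
    return all(ch in remaining for ch in short)


def expand_token(token: str, context_words: set[str]) -> str:
    """Expand a token using context words first, then the fallback dictionary."""
    if token in context_words:
        return token  # already a known word in this context
    candidates = [w for w in context_words if is_abbreviation_of(token, w)]
    if len(candidates) == 1:
        return candidates[0]  # unambiguous match in the context
    return ABBREVIATIONS.get(token, token)


def normalize(identifier: str, context_words: set[str]) -> list[str]:
    """Split an identifier, then expand each resulting token."""
    return [expand_token(t, context_words) for t in split_identifier(identifier)]


if __name__ == "__main__":
    print(normalize("getUsrCnt", {"user", "count"}))
    print(normalize("max_msg_len", set()))
```

The context set stands in for the "contextual information" the abstract refers to. This sketch only uses context when the match is unambiguous, so a token with several plausible expansions is left to the dictionary or returned unchanged.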

Bibliographic record

Source
International Conference on Software Engineering | 2013 | pp. 1385-1388 | 4 pages
Author
Guerrouj Latifa
Original format: PDF
Keywords
Source code linguistic analysis; information retrieval; program comprehension; software quality
