首页> 外文会议>International Conference on Program Comprehension >An Empirical Exploration of Regularities in Open-Source Software Lexicons

【24h】

An Empirical Exploration of Regularities in Open-Source Software Lexicons

机译：开源软件词典中规律的实证探索

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The software lexicon is an important source of information during program comprehension activities and it has been in the focus of several recent case studies. Identifiers and comments, which constitute a lexicon in software, encode domain concepts and design decisions made by programmers. The paper presents an exploratory study that investigates regularities in the software lexicons of open-source projects by analyzing distributions of tokens in diverse software artifacts. The study examined source code of 142 systems from different domains, written in 12 different programming languages, as well as bug reports and external documentation. We discover that distributions of lexical tokens in studied artifacts follow the Zipf-Mandelbrot law, which is an empirical law in statistical natural language processing. Furthermore, the study reveals that the Zipf-Mandelbrot law is not confined to program lexicons in object-oriented languages, as shown in the previous studies, but also emerges in source code written using procedural, functional and markup languages, as well as other software artifacts. Our study also indicates that a previously devised software science equation does not hold for describing the program vocabulary-length relationship and more studies are necessary.

机译：软件莱克西森是计划理解活动期间的重要信息来源，它一直是几个案例研究的重点。构成软件，编码域概念和程序员的设计决策中的lexicon的标识符和评论。本文提出了一个探索性研究，通过分析各种软件工件中的令牌分布，调查开源项目软件词汇的规律。该研究检查了来自不同域的142个系统的源代码，以12种不同的编程语言编写，以及错误报告和外部文档。我们发现研究人员的词法令牌的分布遵循Zipf-Mandelbrot法，这是统计自然语言处理的实证法。此外，该研究表明，ZIPF-MENDELBROT LAVE不仅限于面向对象语言的词典，如前面的研究中所示，而且还在使用过程，功能和标记语言以及其他软件编写的源代码中出现文物。我们的研究还表明，先前设计的软件科学方程没有持有描述程序词汇长度关系，并且需要更多的研究。

著录项

来源
《International Conference on Program Comprehension 》|2009年||共5页
会议地点
作者
Derrin Pierret; Denys Poshyvanyk;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.5-53;
关键词

相似文献

外文文献
中文文献
专利

1. Towards a better understanding of software evolution: an empirical study on open-source software [J] . Iulian Neamtiu, Guowu Xie, Jianbo Chen Journal of software: evolution and process . 2013 ,第3期

机译：更好地理解软件演化：关于开源软件的实证研究
2. An open-source software ecosystem for the interactive exploration of ultrafast electron scattering data [J] . Bradley?J.?Siwick, Mark?J.?Stern, Martin?R.?Otto, Advanced Structural and Chemical Imaging . 2018 ,第1期

机译：开源软件生态系统，用于交互式探索超快电子散射数据
3. Using Open-Source Software Defined Radio Platforms for Empirical Characterization of Man-Made Impulsive Noise [J] . Otilia Popescu, John Musson, Dimitrie C. Popescu Electromagnetic Compatibility Magazine, IEEE . 2020 ,第4期

机译：使用开源软件定义的无线电平台进行人造脉冲噪声的经验表征
4. An Empirical Exploration of Regularities in Open-Source Software Lexicons [C] . Derrin Pierret, Denys Poshyvanyk International Conference on Program Comprehension . 2009

机译：开源软件词典中规律的实证探索
5. A requirements-based exploration of open-source software development projects -- Towards a natural language processing software analysis framework. [D] . Vlas, Radu Eduard. 2012

机译：基于需求的开源软件开发项目探索-走向自然语言处理软件分析框架。
6. An open-source software ecosystem for the interactive exploration of ultrafast electron scattering data [O] . Laurent P. René de Cotret, Martin R. Otto, Mark J. Stern, -1

机译：开源软件生态系统用于交互式探索超快电子散射数据
7. An Empirical Exploration of Regularities in Open-Source Software Lexicons [O] . Derrin Pierret, Denys Poshyvanyk 2009

机译：开源软件词典规律性的实证研究

An Empirical Exploration of Regularities in Open-Source Software Lexicons

摘要

著录项

相似文献

相关主题

期刊订阅