首页> 外文会议>International Conference on Software Maintenance and Evolution >Is this code written in English? A study of the natural language of comments and identifiers in practice
【24h】

Is this code written in English? A study of the natural language of comments and identifiers in practice

机译:这段代码用英文写作吗?关于实践中评论和标识中的自然语言的研究

获取原文

摘要

Comments and identifiers are the main source of documentation of source-code and are therefore an integral part of the development and the maintenance of a program. As English is the world language, most comments and identifiers are written in English. However, if they are in any other language, a developer without knowledge of this language will almost perceive the code to be undocumented or even obfuscated. In absence of industrial data, academia is not aware of the extent of the problem of non-English comments and identifiers in practice. In this paper, we propose an approach for the language identification of source-code comments and identifiers. With the approach, a large-scale study has been conducted of the natural language of source-code comments and identifiers, analyzing multiple open-source and industry systems. The results show that a significant amount of the industry projects contain comments and identifiers in more than one language, whereas none of the analyzed open-source systems has this problem.
机译:评论和标识符是源代码的文件的主要来源,因此是开发的一个组成部分和程序的维护。由于英语是世界语言,大多数意见和标识符都用英文编写。但是,如果他们是任何其他语言,那么一个没有知识的开发人员几乎会认为未记录或甚至混淆的代码。在没有工业数据的情况下,学术界并不意识到在实践中非英语评论和标识符的问题。在本文中,我们提出了一种方法来识别源代码评论和标识符的语言识别。通过这种方法,已经进行了大规模的研究,源代码评论和标识符的自然语言,分析了多个开源和行业系统。结果表明,大量的行业项目包含一种以上一种语言的评论和标识符,而没有分析的开源系统则具有此问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号