【24h】

Search in Source Code Based on Identifying Popular Fragments

机译:基于识别流行性片段搜索源代码

获取原文

摘要

When programmers write new code, they are often interested in finding definitions of functions, existing, working fragments with the same or similar functionality, and reusing as much of that code as possible. Short fragments that are often returned by search engines as results to user queries do not give enough information to help programmers determine how to reuse them. Understanding code and determining how to use it, is a manual and time-consuming process. In general, programmers want to find initial points such as relevant functions. They want to easily understand how the functions are used and see the sequence of function invocations in order to understand how concepts are implemented. Our main goal is to enable programmers to find relevant functions to query terms and their usages. In our approach, identifying popular fragments is inspired by PageRank algorithm, where the "popularity" of a function is determined by how many functions call it. We designed a model based on the vector space model by which we are able to establish relevance among facts which content contains terms that match programmer's queries. The result is an ordered list of relevant functions that reflects the associations between concepts in the functions and a programmer's query.
机译:当程序员编写新代码时,它们通常有兴趣查找具有相同或相似功能的函数,现有的工作片段的定义,并尽可能多地重用该代码。搜索引擎通常返回的短片段作为用户查询的结果不提供足够的信息来帮助程序员确定如何重用它们。了解代码并确定如何使用它,是一种手动和耗时的过程。通常,程序员希望查找诸如相关功能之类的初始点。他们希望轻松理解如何使用功能并查看功能调用序列,以便了解如何实现概念。我们的主要目标是使程序员能够查询查询条款及其用途的相关功能。在我们的方法中,识别流行性片段由PageRank算法启发,其中函数的“流行度”是由函数调用的函数的决定。我们设计了一种基于矢量空间模型的模型,我们能够建立相关的相关性,其中内容包含符合程序员查询的术语。结果是有序列表的相关函数列表,其反映了函数中概念和程序员查询之间的关联。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号