首页> 外文期刊>Empirical Software Engineering >A feature location approach for mapping application features extracted from crowd-based screencasts to source code
【24h】

A feature location approach for mapping application features extracted from crowd-based screencasts to source code

机译:用于映射从基于人群的扫描到源代码中提取的应用程序功能的特征位置方法

获取原文
           

摘要

Crowd-based multimedia documents such as screencasts have emerged as a source for documenting requirements,the workflow and implementation issues of open source and agile software projects. For example,users can show and narrate how they manipulate an application's GUI to perform a certain functionality,or a bug reporter could visually explain how to trigger a bug or a security vulnerability. Unfortunately,the streaming nature of programming screencasts and their binary format limit how developers can interact with a screencast's content. In this research,we present an automated approach for mining and linking the multimedia content found in screencasts to their relevant software artifacts and,more specifically,to source code. We apply LDA-based mining approaches that take as input a set of screencast artifacts,such as GUI text and spoken word,to make the screencast content accessible and searchable to users and to link it to their relevant source code artifacts. To evaluate the applicability of our approach,we report on results from case studies that we conducted on existing WordPress and Mozilla Firefox screencasts. We found that our automated approach can significantly speed up the feature location process. For WordPress,we find that our approach using screencast speech and GUI text can successfully link relevant source code files within the top 10 hits of the result set with median Reciprocal Rank (RR) of 50% (rank 2) and 100% (rank 1). In the case of Firefox,our approach can identify relevant source code directories within the top 100 hits using screencast speech and GUI text with the median RR = 20%,meaning that the first true positive is ranked 5 or higher in more than 50% of the cases. Also,source code related to the frontend implementation that handles high-level or GUI-related aspects of an application is located with higher accuracy. We also found that term frequency rebalancing can further improve the linking results when using less noisy scenarios or locating less technical implementation of scenarios. Investigating the results of using original and weighted screencast data sources (speech,GUI,speech and GUI) that can result in having the highest median RR values in both case studies shows that speech data is an important information source that can result in having RR of 100%.
机译:基于人群的多媒体文档(如屏幕Ccast)被出现为文档要求的来源,开源和敏捷软件项目的工作流程和实施问题。例如,用户可以显示和叙述它们如何操纵应用程序的GUI来执行某个功能,或者错误记者可以在视觉上解释如何触发错误或安全漏洞。不幸的是,编程截图的流性质及其二进制格式限制了开发人员如何与截图的内容进行交互。在这项研究中,我们提出了一种自动化方法,用于挖掘和将在截图中发现的多媒体内容连接到其相关软件工件,更具体地,源代码。我们将基于LDA的挖掘方法应用于输入一组跳过文物,例如GUI文本和口语单词,以使截图内容可访问并可搜索用户并将其链接到其相关源代码工件。为了评估我们的方法的适用性,我们报告了我们在现有WordPress和Mozilla Firefox Scastcasts上进行的案例研究的结果。我们发现,我们的自动化方法可以显着加速特征位置过程。对于WordPress,我们发现我们使用屏幕广播语音和GUI文本的方法可以成功链接在结果集的前10个命中的相关源代码文件,其中中值互惠级(RR)为50%(秩2)和100%(等级1 )。在Firefox的情况下,我们的方法可以识别前100个命中的相关源代码目录,使用截图语音和GUI文本,中位数RR = 20%,这意味着第一个真正的阳性在50%以上排名5或更高案件。此外,与处理应用程序的高级或GUI相关方面的前端实现相关的源代码具有更高的精度。我们还发现,当使用较少的噪声场景或定位方案的技术实现较少时,术语频率重新平衡可以进一步改善链接结果。调查使用原始和加权截图数据源(语音,GUI,语音和GUI)的结果,这两种情况下都可以导致具有最高中位数的RR值表明,语音数据是可能导致具有RR的重要信息源100%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号