A feature location approach for mapping application features extracted from crowd-based screencasts to source code

Parisa Moslehi; Bram Adams; Juergen Rilling

Abstract

Crowd-based multimedia documents such as screencasts have emerged as a source for documenting the requirements, workflow, and implementation issues of open source and agile software projects. For example, users can show and narrate how they manipulate an application's GUI to perform a certain functionality, or a bug reporter could visually explain how to trigger a bug or a security vulnerability. Unfortunately, the streaming nature of programming screencasts and their binary format limit how developers can interact with a screencast's content. In this research, we present an automated approach for mining the multimedia content found in screencasts and linking it to relevant software artifacts and, more specifically, to source code. We apply LDA-based mining approaches that take as input a set of screencast artifacts, such as GUI text and spoken words, to make the screencast content accessible and searchable to users and to link it to its relevant source code artifacts. To evaluate the applicability of our approach, we report on results from case studies that we conducted on existing WordPress and Mozilla Firefox screencasts. We found that our automated approach can significantly speed up the feature location process. For WordPress, we find that our approach using screencast speech and GUI text can successfully link relevant source code files within the top 10 hits of the result set, with a median Reciprocal Rank (RR) of 50% (rank 2) and 100% (rank 1). In the case of Firefox, our approach can identify relevant source code directories within the top 100 hits using screencast speech and GUI text with a median RR of 20%, meaning that the first true positive is ranked at position 5 or better in more than 50% of the cases. Also, source code related to the frontend implementation, which handles high-level or GUI-related aspects of an application, is located with higher accuracy.
We also found that term frequency rebalancing can further improve the linking results for less noisy scenarios or for scenarios with a less technical implementation. Comparing the original and weighted screencast data sources (speech, GUI, and speech plus GUI) that yield the highest median RR values in both case studies shows that speech data is an important information source, one that can yield an RR of 100%.
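
The RR figures quoted in the abstract follow from the standard definition of Reciprocal Rank: the inverse of the position of the first relevant hit in a ranked result list, so rank 1 gives 100%, rank 2 gives 50%, and rank 5 gives 20%. The following is a minimal sketch of that metric, not the authors' evaluation code; the function names, the `cutoff` parameter, and the file names in the usage lines are hypothetical and chosen only for illustration.

```python
from statistics import median


def reciprocal_rank(ranked_hits, relevant, cutoff=None):
    """Return 1/rank of the first relevant hit, or 0.0 if none is found.

    ranked_hits: result list ordered from best to worst match.
    relevant:    set of items considered true positives.
    cutoff:      optional number of top hits to inspect (e.g. 10 or 100).
    """
    hits = ranked_hits if cutoff is None else ranked_hits[:cutoff]
    for rank, hit in enumerate(hits, start=1):
        if hit in relevant:
            return 1.0 / rank
    return 0.0


def median_reciprocal_rank(queries, cutoff=None):
    """Median RR over (ranked_hits, relevant) pairs, one pair per query."""
    return median(reciprocal_rank(hits, rel, cutoff) for hits, rel in queries)


# Hypothetical result lists for three screencasts (file names are made up).
queries = [
    (["post.php", "edit.php", "menu.php"], {"post.php"}),          # rank 1
    (["menu.php", "edit.php", "post.php"], {"edit.php"}),          # rank 2
    (["a.php", "b.php", "c.php", "d.php", "e.php"], {"e.php"}),    # rank 5
]
print(median_reciprocal_rank(queries, cutoff=10))  # 0.5
```

Because the median is taken over per-query RR values, a median RR of 20% says that at least half of the queries place their first true positive at position 5 or better, which matches the reading given in the abstract.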

Bibliographic details

Source
Empirical Software Engineering | 2020, Issue 6 | pp. 4873-4926 | 54 pages
Authors
Parisa Moslehi; Bram Adams; Juergen Rilling
Affiliations

Concordia University, Montreal, QC, Canada;

Queen's University, Kingston, ON, Canada;

Concordia University, Montreal, QC, Canada;

Original format: PDF
Language: English
Keywords
Crowd-based documentation; Mining video content; Speech analysis; Feature location; Software traceability; Information extraction; Software documentation;