首页> 外文会议>IEEE Aerospace Conference >Discovering Relationships Among Software Artifacts
【24h】

Discovering Relationships Among Software Artifacts

机译:发现软件工件之间的关系

获取原文

摘要

Software systems have become ubiquitous in today's world. Most software will evolve after initial deployment. Software changes that are a part of that evolution often are documented in a requirements change document. One of the challenges when changing software is understanding the portions of the existing requirements and the existing code that could be affected by the change in order to avoid or minimize unexpected side effects from the changes. Researchers have addressed the problem of minimizing the effect of changes by using different methods, including text mining and clustering. Some approaches to determine change impact are based on information retrieval (IR) techniques using both term frequency-inverse document frequency (TF—IDF) and latent semantic indexing (LSI) methods. Other approaches are based on visualization techniques using degree and betweenness centrality measures. In this research, we approach the problem by applying IR techniques along with data mining. We apply TF-IDF and LSI to investigate which changes have a high potential of modifying existing requirements. We also analyze similarities between changes that do not map to existing requirements. In both cases, our threshold for identifying similarity is 80%. We designed our approach to identify, for a given change, one or more requirements that have a high potential of being associated with the change as well as identifying intra-document requirements or changes that have a high potential for consolidation. We were able to identify requirements that had a similarity of at least 80% to a change request using TF-IDF and LSI. We were also able to isolate changes that did not show a high similarity to any requirement, thus indicating that the change request was likely a request for a new requirement. The results are encouraging for assessing the impact of software change requests on requirements of an existing system.
机译:在当今世界,软件系统已经无处不在。最初部署后,大多数软件都会发展。属于这种演变的一部分的软件更改通常记录在需求更改文档中。更改软件时的挑战之一是了解可能受更改影响的现有需求部分和现有代码,以避免或最大程度地减少更改带来的意外副作用。研究人员已经解决了通过使用不同的方法(包括文本挖掘和聚类)将更改的影响最小化的问题。确定更改影响的一些方法是基于信息检索(IR)技术的,该技术同时使用术语频率反文档频率(TF-IDF)和潜在语义索引(LSI)方法。其他方法基于使用程度和中间性中心度度量的可视化技术。在这项研究中,我们通过将IR技术与数据挖掘一起应用来解决该问题。我们使用TF-IDF和LSI来研究哪些更改具有修改现有需求的巨大潜力。我们还分析了未映射到现有需求的变更之间的相似性。在这两种情况下,我们识别相似性的门槛均为80%。我们设计了一种方法,以针对给定的变更识别一个或多个与变更相关联的潜在需求,以及识别文档内的需求或具有合并潜力的变更。我们能够使用TF-IDF和LSI识别与变更请求至少有80%相似性的需求。我们还能够隔离出与任何需求没有高度相似性的变更,从而表明变更请求很可能是对新需求的请求。该结果对于评估软件变更请求对现有系统需求的影响是令人鼓舞的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号