...
首页> 外文期刊>Journal of Big Data >Exploring the applicability of low-shot learning in mining software repositories
【24h】

Exploring the applicability of low-shot learning in mining software repositories

机译:探索低速学习在采矿软件存储库中的适用性

获取原文

摘要

Abstract BackgroundDespite the well-documented and numerous recent successes of deep learning, the application of standard deep architectures to many classification problems within empirical software engineering remains problematic due to the large volumes of labeled data required for training. Here we make the argument that, for some problems, this hurdle can be overcome by taking advantage of low-shot learning in combination with simpler deep architectures that reduce the total number of parameters that need to be learned.FindingsWe apply low-shot learning to the task of classifying UML class and sequence diagrams from Github, and demonstrate that surprisingly good performance can be achieved by using only tens or hundreds of examples for each category when paired with an appropriate architecture. Using a large, off-the-shelf architecture, on the other hand, doesn’t perform beyond random guessing even when trained on thousands of samples.ConclusionOur findings suggest that identifying problems within empirical software engineering that lend themselves to low-shot learning could accelerate the adoption of deep learning algorithms within the empirical software engineering community.
机译:抽象背景尽管深度学习已有大量文献证明,但最近取得了许多成功,但由于训练所需的大量标记数据,标准深度架构在经验软件工程中的许多分类问题上的应用仍然存在问题。在这里,我们提出一个论点,即对于某些问题,可以通过将低速学习与更简单的深度架构相结合来克服此障碍,这些更深的架构减少了需要学习的参数总数。 Github对UML类和序列图进行分类的任务,并证明了当与适当的体系结构配对时,每个类别仅使用数十个或数百个示例就可以实现令人惊讶的良好性能。另一方面,即使使用成千上万的样本进行训练,使用大型现成的体系结构也无法进行随机猜测。结论我们的发现表明,确定经验性软件工程中的问题有助于进行低调学习。加速在经验软件工程界内部采用深度学习算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号