首页> 外文期刊>Empirical Software Engineering >Code Localization in Programming Screencasts
【24h】

Code Localization in Programming Screencasts

机译:编程截屏中的代码本地化

获取原文
获取原文并翻译 | 示例
           

摘要

Programming screencasts are growing in popularity and are often used by developers as a learning source. The source code shown in these screencasts is often not available for download or copy-pasting. Without having the code readily available, developers have to frequently pause a video to transcribe the code. This is time-consuming and reduces the effectiveness of learning from videos. Recent approaches have applied Optical Character Recognition (OCR) techniques to automatically extract source code from programming screencasts. One of their major limitations, however, is the extraction of noise such as the text information in the menu, package hierarchy, etc. due to the imprecise approximation of the code location on the screen. This leads to incorrect, unusable code. We aim to address this limitation and propose an approach to significantly improve the accuracy of code localization in programming screencasts, leading to a more precise code extraction. Our approach uses a Convolutional Neural Network to automatically predict the exact location of code in an image. We evaluated our approach on a set of frames extracted from 450 screencasts covering Java, C#, and Python programming topics. The results show that our approach is able to detect the area containing the code with 94% accuracy and that our approach significantly outperforms previous work. We also show that applying OCR on the code area identified by our approach leads to a 97% match with the ground truth on average, compared to only 31% when OCR is applied to the entire frame.
机译:编程截屏节目越来越受欢迎,并且经常被开发人员用作学习资源。这些截屏视频中显示的源代码通常无法下载或粘贴。在没有随时可用的代码的情况下,开发人员必须经常暂停视频以转录代码。这很耗时,并且降低了从视频中学习的效率。最近的方法已经应用了光学字符识别(OCR)技术来自动从编程屏幕广播中提取源代码。然而,它们的主要限制之一是由于屏幕上代码位置的不精确近似,导致噪声的提取,例如菜单中的文本信息,程序包层次结构等。这导致不正确,无法使用的代码。我们旨在解决这一局限性,并提出一种方法来显着提高编程屏幕广播中代码本地化的准确性,从而实现更精确的代码提取。我们的方法使用卷积神经网络来自动预测图像中代码的确切位置。我们从从450个涵盖Java,C#和Python编程主题的屏幕录像中提取的一组帧中评估了我们的方法。结果表明,我们的方法能够以94%的精度检测包含代码的区域,并且我们的方法明显优于以前的工作。我们还表明,在通过我们的方法确定的代码区域上应用OCR可以平均与地面实况进行97%的匹配,而将OCR应用于整个帧时则仅为31%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号