首页> 外文期刊>Pattern recognition letters >Source code author identification with unsupervised feature learning
【24h】

Source code author identification with unsupervised feature learning

机译:具有无监督功能学习的源代码作者识别

获取原文
获取原文并翻译 | 示例
       

摘要

Automatic identification of source code authors has many applications in different fields such as source code plagiarism detection, and law suit prosecution. This paper presents a new source code author identification system based on an unsupervised feature learning technique. As a method of extracting features from high dimensional data, unsupervised feature learning has obtained a great success in many fields such as character recognition and image classification. However, according to our knowledge it has not been applied for source code author identification systems. Therefore, we investigated an unsupervised feature learning technique called sparse auto-encoder as a method of extracting features from source code files. Our system was evaluated with several datasets and results have shown that performance is very close to the state of art techniques in the source code identification field.
机译:源代码作者的自动识别在源代码窃检测和诉讼起诉等不同领域中有许多应用。本文提出了一种基于无监督特征学习技术的新型源代码作者识别系统。作为一种从高维数据中提取特征的方法,无监督特征学习在字符识别和图像分类等许多领域都取得了巨大的成功。但是,据我们所知,它尚未应用于源代码作者识别系统。因此,我们研究了一种称为稀疏自动编码器的无监督特征学习技术,该技术是一种从源代码文件中提取特征的方法。我们的系统通过几个数据集进行了评估,结果表明,该性能与源代码识别领域的技术水平非常接近。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号