首页> 外国专利> BUILDING MULTI-REPRESENTATIONAL LEARNING MODELS FOR STATIC ANALYSIS OF SOURCE CODE

BUILDING MULTI-REPRESENTATIONAL LEARNING MODELS FOR STATIC ANALYSIS OF SOURCE CODE

机译:构建多代表学习模型,用于源代码的静态分析

摘要

Techniques for building multi-representational learning models for static analysis of source code are disclosed. In some embodiments, a system/process/computer program product for building multi-representational learning models for static analysis of source code includes receiving training data, wherein the training data includes a set of source code files for training a multi-representational learning (MRL) model for classifying malicious source code and benign source code based on a static analysis; generating a first feature vector based on a set of characters extracted from the set of source code files; generating a second feature vector based on a set of tokens extracted from the set of source code files; and performing an ensemble of the first feature vector and the second feature vector to form a target feature vector for classifying malicious source code and benign source code based on the static analysis.
机译:公开了用于构建用于源代码静态分析的多代表学习模型的技术。在一些实施例中,用于构建用于源代码静态分析的多代表学习模型的系统/过程/计算机程序产品包括接收训练数据,其中训练数据包括用于训练多代表学习的一组源代码文件(MRL )基于静态分析对恶意源代码和良性源代码进行分类的模型;基于从源代码文件集中提取的一组字符生成第一特征向量;基于从源代码文件集中提取的一组令牌生成第二特征向量;并执行第一特征向量和第二特征向量的集合,以形成目标特征向量,用于基于静态分析对恶意源代码和良性源代码进行分类。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号