首页> 外国专利> BUILDING MULTI-REPRESENTATIONAL LEARNING MODELS FOR STATIC ANALYSIS OF SOURCE CODE

BUILDING MULTI-REPRESENTATIONAL LEARNING MODELS FOR STATIC ANALYSIS OF SOURCE CODE

机译：构建多代表学习模型，用于源代码的静态分析

页面导航

摘要
著录项
相似文献

摘要

Techniques for building multi-representational learning models for static analysis of source code are disclosed. In some embodiments, a system/process/computer program product for building multi-representational learning models for static analysis of source code includes receiving training data, wherein the training data includes a set of source code files for training a multi-representational learning (MRL) model for classifying malicious source code and benign source code based on a static analysis; generating a first feature vector based on a set of characters extracted from the set of source code files; generating a second feature vector based on a set of tokens extracted from the set of source code files; and performing an ensemble of the first feature vector and the second feature vector to form a target feature vector for classifying malicious source code and benign source code based on the static analysis.

机译：公开了用于构建用于源代码静态分析的多代表学习模型的技术。在一些实施例中，用于构建用于源代码静态分析的多代表学习模型的系统/过程/计算机程序产品包括接收训练数据，其中训练数据包括用于训练多代表学习的一组源代码文件（MRL ）基于静态分析对恶意源代码和良性源代码进行分类的模型;基于从源代码文件集中提取的一组字符生成第一特征向量;基于从源代码文件集中提取的一组令牌生成第二特征向量;并执行第一特征向量和第二特征向量的集合，以形成目标特征向量，用于基于静态分析对恶意源代码和良性源代码进行分类。

著录项

公开/公告号US2021240826A1

专利类型
公开/公告日2021-08-05

原文格式PDF
申请/专利权人 PALO ALTO NETWORKS INC.;
展开▼

申请/专利号US202016779271
发明设计人 BRODY JAMES KUTT;WILLIAM REDINGTON HEWLETT II;OLEKSII STAROV;YUCHEN ZHOU;FANG LIU;
展开▼

申请日2020-01-31
分类号G06F21/56;G06N20;G06K9/62;G06F8/75;G06F8/41;
国家 US
入库时间 2022-08-24 20:20:34

相似文献

专利
外文文献
中文文献