首页> 外文会议>IEEE International Conference on Data Mining >Apk2vec: Semi-Supervised Multi-view Representation Learning for Profiling Android Applications
【24h】

Apk2vec: Semi-Supervised Multi-view Representation Learning for Profiling Android Applications

机译:APK2VEC:半监控的多视图表示学习,用于分析Android应用程序

获取原文

摘要

Building behavior profiles of Android applications (apps) with holistic, rich and multi-view information (e.g., incorporating several semantic views of an app such as API sequences, system calls, etc.) would help catering downstream analytics tasks such as app categorization, recommendation and malware analysis significantly better. Towards this goal, we design a semisupervised Representation Learning (RL) framework named apk2vec to automatically generate a compact representation (aka profile/embedding) for a given app. More specifically, apk2vec has the three following unique characteristics which make it an excellent choice for large-scale app profiling: (1) it encompasses information from multiple semantic views such as API sequences, permissions, etc., (2) being a semi-supervised embedding technique, it can make use of labels associated with apps (e.g., malware family or app category labels) to build high quality app profiles, and (3) it combines RL and feature hashing which allows it to efficiently build profiles of apps that stream over time (i.e., online learning). The resulting semi-supervised multi-view hash embeddings of apps could then be used for a wide variety of downstream tasks such as the ones mentioned above. Our extensive evaluations with more than 42,000 apps demonstrate that apk2vec's app profiles could significantly outperform state-of-the-art techniques in four app analytics tasks namely, malware detection, familial clustering, app clone detection and app recommendation.
机译:构建Android应用程序(应用程序)与全面,丰富和多视图信息行为档案(例如,包含一个应用程序的若干意见语义如API序列,系统调用等),将有助于餐饮下游分析任务,如应用程序分类,推荐和恶意软件分析显著更好。为了实现这一目标,我们设计了一个名为apk2vec,自动将半监督学习代表(RL)框架生成一层致密的表示(又名资料/嵌入)对于一个给定的应用程序。更具体地,具有apk2vec下列三个独特的特征,这使得它对于大规模应用分析的最佳选择:(1)它包括从诸如API序列,权限等多个语义图(2)是一个半的信息,监督的嵌入技术,它可以利用与应用程序(例如,恶意软件家族或应用类别标签)相关联的标签来建立高品质的应用程序简档,和(3)它结合RL和特征散列这使得它能够有效地构建的应用程序的个人资料,流过时间(即,在线学习)。应用程序所得到的半监督多视图散列的嵌入然后可以用于各种各样的下游任务如那些上面提到的。我们与42,000多家应用广泛的评估表明,apk2vec的应用程序配置文件能够在国家的最先进的显著跑赢大盘技术四种应用分析的任务,即恶意软件检测,家族聚集性,应用克隆检测和应用建议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号