IEEE Transactions on Pattern Analysis and Machine Intelligence

One-Shot Neural Architecture Search: Maximising Diversity to Overcome Catastrophic Forgetting



Abstract

One-shot neural architecture search (NAS) has recently become mainstream in the NAS community because it significantly improves computational efficiency through weight sharing. However, the supernet training paradigm in one-shot NAS introduces catastrophic forgetting: each training step can deteriorate the performance of other architectures whose weights are partially shared with the current architecture. To overcome this problem, we formulate supernet training for one-shot NAS as a constrained continual learning optimization problem, such that learning the current architecture does not degrade the validation accuracy of previous architectures. The key to solving this constrained optimization problem is a novelty search based architecture selection (NSAS) loss function that regularizes supernet training by using a greedy novelty search method to find the most representative subset of architectures. We applied the NSAS loss function to two one-shot NAS baselines and extensively tested them on both a common search space and a NAS benchmark dataset. We further derive three variants of the NSAS loss function: NSAS with a depth constraint (NSAS-C) to improve transferability, and NSAS-G and NSAS-LG to handle situations with a limited number of constraints. Experiments on the common NAS search space demonstrate that NSAS and its variants improve the predictive ability of supernet training in one-shot NAS, with remarkable and efficient performance on the CIFAR-10, CIFAR-100, and ImageNet datasets. The results on the NAS benchmark dataset also confirm the significant improvements these one-shot NAS baselines can achieve.
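To make the described regularization concrete, below is a minimal PyTorch-style sketch (not the authors' code) of how a greedy novelty search could pick a diverse subset of previously visited architectures and how their losses could be added to each supernet update as soft constraints. The operation list, edge count, the supernet(x, arch) forward interface, and the nsas_weight coefficient are illustrative assumptions.

import random
import torch.nn.functional as F

# Assumed single-path search space: one operation per edge of a cell.
OPS = ["sep_conv_3x3", "sep_conv_5x5", "max_pool_3x3", "skip_connect"]
NUM_EDGES = 8


def sample_architecture():
    """Sample a single-path architecture: one operation per edge."""
    return tuple(random.choice(OPS) for _ in range(NUM_EDGES))


def arch_distance(a, b):
    """Hamming distance between two architectures (edge-wise disagreement)."""
    return sum(op_a != op_b for op_a, op_b in zip(a, b))


def novelty(arch, archive, k=3):
    """Novelty = mean distance to the k nearest neighbours in the archive."""
    if not archive:
        return float("inf")
    dists = sorted(arch_distance(arch, other) for other in archive)
    return sum(dists[:k]) / min(k, len(dists))


def greedy_novelty_subset(candidates, subset_size):
    """Greedily select the most mutually novel (diverse) architectures."""
    subset, pool = [], list(candidates)
    while pool and len(subset) < subset_size:
        best = max(pool, key=lambda a: novelty(a, subset))
        subset.append(best)
        pool.remove(best)
    return subset


def supernet_step(supernet, optimizer, batch, constraint_archs, nsas_weight=1.0):
    """One supernet update regularized so the constraint subset is not forgotten."""
    x, y = batch
    current = sample_architecture()
    loss = F.cross_entropy(supernet(x, current), y)
    # Constrained-continual-learning term: losses of the diverse subset of
    # previously visited architectures act as soft constraints on the update.
    for past in constraint_archs:
        loss = loss + nsas_weight * F.cross_entropy(supernet(x, past), y)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return current

In use, each architecture visited during supernet training would be appended to a candidate pool, and greedy_novelty_subset would be re-run periodically to refresh constraint_archs before the next calls to supernet_step; this is a sketch of the general idea, not the paper's exact procedure.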
