首页> 外文会议>International Workshop on Reproducible Research in Pattern Recognition >torchdistill: A Modular, Configuration-Driven Framework for Knowledge Distillation
【24h】

torchdistill: A Modular, Configuration-Driven Framework for Knowledge Distillation

机译:Torchdistill:一种模块化,配置驱动的知识蒸馏框架

获取原文

摘要

While knowledge distillation (transfer) has been attracting attentions from the research community, the recent development in the fields has heightened the need for reproducible studies and highly generalized frameworks to lower barriers to such high-quality, reproducible deep learning research. Several researchers voluntarily published frameworks used in their knowledge distillation studies to help other interested researchers reproduce their original work. Such frameworks, however, are usually neither well generalized nor maintained, thus researchers are still required to write a lot of code to refactor/build on the frameworks for introducing new methods, models, datasets and designing experiments. In this paper, we present our developed open-source framework built on PyTorch and dedicated for knowledge distillation studies. The framework is designed to enable users to design experiments by declarative PyYAML configuration files, and helps researchers complete the recently proposed ML Code Completeness Checklist. Using the developed framework, we demonstrate its various efficient training strategies, and implement a variety of knowledge distillation methods. We also reproduce some of their original experimental results on the ImageNet and COCO datasets presented at major machine learning conferences such as ICLR, NeurIPS, CVPR and ECCV, including recent state-of-the-art methods.
机译:虽然知识蒸馏(转让)一直吸引研究界的关注,但田地最近的发展提高了可再现的研究和高度广泛的框架,以降低这种高质量,可重复的深度学习研究的障碍。几位研究人员自愿公布的框架,用于他们的知识蒸馏研究,以帮助其他感兴趣的研究人员重现他们的原创作品。然而,这种框架通常既不是概括的也不维护,因此研究人员仍然需要在框架上编写大量代码来重构/构建用于引入新方法,模型,数据集和设计实验。在本文中,我们介绍了在Pytorch上建立的开发的开源框架,并专注于知识蒸馏研究。该框架旨在使用户能够通过声明的PyAML配置文件设计实验,并帮助研究人员完成最近提出的ML代码完整性检查表。使用发达的框架,我们展示了其各种有效的培训策略,并实施了各种知识蒸馏方法。我们还在主要机器学习会议(如ICLR,Neurips,CVPR和ECCV)上提供了一些原始实验结果,包括ICLR,Neurips,CVPR和ECCV,包括最近的最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号