PyTorch implementation of ICLR2020 paper 'Towards Fast Adaptation of Neural Architectures with Meta Learning'
- python: 3.x
- Pytorch: 1.0+
- tensorboardX >= 1.4
- torchvision >= 0.2.1
- ptflops
bash train_tnas.sh
Here, the workload is very high, because we only get meta-theta in search stage. It is worth noting that the network weights cannot be used to transfer to evaluate the final performance. Similar other NAS (e.g., darts) and auto-maml, we need to train the model from scratch. In the implementation, we set 600 tasks (consistent with meta learning) to evaluate. Therefore, you need to do the following steps:
- for each task, decoding the architecture theta according to the 5-way, 1/5-shot samples.
- training each decoded arch to get weights with maml method and pick the best model in the validation set.
- for each task, test the model in 5-way, 15-query samples.
- average the results of 600 tasks.
About training time: each model needs about 6h to converge in a single GTX 1080. So you need about 600*6h = 3600h in total in a single GTX 1080.
bash train_auto_maml.sh
- 5-way, 1-shot
normal cell and reduction cell
- 5-way, 5-shot
normal cell and reduction cell
bash train_auto_maml_from_scratch.sh
@InProceedings{lian2020iclr,
title={Towards Fast Adaptation of Neural Architectures with Meta Learning},
author={Lian, Dongze and Zheng, Yin and Xu, Yintao and Lu, Yanxiong and Lin, Leyu and Zhao, Peilin and Huang, Junzhou and Gao, Shenghua},
booktitle={International Conference on Learning Representations (ICLR)},
year={2020}
}