Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?
Boris Knyazev, Doha Hwang, Simon Lacoste-Julien
https://arxiv.org/abs/2303.04143
See the list of pretrained GHN models at huggingface.co/SamsungSAILMontreal/ghn3.
See code examples at github.com/SamsungSAILMontreal/ghn3.
The model is trained on the bknyaz/deepnets1m dataset of neural architectures.