Projects per year
Abstract / Description of output
The field of few-shot learning has recently seen substantial advancements. Most of these advancements came from casting few-shot learning as a meta-learning problem. Model Agnostic Meta Learning or MAML is currently one of the best approaches for few-shot learning via meta-learning. MAML is simple, elegant and very powerful, however, it has a variety of issues, such as being very sensitive to neural network
architectures, often leading to instability during training, requiring arduous hyperparameter searches to stabilize training and achieve high generalization and being very computationally expensive at both training and inference times. In this paper, we propose various modifications to MAML that not only stabilize the system, but also substantially improve the generalization performance, convergence speed and computational overhead of MAML, which we call MAML++.
architectures, often leading to instability during training, requiring arduous hyperparameter searches to stabilize training and achieve high generalization and being very computationally expensive at both training and inference times. In this paper, we propose various modifications to MAML that not only stabilize the system, but also substantially improve the generalization performance, convergence speed and computational overhead of MAML, which we call MAML++.
Original language | English |
---|---|
Number of pages | 11 |
Publication status | Published - 2019 |
Event | Seventh International Conference on Learning Representations - New Orleans, United States Duration: 6 May 2019 → 9 May 2019 https://iclr.cc/ |
Conference
Conference | Seventh International Conference on Learning Representations |
---|---|
Abbreviated title | ICLR 2019 |
Country/Territory | United States |
City | New Orleans |
Period | 6/05/19 → 9/05/19 |
Internet address |
Fingerprint
Dive into the research topics of 'How to train your MAML'. Together they form a unique fingerprint.Projects
- 1 Finished
-
Bonseyes - Platform for Open Development of Systems of Artificial Intelligence
1/12/16 → 31/01/20
Project: Research