Abstract
Few-shot learning (FSL) is an important and topical problem in computer vision that has motivated extensive research into numerous methods spanning from sophisticated metalearning methods to simple transfer learning baselines. We seek to push the limits of a simple-but-effective pipeline for real-world few-shot image classification in practice. To this end, we explore few-shot learning from the perspective of neural architecture, as well as a three stage pipeline of pre-training on external data, meta-training with labelled few-shot tasks, and task-specific fine-tuning on unseen tasks. We investigate questions such as: (1) How pre-training on external data benefits FSL? (2) How state of the art transformer architectures can be exploited? and (3) How to best exploit fine-tuning? Ultimately, we show that a simple transformer-based pipeline yields surprisingly good performance on standard benchmarks such as Mini-ImageNet, CIFAR-FS, CDFSL and Meta-Dataset. Our code is available at https://hushell.github.io/pmf.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 9058-9067 |
Number of pages | 10 |
ISBN (Electronic) | 978-1-6654-6946-3 |
ISBN (Print) | 978-1-6654-6947-0 |
DOIs | |
Publication status | Published - 27 Sep 2022 |
Event | IEEE/CVF Conference on Computer Vision and Pattern Recognition 2022 - New Orleans, United States Duration: 19 Jun 2022 → 24 Jun 2022 https://cvpr2022.thecvf.com/ |
Publication series
Name | IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) |
---|---|
Publisher | IEEE |
ISSN (Print) | 1063-6919 |
ISSN (Electronic) | 2575-7075 |
Conference
Conference | IEEE/CVF Conference on Computer Vision and Pattern Recognition 2022 |
---|---|
Abbreviated title | CVPR 2022 |
Country/Territory | United States |
City | New Orleans |
Period | 19/06/22 → 24/06/22 |
Internet address |