Fine-Grained Image Analysis with Deep Learning: A Survey

Xiu-Shen Wei, Yi-Zhe Song, Oisin Mac Aodha, Jianxin Wu, Yuxin Peng, Jinhui Tang, Jian Yang, Serge Belongie

Research output: Contribution to journalArticlepeer-review

Abstract

Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer vision and pattern recognition, and underpins a diverse set of real-world applications. The task of FGIA targets analyzing visual objects from subordinate categories, e.g., species of birds or models of cars. The small inter-class and large intra-class variation inherent to fine-grained image analysis makes it a challenging problem. Capitalizing on advances in deep learning, in recent years we have witnessed remarkable progress in deep learning powered FGIA. In this paper we present a systematic survey of these advances, where we attempt to re-define and broaden the field of FGIA by consolidating two fundamental fine-grained research areas – fine grained image recognition and fine-grained image retrieval. In addition, we also review other key issues of FGIA, such as publicly available benchmark datasets and related domain-specific applications. We conclude by highlighting several research directions and open problems which need further exploration from the community.
Original languageEnglish
Pages (from-to)8927-8948
Number of pages22
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Volume44
Issue number12
Early online date9 Nov 2021
DOIs
Publication statusPublished - 1 Dec 2022

Keywords / Materials (for Non-textual outputs)

  • fine-grained image analysis
  • deep learning
  • fine-grained image recognition
  • fine-grained image retrieval

Fingerprint

Dive into the research topics of 'Fine-Grained Image Analysis with Deep Learning: A Survey'. Together they form a unique fingerprint.

Cite this