Abstract
Key to effective person re-identification (Re-ID) is modelling discriminative and view-invariant factors of person appearance at both high and low semantic levels. Recently developed deep Re-ID models either learn a holistic single semantic level feature representation and/or require laborious human annotation of these factors as attributes. We propose Multi-Level Factorisation Net (MLFN), a novel network architecture that factorises the visual appearance of a person into latent discriminative factors at multiple semantic levels without manual annotation. MLFN is composed of multiple stacked blocks. Each block contains multiple factor modules to model latent factors at a specific level, and factor selection modules that dynamically select the factor modules to interpret the content of each input image. The outputs of the factor selection modules also provide a compact latent factor descriptor that is complementary to the conventional deeply learned features. MLFN achieves state-of-the-art results on three Re-ID datasets, as well as compelling results on the general object categorisation CIFAR-100 dataset.
Original language | English |
---|---|
Title of host publication | Computer Vision and Pattern Recognition 2018 |
Publisher | Institute of Electrical and Electronics Engineers |
Number of pages | 10 |
DOIs | |
Publication status | Published - 17 Dec 2018 |
Event | Computer Vision and Pattern Recognition 2018 - Salt Lake City, United States Duration: 18 Jun 2018 → 22 Jun 2018 http://cvpr2018.thecvf.com/ http://cvpr2018.thecvf.com/ http://cvpr2018.thecvf.com/ |
Publication series
Name | |
---|---|
Publisher | IEEE |
ISSN (Print) | 1063-6919 |
ISSN (Electronic) | 2575-7075 |
Conference
Conference | Computer Vision and Pattern Recognition 2018 |
---|---|
Abbreviated title | CVPR 2018 |
Country/Territory | United States |
City | Salt Lake City |
Period | 18/06/18 → 22/06/18 |
Internet address |