Abstract
Weakly supervised learning of object detection is an important problem in image understanding that still does not have a satisfactory solution. In this paper, we address this problem by exploiting the power of deep convolutional neural networks pre-trained on large-scale image-level classification tasks. We propose a weakly supervised deep detection architecture that modifies one such network to operate at the level of image regions, performing simultaneously region selection and classification. Trained as an image classifier, the architecture implicitly learns object detectors that are better than alternative weakly supervised detection systems on the PASCAL VOC data. The model, which is a simple and elegant end-to-end architecture, outperforms standard data augmentation and fine-tuning techniques for the task of image-level classification as well.
| Original language | English |
|---|---|
| Title of host publication | 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) |
| Publisher | Institute of Electrical and Electronics Engineers |
| Pages | 2846-2854 |
| Number of pages | 9 |
| ISBN (Electronic) | 978-1-4673-8851-1 |
| ISBN (Print) | 978-1-4673-8852-8 |
| DOIs | |
| Publication status | Published - 12 Dec 2016 |
| Event | 29th IEEE Conference on Computer Vision and Pattern Recognition - Las Vegas, United States Duration: 26 Jun 2016 → 1 Jul 2016 http://cvpr2016.thecvf.com/ |
Publication series
| Name | |
|---|---|
| Publisher | IEEE |
| ISSN (Electronic) | 1063-6919 |
Conference
| Conference | 29th IEEE Conference on Computer Vision and Pattern Recognition |
|---|---|
| Abbreviated title | CVPR 2016 |
| Country/Territory | United States |
| City | Las Vegas |
| Period | 26/06/16 → 1/07/16 |
| Internet address |
Keywords / Materials (for Non-textual outputs)
- Computer Vision
- Deep Learning
- object detection
- weakly supervised learning
Fingerprint
Dive into the research topics of 'Weakly Supervised Deep Detection Networks'. Together they form a unique fingerprint.Profiles
-
Hakan Bilen
- School of Informatics - Reader
- Institute of Perception, Action and Behaviour
- Language, Interaction, and Robotics
Person: Academic: Research Active