TY - GEN
T1 - Combining Image-Level and Segment-Level Models for Automatic Annotation
AU - Kuettel, Daniel
AU - Guillaumin, Matthieu
AU - Ferrari, Vittorio
PY - 2012
Y1 - 2012
N2 - For the task of assigning labels to an image to summarize its contents, many early attempts use segment-level information and try to determine which parts of the images correspond to which labels. Best performing methods use global image similarity and nearest neighbor techniques to transfer labels from training images to test images. However, global methods cannot localize the labels in the images, unlike segment-level methods. Also, they cannot take advantage of training images that are only locally similar to a test image. We propose several ways to combine recent image-level and segment-level techniques to predict both image and segment labels jointly. We cast our experimental study in an unified framework for both image-level and segment-level annotation tasks. On three challenging datasets, our joint prediction of image and segment labels outperforms either prediction alone on both tasks. This confirms that the two levels offer complementary information.
AB - For the task of assigning labels to an image to summarize its contents, many early attempts use segment-level information and try to determine which parts of the images correspond to which labels. Best performing methods use global image similarity and nearest neighbor techniques to transfer labels from training images to test images. However, global methods cannot localize the labels in the images, unlike segment-level methods. Also, they cannot take advantage of training images that are only locally similar to a test image. We propose several ways to combine recent image-level and segment-level techniques to predict both image and segment labels jointly. We cast our experimental study in an unified framework for both image-level and segment-level annotation tasks. On three challenging datasets, our joint prediction of image and segment labels outperforms either prediction alone on both tasks. This confirms that the two levels offer complementary information.
KW - image auto-annotation
KW - image region labelling
KW - keyword-based image retrieval
U2 - 10.1007/978-3-642-27355-1_5
DO - 10.1007/978-3-642-27355-1_5
M3 - Conference contribution
SN - 978-3-642-27354-4
T3 - Lecture Notes in Computer Science
SP - 16
EP - 28
BT - Advances in Multimedia Modeling
A2 - Schoeffmann, Klaus
A2 - Merialdo, Bernard
A2 - Hauptmann, AlexanderG.
A2 - Ngo, Chong-Wah
A2 - Andreopoulos, Yiannis
A2 - Breiteneder, Christian
PB - Springer Berlin Heidelberg
ER -