Image and Natural Language Processing for Multimedia Information Retrieval

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Image annotation, the task of automatically generating description words for a picture, is a key component in various image search and retrieval applications. Creating image databases for model development is, however, costly and time consuming, since the keywords must be hand-coded and the process repeated for new collections. In this work we exploit the vast resource of images and documents available on the web for developing image annotation models without any human involvement. We describe a probabilistic framework based on the assumption that images and their co-occurring textual data are generated by mixtures of latent topics. Applications of this framework to image annotation and retrieval show performance gains over previously proposed approaches, despite the noisy nature of our dataset. We also discuss how the proposed model can be used for story picturing, i.e., to find images that appropriately illustrate a text and demonstrate its utility when interfaced with an image caption generator.

Original languageEnglish
Title of host publicationADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS
EditorsC Gurrin, Y He, G Kazai, U Kruschwitz, S Little, T Roelleke, S Ruger, K VanRijsbergen
Place of PublicationBERLIN
PublisherSpringer
Pages12-12
Number of pages1
ISBN (Print)978-3-642-12274-3
Publication statusPublished - 2010
Event32nd European Conference on Information Retrieval Research - Milton Keynes
Duration: 28 Mar 201031 Mar 2010

Conference

Conference32nd European Conference on Information Retrieval Research
CityMilton Keynes
Period28/03/1031/03/10

Fingerprint

Dive into the research topics of 'Image and Natural Language Processing for Multimedia Information Retrieval'. Together they form a unique fingerprint.

Cite this