Automatic Image Annotation Using Auxiliary Text Information

Yansong Feng, Mirella Lapata

Research output: Chapter in Book/Report/Conference proceeding (Conference contribution)


The availability of databases of images labelled with keywords is necessary for developing and evaluating image annotation models. Dataset collection is, however, a costly and time-consuming task. In this paper we exploit the vast resource of images available on the web. We create a database of pictures that are naturally embedded in news articles and propose to use their captions as a proxy for annotation keywords. Experimental results show that an image annotation model can be developed on this dataset alone, without the overhead of manual annotation. We also demonstrate that the news article associated with the picture can be used to boost image annotation performance.
Original language: English
Title of host publication: ACL 2008, Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics, June 15-20, 2008, Columbus, Ohio, USA
Publisher: Association for Computational Linguistics
Number of pages: 9
Publication status: Published - 2008
