An Evaluation of Image-based Verb Prediction Models against Human Eye-tracking Data

Spandana Gella, Frank Keller

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Recent research in language and vision has developed models for predicting and disambiguating verbs from images. Here, we ask whether the predictions made by such models correspond to human intuitions about visual verbs. We show that the image regions a verb prediction model identifies as salient for a given verb correlate with the regions fixated by human observers performing a verb classification task.
Original language: English
Title of host publication: The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Place of Publication: New Orleans, Louisiana
Publisher: Association for Computational Linguistics
Pages: 758-763
Number of pages: 6
Publication status: Published - 30 Jun 2018
Event: 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Hyatt Regency New Orleans Hotel, New Orleans, United States
Duration: 1 Jun 2018 to 6 Jun 2018
http://naacl2018.org/

Conference

Conference: 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Abbreviated title: NAACL HLT 2018
Country/Territory: United States
City: New Orleans
Period: 1/06/18 to 6/06/18
