An Analysis of Action Recognition Datasets for Language and Vision Tasks

Spandana Gella, Frank Keller

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

A large amount of recent research has focused on tasks that combine language
and vision, resulting in a proliferation of datasets and methods. One such task
is action recognition, whose applications include image annotation, scene understanding and image retrieval. In this survey, we categorize the existing approaches based on how they conceptualize this problem and provide a detailed review of existing datasets, highlighting their diversity as well as advantages and disadvantages. We focus on recently developed datasets which link visual information with linguistic resources and provide a fine-grained syntactic and semantic analysis of actions in images.
Original languageEnglish
Title of host publicationProceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Short Papers)
PublisherAssociation for Computational Linguistics
Pages64-71
Number of pages8
ISBN (Print)978-1-945626-76-0
DOIs
Publication statusPublished - 4 Aug 2017
Event55th annual meeting of the Association for Computational Linguistics (ACL) - Vancouver, Canada
Duration: 30 Jul 20174 Aug 2017
http://acl2017.org/

Conference

Conference55th annual meeting of the Association for Computational Linguistics (ACL)
Abbreviated titleACL 2017
Country/TerritoryCanada
CityVancouver
Period30/07/174/08/17
Internet address

Fingerprint

Dive into the research topics of 'An Analysis of Action Recognition Datasets for Language and Vision Tasks'. Together they form a unique fingerprint.

Cite this