Abstract / Description of output
A large amount of recent research has focused on tasks that combine language
and vision, resulting in a proliferation of datasets and methods. One such task
is action recognition, whose applications include image annotation, scene understanding and image retrieval. In this survey, we categorize the existing approaches based on how they conceptualize this problem and provide a detailed review of existing datasets, highlighting their diversity as well as advantages and disadvantages. We focus on recently developed datasets which link visual information with linguistic resources and provide a fine-grained syntactic and semantic analysis of actions in images.
and vision, resulting in a proliferation of datasets and methods. One such task
is action recognition, whose applications include image annotation, scene understanding and image retrieval. In this survey, we categorize the existing approaches based on how they conceptualize this problem and provide a detailed review of existing datasets, highlighting their diversity as well as advantages and disadvantages. We focus on recently developed datasets which link visual information with linguistic resources and provide a fine-grained syntactic and semantic analysis of actions in images.
Original language | English |
---|---|
Title of host publication | Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Short Papers) |
Publisher | Association for Computational Linguistics |
Pages | 64-71 |
Number of pages | 8 |
ISBN (Print) | 978-1-945626-76-0 |
DOIs | |
Publication status | Published - 4 Aug 2017 |
Event | 55th annual meeting of the Association for Computational Linguistics (ACL) - Vancouver, Canada Duration: 30 Jul 2017 → 4 Aug 2017 http://acl2017.org/ |
Conference
Conference | 55th annual meeting of the Association for Computational Linguistics (ACL) |
---|---|
Abbreviated title | ACL 2017 |
Country/Territory | Canada |
City | Vancouver |
Period | 30/07/17 → 4/08/17 |
Internet address |