Multi-Task Zero-Shot Action Recognition with Prioritised Data Augmentation

Xu Xun, Timothy Hospedales, Shaogang Gong

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

Zero-Shot Learning (ZSL) promises to scale visual recognition by bypassing the conventional model training requirement of annotated examples for every category. This is achieved by establishing a mapping connecting low-level features and a semantic description of the label space, referred as visual-semantic mapping, on auxiliary data. Re-using the learned mapping to project target videos into an embedding space thus allows novel-classes to be recognised by nearest neighbour inference. However, existing ZSL methods suffer from auxiliary-target domain shift intrinsically induced by assuming the same mapping for the disjoint auxiliary and target classes. This compromises the generalisation accuracy of ZSL recognition on the target data. In this work, we improve the ability of ZSL to generalise across this domain shift in both model- and data-centric ways by formulating a visual-semantic mapping with better generalisation properties and a dynamic data re-weighting method to prioritise auxiliary data that are relevant to the target classes. Specifically: (1) We introduce a multi-task visual-semantic mapping to improve generalisation by constraining the semantic mapping parameters to lie on a low-dimensional manifold, (2) We explore prioritised data augmentation by expanding the pool of auxiliary data with additional instances weighted by relevance to the target domain. The proposed new model is applied to the challenging zero-shot action recognition problem to demonstrate its advantages over existing ZSL models.
Original languageEnglish
Title of host publicationComputer Vision – ECCV 2016
Subtitle of host publication14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part II
PublisherSpringer
Pages343-359
Number of pages17
ISBN (Electronic)978-3-319-46475-6
ISBN (Print)978-3-319-46474-9
DOIs
Publication statusPublished - 17 Sept 2016
Event14th European Conference on Computer Vision 2016 - Amsterdam, Netherlands
Duration: 8 Oct 201616 Oct 2016
http://www.eccv2016.org/

Publication series

NameLecture Notes in Computer Science
PublisherSpringer International Publishing
Volume9906
ISSN (Print)0302-9743

Conference

Conference14th European Conference on Computer Vision 2016
Abbreviated titleECCV 2016
Country/TerritoryNetherlands
CityAmsterdam
Period8/10/1616/10/16
Internet address

Fingerprint

Dive into the research topics of 'Multi-Task Zero-Shot Action Recognition with Prioritised Data Augmentation'. Together they form a unique fingerprint.

Cite this