Projects per year
Abstract
We introduce the Multi30K dataset to stimulate multilingual multimodal research. Recent advances in image description have been demonstrated on English language datasets almost exclusively, but image description should not be limited to English. This dataset extends the Flickr30K dataset with i) German translations created by professional translators over a subset of the English descriptions, and ii) German descriptions crowd sourced independently of the original English descriptions. We describe the data and outline how it can be used for multilingual image description and multimodal machine translation, but we anticipate the data will be useful for a broader range of tasks.
Original language | English |
---|---|
Title of host publication | Proceedings of the 5th Workshop on Vision and Language, hosted by the 54th Annual Meeting of the Association for Computational Linguistics, VL@ACL 2016, August 12, Berlin, Germany |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 70-74 |
Number of pages | 5 |
DOIs | |
Publication status | Published - 12 Aug 2016 |
Event | 5th Workshop on Vision and Language - Berlin, Germany Duration: 12 Aug 2016 → 12 Aug 2016 https://vision.cs.hacettepe.edu.tr/vl2016/ |
Conference
Conference | 5th Workshop on Vision and Language |
---|---|
Abbreviated title | VL 2016 |
Country/Territory | Germany |
City | Berlin |
Period | 12/08/16 → 12/08/16 |
Internet address |
Projects
- 1 Finished