Isomorphic Transfer of Syntactic Structures in Cross-Lingual NLP

Edoardo Maria Ponti, Roi Reichart, Anna Korhonen, Ivan Vulić

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

The transfer or share of knowledge between languages is a potential solution to resource scarcity in NLP. However, the effectiveness of cross-lingual transfer can be challenged by variation in syntactic structures. Frameworks such as Universal Dependencies (UD) are designed to be cross-lingually consistent, but even in carefully designed resources trees representing equivalent sentences may not always overlap. In this paper, we measure cross-lingual syntactic variation, or anisomorphism, in the UD treebank collection, considering both morphological and structural properties. We show that reducing the level of anisomorphism yields consistent gains in cross-lingual transfer tasks. We introduce a source language selection procedure that facilitates effective cross-lingual parser transfer, and propose a typologically driven method for syntactic tree processing which reduces anisomorphism. Our results show the effectiveness of this method for both machine translation and cross-lingual sentence similarity, demonstrating the importance of syntactic structure compatibility for boosting cross-lingual transfer in NLP.
Original languageEnglish
Title of host publicationProceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
EditorsIryna Gurevych, Yusuke Miyao
Place of PublicationStroudsburg, PA, USA
PublisherAssociation for Computational Linguistics
Pages1531-1542
Number of pages12
Volume1
ISBN (Electronic)978-1-948087-32-2
DOIs
Publication statusPublished - 1 Jul 2018
Event56th Annual Meeting of the Association for Computational Linguistics - Melbourne Convention and Exhibition Centre, Melbourne, Australia
Duration: 15 Jul 201820 Jul 2018
http://acl2018.org/

Conference

Conference56th Annual Meeting of the Association for Computational Linguistics
Abbreviated titleACL 2018
Country/TerritoryAustralia
CityMelbourne
Period15/07/1820/07/18
Internet address

Fingerprint

Dive into the research topics of 'Isomorphic Transfer of Syntactic Structures in Cross-Lingual NLP'. Together they form a unique fingerprint.

Cite this