Cross-lingual Pronoun Prediction for English, French and German with Maximum Entropy Classification

Dominikus Wetzel

Research output: Chapter in Book/Report/Conference proceedingConference contribution


We present our submission to the crosslingual pronoun prediction (CLPP) shared task for English-German and EnglishFrench at the First Conference on Machine Translation (WMT16). We trained a Maximum Entropy (MaxEnt) classifier based on features from Wetzel et al. (2015),that we ada pted to the new task and applied to a new language pair. Additional features such as n-grams of the pronoun context and prediction of NULLtranslations proved helpful to a varying degree. Experiments with a sequence classifier over pronoun sequences did not show any improvements. Our submission is among the top three systems for English-French (61.62% macro-averaged recall) and in the middle range for EnglishGerman (48.72%) out of nine submissions.
Original languageEnglish
Title of host publicationProceedings of the First Conference on Machine Translation, Volume 2: Shared Task Papers
Place of PublicationBerlin, Germany
PublisherAssociation for Computational Linguistics
Number of pages7
ISBN (Print)978-1-945626-10-4
Publication statusPublished - 12 Aug 2016
Externally publishedYes
EventFirst Conference on Machine Translation - Berlin, Germany
Duration: 11 Aug 201612 Aug 2016


ConferenceFirst Conference on Machine Translation
Abbreviated titleWMT16
Internet address

Fingerprint Dive into the research topics of 'Cross-lingual Pronoun Prediction for English, French and German with Maximum Entropy Classification'. Together they form a unique fingerprint.

Cite this