The aim of this paper is to investigate whether a treebank grammar can be used to automatically classify and annotate German phrases contained in an MT lexicon. Phrases from the lexicon appear in their citation form and may differ structurally from the phrase tokens found in the corpus. We describe the grammar extraction process for a formalism called Tree-Generating Binary Grammar and evaluate the performance of subsets of the obtained grammar on a set of four types of lexical phrases.
|Title of host publication||Proceedings of 3rd Language and Technology Conference|
|Editors||Zygmunt Vetulani, Paweł Konieczka, Jacek Marciniak, Tomasz Obrębski, Przemysław Rzepecki, Justyna Walkowska|
|Place of Publication||Poznań, Poland|
|Publisher||Wydawnictwo Poznańskie Sp. z o. o.|
|Publication status||Published - 2007|