Querying Linguistic Trees

Catherine Lai, Steven Bird

Research output: Contribution to journalArticlepeer-review


Large databases of linguistic annotations are used for testing linguistic hypotheses and for training language processing models. These linguistic annotations are often syntactic or prosodic in nature, and have a hierarchical structure. Query languages are used to select particular structures of interest, or to project out large slices of a corpus for external analysis. Existing languages suffer from a variety of problems in the areas of expressiveness, efficiency, and naturalness for linguistic query. We describe the domain of linguistic trees and discuss the expressive requirements for a query language. Then we present a language that can express a wide range of queries over these trees, and show that the language is first-order complete over trees.
Original languageEnglish
Pages (from-to)53-73
Number of pages21
JournalJournal of Logic, Language and Information
Issue number1
Publication statusPublished - 2010


Dive into the research topics of 'Querying Linguistic Trees'. Together they form a unique fingerprint.

Cite this