Abstract
Language technology makes extensive use of hierarchically annotated text and speech data. These databases are stored in flat files and manipulated using corpus-specific query tools or special-purpose scripts. While the size of these databases and the range of applications has grown rapidly in recent years, neither method for managing the data has led to reusable, scalable software. The formal properties of the query languages are not well understood. Hence established methods for indexing tree data and optimizing tree queries cannot be employed. We analyze a range of existing linguistic query languages, and adduce a set of requirements for a reusable, scalable linguistic query language.
Original language | English |
---|---|
Title of host publication | In Proceedings of the Australasian Language Technology Workshop |
Place of Publication | Sydney, Australia |
Publisher | Australian Speech Science & Technology Association Inc |
Pages | 139-146 |
Number of pages | 8 |
Volume | 2 |
ISBN (Electronic) | 0 9581946 1 0 |
Publication status | Published - 2004 |
Event | Australasian Language Technology Workshop 2004 - Macquarie University, Sydney, Australia Duration: 8 Dec 2004 → 8 Dec 2004 |
Conference
Conference | Australasian Language Technology Workshop 2004 |
---|---|
Country/Territory | Australia |
City | Sydney |
Period | 8/12/04 → 8/12/04 |