A software toolkit for sharing and accessing corpora over the internet

Saturnino Luz*

*Corresponding author for this work

Research output: Contribution to conferencePaperpeer-review

Abstract

This paper describes the Translational English Corpus (TEC) and the software tools developed in order to enable the use of the corpus remotely, over the internet. The model underlying these tools is based on an extensible client-server architecture implemented in Java. We discuss the data and processing constraints which motivated the TEC architecture design and its impact on the efficiency and scalability of the system. We also suggest that the kind of distributed processing model adopted in TEC could play a role in fostering the availability of corpus linguistic resources to the research community.

Original languageEnglish
Publication statusPublished - Aug 2000
Event2nd International Conference on Language Resources and Evaluation, LREC 2000 - Athens, Greece
Duration: 31 May 20002 Jun 2000

Conference

Conference2nd International Conference on Language Resources and Evaluation, LREC 2000
CountryGreece
CityAthens
Period31/05/002/06/00

Fingerprint

Dive into the research topics of 'A software toolkit for sharing and accessing corpora over the internet'. Together they form a unique fingerprint.

Cite this