Abstract
This paper describes the evaluation methodology and results of the 2001 DARPA Communicator evaluation. The experiment spanned 6 months of 2001 and involved eight DARPA Communicator systems in the travel planning domain. It resulted in a corpus of 1242 dialogs which include many more dialogues for complex tasks than the 2000 evaluation. We describe the experimental design, the approach to data collection, and the results. We compare the results by the type of travel plan and by system. The results demonstrate some large differences across sites and show that the complex trips are clearly more difficult.
Original language | English |
---|---|
Title of host publication | Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002) |
Publisher | ISCA |
Pages | 269-272 |
DOIs | |
Publication status | Published - 2002 |
Event | 7th International Conference on Spoken Language Processing (Interspeech 2002) - Denver, CO, United States Duration: 16 Sept 2002 → 20 Sept 2002 |
Conference
Conference | 7th International Conference on Spoken Language Processing (Interspeech 2002) |
---|---|
Country/Territory | United States |
City | Denver, CO |
Period | 16/09/02 → 20/09/02 |