Automatic evaluation: Using a date dialogue act tagger for user satisfaction and task completion prediction

Helen Wright Hastie*, Rashmi Prasad, Marilyn Walker

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The objective of the DARPA Communicator project is to support rapid, cost-effective development of multi-modal speech-enabled dialogue systems with advanced conversational capabilities. During the course of the Communicator program, we have been involved in developing methods for measuring progress towards the program goals and assessing advances in the component technologies required to achieve such goals. Our goal has been to develop a lightweight evaluation paradigm for heterogeneous systems. In this paper, we utilize the Communicator evaluation corpus from 2001 and build on previous work applying the PARADISE evaluation framework to establish a baseline for fully automatic system evaluation. We train a regression tree to predict User Satisfaction using a random 80 of the dialogues for training. The metrics (features) we use for prediction are a fully automatic Task Success Measure, Efficiency Measures, and System Dialogue Act Behaviors extracted from the dialogue logfiles using the DATE (Dialogue Act Tagging for Evaluation) tagging scheme. The learned tree with the DATE metrics has a correlation of 0.614 (R2 of 0.376) with the actual user satisfaction values for the held out test set, while the learned tree without the DATE metrics has a correlation of 0.595 (R2 of 0.35)
Original languageEnglish
Title of host publicationProceedings of the Third International Conference on Language Resources and Evaluation (LREC '02)
PublisherEuropean Language Resources Association (ELRA)
Pages641-648
Number of pages8
Publication statusPublished - 2002
Event3rd International Conference on Language Resources and Evaluation, LREC 2002 - Las Palmas, Spain
Duration: 29 May 200231 May 2002
Conference number: 3
http://www.lrec-conf.org/lrec2002/

Conference

Conference3rd International Conference on Language Resources and Evaluation, LREC 2002
Abbreviated titleLREC 2002
Country/TerritorySpain
CityLas Palmas
Period29/05/0231/05/02
Internet address

Fingerprint

Dive into the research topics of 'Automatic evaluation: Using a date dialogue act tagger for user satisfaction and task completion prediction'. Together they form a unique fingerprint.

Cite this