TY - CHAP
T1 - A speech-to-speech, machine translation mediated map task
T2 - 1st International Workshop on Future and Emergent Trends in Language Technology, FETLT 2015
AU - Cerrato, Loredana
AU - Akira, Hayakawa
AU - Campbell, Nick
AU - Luz, Saturnino
PY - 2016/2/26
Y1 - 2016/2/26
N2 - The aim of this study is to investigate how the language technologies of Automatic Speech Recognition (ASR), Machine Translation (MT), and Text To Speech (TTS) synthesis affect users during an interlingual interaction. In this paper, we describe the prototype system used for the data collection, we give details of the collected data and report the results of a usability test run to assess how the users of the interlingual system evaluate the interactions in a collaborative map task. We use widely adopted usability evaluation measures: ease of use, effectiveness and users satisfaction, and look at both qualitative and quantitative measures. Results indicate that both users taking part in the dialogues (instructions giver and follower) found the system similarly satisfactory in terms of ease of learning, ease of use, and pleasantness, even if they were less satisfied with its effectiveness in supporting the task. Users employed different strategies in order to adapt to the shortcomings of the technology, such as hyper-articulation, and rewording of utterances in relation to error of the ASR. We also report the results of a comparison of the map task in two different settings – one that includes a constant video stream (“video-on”) and one that does not (“no-video.”) Surprisingly, users rated the no-video setting consistently better.
AB - The aim of this study is to investigate how the language technologies of Automatic Speech Recognition (ASR), Machine Translation (MT), and Text To Speech (TTS) synthesis affect users during an interlingual interaction. In this paper, we describe the prototype system used for the data collection, we give details of the collected data and report the results of a usability test run to assess how the users of the interlingual system evaluate the interactions in a collaborative map task. We use widely adopted usability evaluation measures: ease of use, effectiveness and users satisfaction, and look at both qualitative and quantitative measures. Results indicate that both users taking part in the dialogues (instructions giver and follower) found the system similarly satisfactory in terms of ease of learning, ease of use, and pleasantness, even if they were less satisfied with its effectiveness in supporting the task. Users employed different strategies in order to adapt to the shortcomings of the technology, such as hyper-articulation, and rewording of utterances in relation to error of the ASR. We also report the results of a comparison of the map task in two different settings – one that includes a constant video stream (“video-on”) and one that does not (“no-video.”) Surprisingly, users rated the no-video setting consistently better.
KW - Adaptation
KW - Interlingual speech-to-speech translation
KW - Repair strategies
KW - Speaker alignment
KW - User evaluation
UR - http://www.scopus.com/inward/record.url?scp=84964906749&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-33500-1_5
DO - 10.1007/978-3-319-33500-1_5
M3 - Chapter (peer-reviewed)
AN - SCOPUS:84964906749
SN - 978-3-319-33499-8
VL - 9577
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 53
EP - 64
BT - Future and Emergent Trends in Language Technology
PB - Springer
Y2 - 19 November 2015 through 20 November 2015
ER -