Word-Level Emotion Recognition Using High-Level Features

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper, we investigate the use of high-level features for recognizing human emotions at the word-level in natural conversations with virtual agents. Experiments were carried out on the 2012 Audio/Visual Emotion Challenge (AVEC2012) database, where emotions are defined as vectors in the Arousal-Expectancy-Power-Valence emotional space. Our model using 6 novel disfluency features yields significant improvements compared to those using large number of low-level spectral and prosodic features, and the overall performance difference between it and the best model of the AVEC2012 Word-Level Sub-Challenge is not significant. Our visual model using the Active Shape Model visual features also yields significant improvements compared to models using the low-level Local Binary Patterns visual features. We built a bimodal model By combining our disfluency and visual feature sets and applying Correlation-based Feature-subset Selection. Considering overall performance on all emotion dimensions, our bimodal model outperforms the second best model of the challenge, and comes close to the best model. It also gives the best result when predicting Expectancy values.
Original languageEnglish
Title of host publicationComputational Linguistics and Intelligent Text Processing
Subtitle of host publication15th International Conference, CICLing 2014, Kathmandu, Nepal, April 6-12, 2014, Proceedings, Part II
EditorsAlexander Gelbukh
PublisherSpringer
Pages17-31
Number of pages15
ISBN (Electronic)978-3-642-54903-8
ISBN (Print)978-3-642-54902-1
DOIs
Publication statusPublished - 2014

Publication series

NameLecture Notes in Computer Science
PublisherSpringer Berlin Heidelberg
Volume8404
ISSN (Print)0302-9743

Fingerprint

Dive into the research topics of 'Word-Level Emotion Recognition Using High-Level Features'. Together they form a unique fingerprint.

Cite this